The Ecommerce Data Playbook: Connecting Shopify, Ad Platforms, and Fulfillment into One View

The Ecommerce Data Playbook: Connecting Shopify, Ad Platforms, and Fulfillment into One View

Table of Contents

What is an ecommerce data playbook?

An ecommerce data playbook is the operating blueprint for turning disconnected commerce, marketing, fulfillment, and finance data into one view the business can actually use to make decisions.

That means more than “pipe Shopify into the warehouse.”

It means deciding:

  • which systems matter
  • which fields matter
  • which joins are real versus convenient
  • which metrics need to be gross, net, directional, or decision-grade
  • which outputs the team will actually use once the work is done

A lot of ecommerce teams already have plenty of data.

What they do not have is one view that explains revenue, acquisition cost, return behavior, and margin in the same sentence.

That is why the dashboards end up fighting.

Why ecommerce reporting breaks so easily

Ecommerce is one of the easiest environments to over-report and under-explain.

The stack usually spans multiple systems with different incentives:

  • Shopify wants to show the commerce story
  • Meta, Google, and other ad platforms want to show the conversion story
  • Klaviyo or lifecycle tools want to show the retention story
  • 3PLs and fulfillment tools want to show the operations story
  • finance needs the profit story

None of those systems are wrong.

They are just answering different questions.

The problem starts when a leadership team expects all of them to produce the same number by accident.

If you have already felt this tension, you have seen the symptoms:

  • revenue looks healthy but margin feels thin
  • CAC looks acceptable until returns and discounts enter the conversation
  • paid channels claim more credit than total orders justify
  • product “winners” create more operational pain than profit
  • nobody can answer whether the business is scaling efficiently or just getting louder

That is the exact gap the playbook is supposed to close.

Which systems should be in the first version?

Not every ecommerce stack needs a giant integration project.

Most first-pass ecommerce playbooks should start with five source groups:

Source groupTypical systemsWhat it should answer
CommerceShopify, BigCommerce, WooCommerce, custom storefrontWhat was ordered, by whom, at what price, with what discounts?
Marketing spendMeta Ads, Google Ads, TikTok, LinkedIn, affiliatesWhat did we spend, on which campaigns, for which audiences?
Lifecycle / CRMKlaviyo, HubSpot, Attentive, customer profilesWhich customers are new, repeat, engaged, or retained?
Fulfillment / returns3PL, ShipStation, Loop, return portal, WMSWhat did shipping, fulfillment, and returns do to the economics?
Finance / warehouse logicERP, accounting system, modeled warehouse tablesWhat survives the jump from gross revenue to usable profit reporting?

If one of those groups is missing, the business almost always loses part of the picture.

What data should you actually pull?

This is where teams often waste time.

The goal is not to ingest every exportable field. The goal is to ingest the fields that let you explain the business.

What to pull from Shopify or the commerce platform

At minimum, most teams need:

  • order ID
  • order timestamp
  • customer ID / email / identity keys
  • product / SKU / variant data
  • gross sales value
  • discount value
  • taxes and shipping treatment
  • refund amounts
  • first-order versus repeat-order flags
  • subscription versus one-time purchase logic, if relevant

What to pull from ad platforms

At minimum:

  • campaign / ad set / channel metadata
  • spend
  • clicks / impressions where useful
  • attributed conversions and attributed revenue
  • date grain that can be reconciled to warehouse logic
  • account identifiers and platform-specific campaign IDs

What to pull from lifecycle tools

Useful fields usually include:

  • customer status and engagement segments
  • campaign sends / opens / clicks if email performance matters to the decision
  • flows or audience membership if retention or win-back plays matter
  • unsubscribes or suppression logic when lifecycle pressure affects conversion quality

What to pull from fulfillment and returns systems

Do not skip this if profitability is the actual question.

At minimum:

  • shipment-level costs when available
  • delivery status
  • return created date and processed date
  • returned units or returned order value
  • reason codes when they change merchandising or channel decisions
  • replacement / exchange logic if it changes the revenue story

What to pull from finance

This is where the “looks good in Shopify” story meets reality.

At minimum:

  • recognized or net revenue logic
  • COGS where available
  • payment processing or marketplace fees if material
  • any adjustments the business already trusts for formal reporting

Which fields should you skip at first?

A good first version is narrower than most teams expect.

Skip or defer:

  • vanity engagement metrics that do not affect decisions
  • every campaign dimension if the business only needs channel-level truth first
  • every customer trait if segmentation is not yet part of the operating question
  • exotic attribution rules before basic revenue and return logic is trusted
  • every possible product attribute if you have not yet proven the core profitability model

The first win is not “complete coverage.”

The first win is one credible operating view that makes the next budget or merchandising decision less political.

What usually goes wrong in ecommerce data integration?

1. Gross revenue gets treated like profit

This is the most common trap.

Gross revenue is easy to pull and easy to celebrate.

But once discounts, refunds, shipping, fulfillment, marketplace fees, and return behavior show up, the apparent winners can change quickly.

If the team only sees top-line sales, it may scale the channel or product that looks busy instead of the one that makes money.

2. Return timing hides the real result

A campaign can look fantastic in the week it launches and terrible once the returns land.

If your reporting only compares same-period orders and same-period refunds without cohort logic, the decision layer will keep flattering the wrong campaigns.

3. Multi-currency and settlement logic get hand-waved

If you sell across regions, settle in different currencies, or rely on finance adjustments later in the process, a “simple revenue export” can be much less simple than it looks.

This does not mean the first version has to be perfect.

It does mean the assumptions have to be named.

4. Customer identity gets treated like a solved problem

One customer can show up as:

  • a Shopify customer record
  • an email subscriber
  • a returning purchaser with a changed email
  • a CRM contact
  • an ad-platform conversion

If the identity logic is vague, repeat-vs-new analysis and customer-quality reporting become much less trustworthy.

5. Attribution gets asked to do too much

Platform attribution can help with directional optimization.

It is rarely enough on its own to explain profitability.

A useful ecommerce playbook does not force attribution to answer every question. It lets attribution be one input inside a bigger operating model.

If that attribution gap is the immediate pain, the DTC ecommerce attribution case study shows what happens when channel reporting finally gets grounded.

What should the first decision-ready model produce?

A strong first version should help leadership answer a short list of questions without a meeting turning into a reconciliation exercise.

1. Which channels are driving profitable growth?

Not just revenue. Not just ROAS.

Profitable growth.

That usually means seeing, by channel:

  • spend
  • attributed or blended revenue
  • returns and discounts
  • CAC or customer cost view
  • contribution or margin proxy

2. Which products or categories look good until fulfillment and returns are included?

A product line can drive a lot of sales and still create mediocre economics once shipping complexity, return rates, or discount behavior are visible.

3. Which customer segments are actually worth acquiring?

For many ecommerce teams, the real insight is not simply that one channel is cheaper.

It is that one channel brings customers with:

  • lower return behavior
  • higher repeat purchase rates
  • better product mix
  • healthier margin contribution over time

4. Which part of the stack is creating false confidence?

Sometimes the answer is branded search. Sometimes it is a finance logic mismatch. Sometimes it is that Shopify is being asked to answer a margin question it was never designed to answer.

That diagnostic value is part of the win.

A practical five-step build sequence

If I were setting up the first version for a mid-size ecommerce team, I would keep it this tight:

Step 1: Name the operating decisions

Start with the questions, not the connectors:

  • where should we move budget?
  • which products deserve more push?
  • which customers are more valuable than they first appear?
  • where is margin leaking?

If you skip this, the project turns into stack assembly instead of decision support.

Step 2: Define the core grains

Decide which tables or entities matter most:

  • order
  • customer
  • product / SKU
  • refund / return
  • shipment
  • campaign / channel

This prevents half the future confusion.

Step 3: Model net reality, not just storefront reality

Create one explicit layer where gross sales, discounts, refunds, returns, and fulfillment costs can live in the same conversation.

This is where many teams realize the real problem is not the dashboard. It is the missing economic model behind it.

Step 4: Add marketing and lifecycle context

Once the economics are understandable, add the channel and customer context that changes allocation decisions.

That is where spend, acquisition source, lifecycle status, and customer quality start becoming useful instead of noisy.

Step 5: Publish one narrow operating output

Do not start by shipping ten dashboards.

Ship one operating view or weekly review table that the team can actually use.

That is enough to reveal where the next layer of work should go.

What does good look like after launch?

Good does not necessarily look like a perfect warehouse.

Good usually looks like this:

  • marketing can stop defending platform-native truth as if it were business truth
  • finance can see how acquisition and return behavior affect net performance
  • ecommerce leadership can see which channels and products deserve confidence
  • the team can explain why one number changed, not just that it changed

If your current stack is missing the warehouse-to-workflow bridge, Data Activation Playbook is the next read after this one.

Download the ecommerce data playbook

This companion PDF is intentionally lightweight. It gives you a practical checklist for the source systems, core fields, common gotchas, and the first operating outputs worth building.

Download the Ecommerce Data Playbook (PDF)

A lightweight planning guide for connecting Shopify, ad platforms, lifecycle tools, fulfillment, returns, and finance into one decision-ready reporting layer.

Or download the PDF directly.

Bottom line

Most ecommerce data projects do not fail because the APIs are impossible.

They fail because the business never decided which version of truth it actually needed.

If your team can see revenue but not real profitability by channel, product, and customer type, start with Show Me the Margin.

If the problem is broader than one profitability lens and you need a bigger measurement system leaders can trust, start with Revenue Analytics.

Download the Ecommerce Data Playbook (PDF)

A lightweight planning guide for connecting Shopify, ad platforms, lifecycle tools, fulfillment, returns, and finance into one decision-ready reporting layer.

Download

Common questions about ecommerce data integration

Do we need every Shopify field and every ad-platform metric in the warehouse?

No. Start with the fields that help you explain revenue, returns, CAC, customer quality, and margin. Most teams slow themselves down by ingesting everything before they define the decisions the stack needs to support.

Why do ecommerce revenue numbers disagree so often across teams?

Because each system measures a different version of reality. Shopify may show gross order value, ad tools show attributed conversion value, fulfillment tools show shipment cost, and finance cares about net revenue after refunds and adjustments. If you do not model those definitions explicitly, every team brings a different truth to the meeting.

What is the biggest ecommerce data-modeling mistake?

Treating orders as the whole story. Real profitability requires connecting orders to refunds, returns, discounts, shipping, acquisition cost, and customer behavior over time. Order data alone usually makes growth look cleaner than it is.

Should the first version include attribution and margin together?

Usually yes, at least directionally. Even a lightweight first version is more useful when channel spend, return behavior, and downstream profitability can be seen in the same conversation instead of in separate dashboards owned by different teams.

Share :

Jason B. Hart

About the author

Jason B. Hart

Founder & Principal Consultant

Founder & Principal Consultant at Domain Methods. Helps mid-size SaaS and ecommerce teams turn messy marketing and revenue data into decisions leaders trust.

Marketing attribution Revenue analytics Analytics engineering

Jason B. Hart is the founder of Domain Methods, where he helps mid-size SaaS and ecommerce teams build analytics they can trust and operating systems they can actually use. He has spent the better …

Get posts like this in your inbox

Subscribe for practical analytics insights — no spam, unsubscribe anytime.

Related Posts

Book a Discovery Call