Designing the SFMC Data Model Before You Create a Single DE

Most Salesforce Marketing Cloud projects end up with Data Extension sprawl that can't be joined, segmented, or reported on. The fix is a 30-minute data model exercise before the first DE gets created.

Published Oct 08, 2025

Key takeaways

Most SFMC projects end up with Data Extension sprawl that cannot be joined, segmented, or reported on. The fix is a 30-minute data model exercise before the first DE gets created: define the Subscriber Key, the identifier mapping, and the DE split pattern.
Subscriber Key is the canonical identifier across all DEs. Customer ID is usually right (stable, opaque, persistent across email changes); email-as-key is the trap that breaks the first time a customer updates their email. Lock the Subscriber Key choice on day 1.
Identifier mapping across CRM, e-commerce, and loyalty systems requires explicit cross-reference DEs. A single "IdentityCrossref" DE, with one row per customer mapping CRM ID to e-commerce ID to loyalty ID, becomes the foundation every downstream segment can join through.
DE split pattern: master DEs (one row per customer with enriched attributes), event DEs (transactional history), sendable DEs (campaign-specific target lists). Mixing these into one giant DE breaks at scale. The 3-layer split is the production discipline.

After discovery on a typical mid-market SFMC engagement, you find out the client has subscribers spread across three systems - CRM, an e-commerce platform, and a loyalty program. Each one uses a different primary key to identify a customer. If you skip the data modeling step and start creating Data Extensions as the data arrives, you end up with DEs that can't be joined, segments that can't be built, and reports that can't be produced.

A 30-minute exercise before anyone opens the Contact Builder UI fixes this.

The setup to map first

Map what each source system actually stores, including its identifier column:

Source 1: Salesforce CRM
  Contact: ContactID, Email, FirstName, LastName, SalesRepID

Source 2: Shopify E-commerce
  Customer: ShopifyID, Email, TotalOrders, LastOrderDate, TotalSpend

Source 3: Loyalty Program
  Member: MemberID, Email, Tier, Points, JoinDate

Three different primary keys. Three different customer records describing the same person.
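
As a quick sanity check on the mapping, a minimal Python sketch can list each source's columns and intersect them to find candidate match fields. The dictionary layout and the function name shared_columns are illustrative assumptions, not part of any SFMC tooling.

```python
# Columns each source system exposes, copied from the mapping above.
SOURCES = {
    "Salesforce CRM": ["ContactID", "Email", "FirstName", "LastName", "SalesRepID"],
    "Shopify": ["ShopifyID", "Email", "TotalOrders", "LastOrderDate", "TotalSpend"],
    "Loyalty": ["MemberID", "Email", "Tier", "Points", "JoinDate"],
}


def shared_columns(sources):
    """Return the column names present in every source, sorted."""
    column_sets = [set(cols) for cols in sources.values()]
    return sorted(set.intersection(*column_sets))


print(shared_columns(SOURCES))  # ['Email']
```

A shared column name is only a candidate. Whether the values actually line up (casing, whitespace, stale addresses) still has to be checked against real data.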

The unifying identifier

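You need one common identifier to match a person across systems. In this case, email works as the match key because it's present in all three. If email isn't shared across systems (typical for SMS-first or multi-channel identity), fall back to a deterministic hash or a CDP-provided canonical ID.

The match key is only how you recognize the same person across sources. The identifier you resolve to becomes the Subscriber Key in SFMC - one customer, one Subscriber Key, regardless of how many systems they appear in. Prefer a stable ID such as the CRM ContactID for that role, because email-as-key breaks the first time a customer changes their address.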
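
A minimal sketch of that matching step, assuming email is the shared field and the CRM ContactID is carried forward as the durable CustomerID (in line with the key takeaways). The function names, the sample IDs, and the decision to treat CRM as the system of record are all assumptions for illustration.

```python
def normalize_email(email):
    """Trim and lowercase so 'Ana@Example.com ' and 'ana@example.com' match."""
    return email.strip().lower()


def build_identity_crossref(crm_contacts, shopify_customers, loyalty_members):
    """Return one cross-reference row per CRM contact.

    Assumption: CRM is the system of record, so people who exist only in
    Shopify or Loyalty are not emitted here and need separate handling.
    """
    shopify_by_email = {
        normalize_email(c["Email"]): c["ShopifyID"] for c in shopify_customers
    }
    loyalty_by_email = {
        normalize_email(m["Email"]): m["MemberID"] for m in loyalty_members
    }
    rows = []
    for contact in crm_contacts:
        key = normalize_email(contact["Email"])
        rows.append({
            "CustomerID": contact["ContactID"],
            "ShopifyID": shopify_by_email.get(key),  # None when no match
            "MemberID": loyalty_by_email.get(key),   # None when no match
        })
    return rows


# Hypothetical sample records, one person seen in all three systems.
crm = [{"ContactID": "C-001", "Email": "Ana@Example.com"}]
shopify = [{"ShopifyID": "S-77", "Email": "ana@example.com "}]
loyalty = [{"MemberID": "M-9", "Email": "ANA@example.com"}]

crossref = build_identity_crossref(crm, shopify, loyalty)
print(crossref)
```

The output rows have the shape of the IdentityCrossref DE described in the key takeaways: one row per customer, with every system's ID side by side.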

The master DE pattern

Don't dump everything into one giant Data Extension. Build a Master_Customer_DE that holds the identifiers + commonly-segmented attributes, and let satellite DEs hold the detail:

Master_Customer_DE (the hub):
  - CustomerID         <- Primary Key (UUID or CRM's ContactID); Send Relationship to Subscriber Key
  - EmailAddress       <- delivery address (EmailAddress field type)
  - FirstName
  - LastName
  - SalesRepID         <- lookup into SalesRep_DE
  - MemberTier         <- from Loyalty
  - TotalSpend         <- from Shopify
  - LastOrderDate      <- from Shopify

Satellite DEs referenced by Master:

SalesRep_DE: SalesRepID, Name, Email, PhoneNumber
Order_History_DE: OrderID, CustomerID, OrderDate, Amount, Items
Loyalty_Activity_DE: MemberID, Activity, Points, Date (joins to Master through the IdentityCrossref mapping of MemberID to CustomerID)

Join from Master to satellites via AMPscript Lookup() or LookupRows() at send time, or pre-compute joins via Automation Studio SQL into a ready-to-send DE.
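
The pre-computed join can be prototyped locally before it goes into a Query Activity. The sketch below uses Python's standard-library sqlite3 module as a stand-in. SFMC Query Activities use their own SQL dialect, so treat the statement as a starting point to adapt rather than something to paste in. The sample rows and the SalesRepName alias are hypothetical, and ORDER BY is there only to make the local output deterministic.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Trimmed-down versions of the hub and one satellite, with made-up rows.
conn.executescript("""
CREATE TABLE Master_Customer_DE (
    CustomerID TEXT PRIMARY KEY,
    EmailAddress TEXT,
    FirstName TEXT,
    SalesRepID TEXT
);
CREATE TABLE SalesRep_DE (
    SalesRepID TEXT PRIMARY KEY,
    Name TEXT,
    Email TEXT
);
INSERT INTO Master_Customer_DE VALUES ('C-001', 'ana@example.com', 'Ana', 'R-1');
INSERT INTO Master_Customer_DE VALUES ('C-002', 'bo@example.com', 'Bo', NULL);
INSERT INTO SalesRep_DE VALUES ('R-1', 'Sam Lee', 'sam@example.com');
""")

# LEFT JOIN keeps customers who have no sales rep assigned.
READY_TO_SEND_SQL = """
SELECT m.CustomerID, m.EmailAddress, m.FirstName, r.Name AS SalesRepName
FROM Master_Customer_DE AS m
LEFT JOIN SalesRep_DE AS r ON r.SalesRepID = m.SalesRepID
ORDER BY m.CustomerID
"""

ready_to_send = conn.execute(READY_TO_SEND_SQL).fetchall()
print(ready_to_send)
```

The design choice to notice is the LEFT JOIN: an INNER JOIN would silently drop every customer without a sales rep from the send.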

Two principles that save you from a rewrite

Principle 1: One Subscriber Key per human

Even if the same person is a Contact in CRM, a Customer on Shopify, and a Member in Loyalty - they get one Subscriber Key in SFMC. Otherwise the same person receives three welcome emails, one for each identity, and unsubscribing from one doesn't stop the other two.

The Subscriber Key should be stable over time. Don't use fields that change (like Shopify internal IDs that can reset on a migration). Use the most durable identifier the client controls - usually their CRM ContactID or a purpose-built customer UUID.
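
To see why stability matters, here is a small Python sketch that replays the same email change against two keying strategies. The upsert helper and the sample addresses are hypothetical. It models only keyed upsert behavior, not SFMC's actual subscriber handling.

```python
def upsert(subscribers, key_field, record):
    """Insert or overwrite the entry keyed by record[key_field]."""
    subscribers[record[key_field]] = record
    return subscribers


# The same person before and after changing their email address.
before = {"CustomerID": "C-001", "Email": "ana@old.example"}
after = {"CustomerID": "C-001", "Email": "ana@new.example"}

keyed_by_email = {}
upsert(keyed_by_email, "Email", before)
upsert(keyed_by_email, "Email", after)

keyed_by_customer_id = {}
upsert(keyed_by_customer_id, "CustomerID", before)
upsert(keyed_by_customer_id, "CustomerID", after)

print(len(keyed_by_email))        # 2
print(len(keyed_by_customer_id))  # 1
```

Keyed by email, the update produces a second subscriber and strands the history on the old one. Keyed by CustomerID, the same update simply overwrites the address on the existing record.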

Principle 2: Split large DEs by function

A DE with 60 columns is a DE you can't maintain. When you inherit one, good luck knowing which attributes are still populated and which are stale.

Break by function:

Master_Customer_DE: identifiers + 10-15 most-queried segmentation fields
Order_History_DE: transaction detail, grows over time
Preference_DE: channel/content preferences
Consent_DE: opt-in records with timestamps

Join on demand instead of duplicating columns.
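
One way to plan the split is to write the target layout down as data and project a wide row through it. This is a rough Python sketch. The DE names come from the list above, but the column assignments (PreferredChannel, EmailOptIn, EmailOptInAt) are hypothetical placeholders.

```python
# Target layout: which columns belong to which DE. CustomerID repeats in
# every DE because it is the join key, not a duplicated attribute.
DE_LAYOUT = {
    "Master_Customer_DE": ["CustomerID", "EmailAddress", "FirstName", "MemberTier"],
    "Preference_DE": ["CustomerID", "PreferredChannel"],
    "Consent_DE": ["CustomerID", "EmailOptIn", "EmailOptInAt"],
}


def split_by_function(wide_row, layout=DE_LAYOUT):
    """Project one wide row into one narrow row per target DE."""
    return {
        de_name: {col: wide_row.get(col) for col in columns}
        for de_name, columns in layout.items()
    }


# Hypothetical row from an inherited mega-DE.
wide_row = {
    "CustomerID": "C-001",
    "EmailAddress": "ana@example.com",
    "FirstName": "Ana",
    "MemberTier": "Gold",
    "PreferredChannel": "Email",
    "EmailOptIn": True,
    "EmailOptInAt": "2025-01-15T09:30:00Z",
}

rows = split_by_function(wide_row)
print(rows["Consent_DE"])
```

Keeping the layout as a plain mapping also gives you a reviewable artifact: any column in the old DE that appears in no target list is a candidate for retirement.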

Common mistakes we fix on audit

No Subscriber Key strategy - each DE uses its own primary key, so joins are impossible. Fix: pick a Subscriber Key and retrofit it across existing DEs.
Email as Subscriber Key - same person with multiple addresses = multiple subscribers. Fix: use CustomerID.
One mega-DE - 80-column monstrosity, unknown which columns are fresh. Fix: split by function, use lookups.
Dropping raw CRM fields into DEs untouched - Contact__c_External_ID__pc as a column name in SFMC reporting is a bad day. Fix: rename on import, keep SFMC-facing names clean.
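
The rename-on-import fix can be as simple as a lookup table applied to each incoming row. A minimal Python sketch, assuming the import passes through a script you control. The target name ExternalContactID is a hypothetical choice.

```python
# Raw CRM field name -> clean SFMC-facing name. Unlisted fields pass through.
FIELD_RENAMES = {
    "Contact__c_External_ID__pc": "ExternalContactID",
}


def rename_on_import(row, renames=FIELD_RENAMES):
    """Return a copy of the row with raw CRM field names replaced."""
    return {renames.get(name, name): value for name, value in row.items()}


raw_row = {"Contact__c_External_ID__pc": "EXT-42", "Email": "ana@example.com"}
clean_row = rename_on_import(raw_row)
print(clean_row)  # {'ExternalContactID': 'EXT-42', 'Email': 'ana@example.com'}
```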

Takeaway

The data model lives on a whiteboard (or a Miro board) before any DE is created. Get the identifier story right, split DEs by function, and you avoid the single biggest source of rework on SFMC engagements. When onboarding a new client, asking "can I see your data model?" within the first week often reveals that there isn't one - which is both the problem and the opportunity.


Kevin Trinh

Salesforce Engineer
