Modeling Detailed

Updated 2 years ago by Kelsey Kearns

Overview

Model definition

Rockerbox builds a logistic model that uses customers’ Marketing Interactions to estimate the likelihood that a customer will complete a given Conversion (e.g., a Purchase).

Marketing Interactions are the independent variables in our model. They are binary variables that indicate whether a customer has interacted with a specific piece / classification of marketing. Conversions are the dependent variables in our model. They are binary variables that indicate whether a customer successfully completes a Conversion.

Formalizing this a bit more:

X is a matrix describing our independent variables (Marketing Interactions). It has i rows and j columns, where i is the number of users Rockerbox has data for and j is the number of Marketing Interactions being evaluated by the model.
- Each row of the matrix is a single customer’s marketing interactions.
- Each element within the row indicates whether that customer interacted with a specific piece of marketing.

y is a vector of length i, indicating whether each customer in our model completed a given conversion event.

f represents our trained logistic model.

ŷ is the vector of predicted probabilities, one per customer, produced by our logistic model.

The relationship between the trained model and our independent variables can be summarized with the following formula:

f(X) = ŷ
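As a minimal sketch of the definition above, assuming the standard logistic (sigmoid) form, the prediction step can be written in Python. The channel names, weights, and intercept are hypothetical values chosen for illustration; they are not Rockerbox's trained parameters.

```python
import math

# Hypothetical Marketing Interactions (the j columns of X); names are illustrative.
FEATURES = ["paid_search", "paid_social", "email"]

# X: one row per customer, one binary column per Marketing Interaction.
X = [
    [1, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
    [1, 1, 1],
]

# Assumed parameters of an already-trained model (illustration only).
WEIGHTS = [0.8, 0.5, 1.2]
INTERCEPT = -3.0


def f(rows, weights=WEIGHTS, intercept=INTERCEPT):
    """Return y_hat: the predicted conversion probability for each row of X."""
    y_hat = []
    for row in rows:
        # Linear combination of the binary interactions and their weights.
        z = intercept + sum(w * x for w, x in zip(weights, row))
        # The logistic function maps z to a probability between 0 and 1.
        y_hat.append(1.0 / (1.0 + math.exp(-z)))
    return y_hat


y_hat = f(X)
```

Each entry of y_hat lines up with one row of X, so a customer with more positively weighted interactions receives a higher predicted probability.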

Model Usage

Although the logistic model is designed to be a predictive model, the probability associated with a user’s likelihood to convert is not the result we are after. Rather, we use the weights of the model to describe the historical behavior of marketing. For each user that has converted, we award credit to marketing channels based on the normalized weights from our model.

Specifically, if a single user interacted with three marketing touchpoints before converting, the conversion is split across those touchpoints in proportion to the weights associated with those variables in our model.
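The normalization described above can be illustrated with a short Python sketch. The weights are hypothetical, and the sketch assumes every touchpoint on the path has a positive weight; the source does not describe how zero or negative weights are treated.

```python
def allocate_credit(touchpoints, weights):
    """Split one conversion across touchpoints in proportion to model weights."""
    relevant = {name: weights[name] for name in touchpoints}
    total = sum(relevant.values())
    if total <= 0:
        # Assumption of this sketch: weights on the path are positive.
        raise ValueError("touchpoint weights must sum to a positive value")
    return {name: weight / total for name, weight in relevant.items()}


# Hypothetical model weights, keyed by marketing touchpoint.
weights = {"paid_search": 0.8, "paid_social": 0.5, "email": 1.2, "display": 0.3}

# A converting user who interacted with three touchpoints.
credit = allocate_credit(["paid_search", "paid_social", "email"], weights)
```

Here the three weights sum to 2.5, so paid_search receives 0.8 / 2.5 of the conversion, paid_social 0.5 / 2.5, and email 1.2 / 2.5. The display weight is ignored because the user never interacted with it.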

Data Preparation

Clean data is important for the model to have good descriptive / predictive power. For this model, we use a few techniques to ensure that (1) the number of independent variables the model is trained against is limited and (2) all marketing interaction information is included in the model.

Marketing tiering

All marketing interactions are placed into a hierarchy. We call this process tiering. This tiering allows us to easily classify every marketing touchpoint with an accurate description and group similar marketing touchpoints together.

Limiting variables

A significant reason for tiering variables is so that we can limit the number of independent variables that are ultimately included in the model without losing critical details about marketing interaction. By placing marketing events into tiers, we can easily “roll-up” marketing interactions from a lower tier to a higher tier, if there is a lack of sufficient data to support a variable being included in the model as its own marketing touch point.

An example of this roll-up would be a marketing campaign that is testing hundreds of different personalized variations of a creative. Although no individual creative generates enough marketing interactions to be included in the model on its own, the campaign can encompass all of these sub-strategies, be included in the model, and correctly assign credit to this set of marketing interactions.

We try to limit the number of variables in the model based on the number of conversions we see per day. To avoid overfitting, we dynamically set the minimum threshold a variable must meet to be included in the model relative to that conversion volume.
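The roll-up logic described in this section might look like the following sketch. The tier paths, the fixed min_count threshold, and the function name are assumptions for illustration; in practice the threshold is set dynamically, as noted above.

```python
from collections import Counter


def roll_up(paths, min_count):
    """Map each tier path to its most specific ancestor with enough data.

    paths: one tuple per observed interaction, e.g. (channel, campaign, creative).
    min_count: hypothetical minimum number of interactions a variable needs.
    """
    paths = [tuple(p) for p in paths]
    mapping = {p: p for p in set(paths)}
    max_depth = max(len(p) for p in paths)
    # Work from the lowest tier upward; the top tier is always kept.
    for depth in range(max_depth, 1, -1):
        counts = Counter(mapping[p] for p in paths)
        for original, current in mapping.items():
            if len(current) == depth and counts[current] < min_count:
                mapping[original] = current[:-1]  # roll up one tier
    return mapping


paths = (
    [("social", "campaign_a", "creative_1")] * 2
    + [("social", "campaign_a", "creative_2")] * 3
    + [("search", "brand", "kw_1")] * 6
)
mapping = roll_up(paths, min_count=5)
```

With a threshold of 5, neither creative has enough interactions on its own, so both roll up to ("social", "campaign_a"), which then has 5 interactions and is kept as a variable. The search keyword already meets the threshold and stays at its own tier.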

Variable correlations

Beyond tiering the data and automatically limiting which tiers are included in the model based on that threshold, we can also remove variables that are highly correlated with one another. This is currently a manual pruning process that we perform when correlation is an issue for a set of independent variables.
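Since the pruning itself is manual, a helper can at most surface candidate pairs for review. The sketch below computes the Pearson correlation between binary columns and flags pairs above a cutoff. The 0.9 cutoff, the column names, and the data are assumptions for illustration.

```python
import math
from itertools import combinations


def pearson(a, b):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant column has no defined correlation
    return cov / math.sqrt(var_a * var_b)


def correlated_pairs(columns, threshold=0.9):
    """Return (name_a, name_b, r) for every pair with |r| >= threshold."""
    flagged = []
    for (name_a, col_a), (name_b, col_b) in combinations(columns.items(), 2):
        r = pearson(col_a, col_b)
        if abs(r) >= threshold:
            flagged.append((name_a, name_b, round(r, 3)))
    return flagged


# Hypothetical binary interaction columns, one value per user.
columns = {
    "paid_social": [1, 1, 0, 0, 1, 0, 1, 0],
    "paid_social_retargeting": [1, 1, 0, 0, 1, 0, 1, 0],
    "email": [0, 1, 1, 0, 0, 1, 0, 0],
}
flagged = correlated_pairs(columns)
```

In this toy data the two paid social columns are identical, so they are the only pair flagged for a possible manual merge or removal.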

Cohort / data selection

The users included in the model are chosen so that a sufficient attribution window is present for all marketing interactions to have an impact on a user converting. To determine the cohort window, we run an analysis on the conversion event against which we are building the model. Specifically, we set the cohort window to capture 95% of the marketing events that reached users who went on to convert.
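One way to read the 95% rule is as a percentile over the lag, in days, between each marketing event and the conversion it preceded. The sketch below takes that interpretation; the lag values are made up and the function name is hypothetical.

```python
import math


def cohort_window(lags_in_days, coverage_pct=95):
    """Smallest window (in days) covering coverage_pct percent of observed lags."""
    if not lags_in_days:
        raise ValueError("at least one lag is required")
    ordered = sorted(lags_in_days)
    # Number of events that must fall inside the window, rounded up.
    needed = math.ceil(coverage_pct * len(ordered) / 100)
    return ordered[needed - 1]


# Hypothetical days between each marketing event and the user's conversion.
lags = [0, 1, 1, 2, 2, 3, 3, 4, 5, 5, 6, 7, 8, 10, 12, 14, 18, 21, 28, 90]
window = cohort_window(lags)
```

With these 20 lags, 19 of them (95%) occur within 28 days of the conversion, so the window is 28 days and the single 90-day outlier does not stretch it.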

After we know the conversion window necessary to build a model, we select cohorts to be included in our model using the following procedure:

For dates < (today - cohort_window), find all users whose first interaction occurred on a particular day. Then, select all marketing interactions going forward from this day through the end of the cohort window.

The diagram below shows the cohort selection criteria for a cohort window of 30 days:

Using the above data selection enables us to have a complete and fair dataset against which to properly evaluate and assign weight.
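The selection procedure above can be sketched as follows. The event format and function name are hypothetical, and treating the end of the window as inclusive is an assumption of this sketch.

```python
from datetime import date, timedelta


def select_cohort_events(events, today, cohort_window_days):
    """Pick the events that belong to complete cohorts.

    events: list of (user_id, event_date) marketing interactions.
    Returns {user_id: [event_date, ...]} for users whose first interaction
    is older than the cohort window, limited to events inside that window.
    """
    window = timedelta(days=cohort_window_days)
    cutoff = today - window

    # Find the first interaction date for each user.
    first_seen = {}
    for user_id, event_date in events:
        if user_id not in first_seen or event_date < first_seen[user_id]:
            first_seen[user_id] = event_date

    selected = {}
    for user_id, event_date in events:
        start = first_seen[user_id]
        # Keep users whose cohort day is before (today - cohort_window),
        # and only their events from that day through the end of the window.
        if start < cutoff and start <= event_date <= start + window:
            selected.setdefault(user_id, []).append(event_date)
    return selected


events = [
    ("a", date(2024, 2, 1)),
    ("a", date(2024, 2, 15)),
    ("a", date(2024, 3, 10)),  # outside user a's 30-day window
    ("b", date(2024, 3, 20)),  # cohort too recent to have a full window
]
selected = select_cohort_events(events, today=date(2024, 3, 31), cohort_window_days=30)
```

User b is excluded because a full 30 days have not yet elapsed since their first interaction, which is what keeps recent, still-maturing cohorts from being judged on incomplete data.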

Modeling Technique

With the above tiering and data selection complete, we are ready to build the model. To do this, we break up model building into two parts.

First, we dynamically determine a C value to penalize over-fitting.
Next, we train the model and evaluate the results.

In both steps, we use stratified k-folds. To determine the C value, we run a grid search across all k-folds and select the best-performing value.

To train, test, and evaluate the model, we again use cross-validation folds, running multiple train/test splits to produce the evaluation metrics and plots used to assess the model.
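As a rough illustration of the two steps above, the sketch below builds stratified folds in plain Python and runs a grid search over candidate C values. The fit_and_score callback is a hypothetical stand-in for fitting and scoring a logistic model, and the candidate values and scoring metric are left to the caller; none of this is taken from Rockerbox's implementation. If C follows the convention used by scikit-learn's LogisticRegression, smaller values mean stronger regularization.

```python
import random


def stratified_k_folds(y, k, seed=0):
    """Assign row indices to k folds so each fold keeps a similar class ratio."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for label in sorted(set(y)):
        indices = [i for i, value in enumerate(y) if value == label]
        rng.shuffle(indices)
        # Deal each class out to the folds in turn.
        for position, index in enumerate(indices):
            folds[position % k].append(index)
    return folds


def train_test_splits(folds):
    """Yield (train_indices, test_indices), holding out one fold at a time."""
    for held_out in range(len(folds)):
        test = folds[held_out]
        train = [i for j, fold in enumerate(folds) if j != held_out for i in fold]
        yield train, test


def grid_search_c(c_values, folds, fit_and_score):
    """Return the C with the best mean held-out score across all folds.

    fit_and_score(c, train, test) is a caller-supplied (hypothetical) function
    that fits a model with penalty value c and returns a score on the test rows.
    """
    def mean_score(c):
        scores = [fit_and_score(c, train, test)
                  for train, test in train_test_splits(folds)]
        return sum(scores) / len(scores)

    return max(c_values, key=mean_score)


# 10 converters among 100 users: every fold should hold 2 converters.
y = [1] * 10 + [0] * 90
folds = stratified_k_folds(y, k=5)
```

Stratifying matters here because conversions are typically rare; without it, a fold could contain few or no converters and produce unstable evaluation metrics.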
