Historical Data and New vs. Repeat Customers
Rockerbox distinguishes between new and returning customers, allowing you to understand the unique marketing mix and path to conversion for your new vs. repeat users.
In order to do so, Rockerbox needs a one-time log of all historical orders.
If you leverage Shopify, Rockerbox automatically pulls in your historical orders, and you do not need to separately send us a file.
The dataset we require includes the following:
| Field | Description | 
| order_id | Match the format from the Purchase event (i.e. don’t send us Shopify Order IDs from your Purchase event, and CRM Order IDs for the historical file). | 
| user_id | Match the format from the Purchase event. | 
| email_address | Optional. We can use this value to match users in cases where user_id formats change over time (ex: you switched from one order system to another). | 
| timestamp | Full timestamp of order including date, time, timezone offset—ideally in UTC but if not, please let Rockerbox know the timezone of the timestamp | 
| revenue | This should match the revenue value we get on the Purchase event. If you send us net revenue, the historical file should also be net revenue. if you send us gross, it should be gross. | 
- File Format. The file will be in a spreadsheet format, such as .csv, .tsv or .xlsx. Rockerbox will set up an SFTP endpoint to which the file can be delivered.
- Date Range. The date range of the historical order file should start at the beginning of the customer’s order history all the way to when the Purchase event was completely setup in Rockerbox.
- File Size. If your order history contains over 500,000 records, break your files up into files containing up to 500,000 records each. Keep the column headers consistent across the files.
- Processing Time. It will take Rockerbox approximately 3 business days to process the file, assuming we do not encounter any issues in the dataset.
- User Identifiers. Rockerbox only requires one of user_id or email_address to process the historical order file but recommends including both fields in the file. If you are only including one of these user identifiers in the historical order file, please ensure this user identifier is being passed to Rockerbox on the Purchase event.
How is New vs. Repeat determined by Rockerbox?
We compare the user identifiers collected on each conversion, in other words, the values that make up the identity resolution lookup chain, to the identifiers present in the historical file + conversions tracked by Rockerbox. If we have seen a conversion with an identifier such as a user_id or email_address, to name a couple examples, in previous conversions, we will consider this a repeat or returning user.
If we do not have any matches, we consider this conversion to have been completed by a new user.
Note that for each conversion, the identifiers we use for identity resolution may vary. For example, Direct Mail will include the user's hashed mailing address. If you are unclear what identifiers are used for a conversion, please contact our support team at support@rockerbox.com.