Prepare Your Data
This guide will walk you through the process of preparing your data in CSV format for submission. Adhering to these guidelines will ensure a smooth and accurate analysis of your data.
General Guidelines for CSV Files
File Format:
Use
.csvas the file extension.Ensure the file is encoded in UTF-8 without BOM for compatibility.
Headers:
Include a header row with descriptive column names.
Avoid special characters, spaces, or inconsistent capitalization in headers. Use
snake_case(e.g.,order_id,customer_email).
Delimiter:
Use commas (
,) as the delimiter.Ensure no additional delimiters are used within values unless enclosed in double quotes (
").
Data Consistency:
Ensure each column contains consistent data types (e.g., numbers, dates, text).
Remove or mark any blank or irrelevant data.
File Size:
If the file exceeds 100 MB, split it into smaller files and include a descriptive naming convention (e.g.,
orders_part1.csv,orders_part2.csv).
Naming Conventions:
Use descriptive names for files that match the data type (e.g.,
orders.csv,returns.csv).
Preferred Data Categories
We recommend providing the following data categories to help us deliver a comprehensive report:
Orders: Information about customer purchases.
Customers: Details of your customer base.
Products: Information about the items sold.
Returns: Data on returned items and associated reasons.
Optional: Column Documentation File
To enhance the clarity of your data, include a column_documentation.csv file that explains the meaning of each column in your datasets. Use the following format:
column_name
description
example
order_id
Unique identifier for each order
12345
order_date
Date when the order was placed
2024-12-01
customer_id
Unique identifier for the customer
CUST001
return_reason
Reason provided for returning a product
Incorrect Item
Last updated
Was this helpful?
