Page cover

Prepare Your Data

This guide will walk you through the process of preparing your data in CSV format for submission. Adhering to these guidelines will ensure a smooth and accurate analysis of your data.

General Guidelines for CSV Files

  1. File Format:

    • Use .csv as the file extension.

    • Ensure the file is encoded in UTF-8 without BOM for compatibility.

  2. Headers:

    • Include a header row with descriptive column names.

    • Avoid special characters, spaces, or inconsistent capitalization in headers. Use snake_case (e.g., order_id, customer_email).

  3. Delimiter:

    • Use commas (,) as the delimiter.

    • Ensure no additional delimiters are used within values unless enclosed in double quotes (").

  4. Data Consistency:

    • Ensure each column contains consistent data types (e.g., numbers, dates, text).

    • Remove or mark any blank or irrelevant data.

  5. File Size:

    • If the file exceeds 100 MB, split it into smaller files and include a descriptive naming convention (e.g., orders_part1.csv, orders_part2.csv).

  6. Naming Conventions:

    • Use descriptive names for files that match the data type (e.g., orders.csv, returns.csv).


Preferred Data Categories

We recommend providing the following data categories to help us deliver a comprehensive report:

  • Orders: Information about customer purchases.

  • Customers: Details of your customer base.

  • Products: Information about the items sold.

  • Returns: Data on returned items and associated reasons.


Optional: Column Documentation File

To enhance the clarity of your data, include a column_documentation.csv file that explains the meaning of each column in your datasets. Use the following format:

column_name

description

example

order_id

Unique identifier for each order

12345

order_date

Date when the order was placed

2024-12-01

customer_id

Unique identifier for the customer

CUST001

return_reason

Reason provided for returning a product

Incorrect Item


Last updated

Was this helpful?