Member-only story

Amazon ML Challenge 2024 Solution

Building an Image-Based Entity Extraction Model for E-commerce

7 min readSep 16, 2024

Problem Overview

In the realm of e-commerce, obtaining precise product details from images is crucial, especially when textual descriptions are absent or incomplete. This capability extends beyond e-commerce, impacting healthcare and content moderation, where accurate data, such as product dimensions, weight, volume, and other entity values, are vital for operations.

The challenge of this hackathon is to develop a machine learning model that can accurately extract and predict entity values, such as weight, volume, and dimensions, directly from product images. This task is integral to enhancing the quality of digital marketplaces and improving user experience. The model will predict these values in a predefined format and will be evaluated based on how accurately it can reproduce the ground truth using the F1 score.

Data Structure

The dataset consists of several columns, including:

index: A unique identifier for each product.
image_link: A URL to download the product image.
group_id: A category code for the product.
entity_name: The entity value label (e.g., “item_weight”).
entity_value: The actual value of the product entity (e.g., “34 gram”).

The task involves building a model to extract and predict the entity_value for unseen test data where this column is not available.

The output format must follow the structure:

index: Unique identifier from the test dataset.
prediction: A string formatted as “x unit,” where x is a float, and the unit is one of the allowed units, e.g., "2.5 gram", "12 centimeter". Invalid formats like "60 ounce/1.7 kg" or scientific notation will lead to penalties during evaluation.

Evaluation Criteria

The evaluation metric for this task is the F1 score, a balance between precision and recall. These metrics are computed as follows:

Precision: The ratio of correctly predicted values to all predicted values.
Recall: The ratio of correctly predicted values to all actual values.

Amazon ML Challenge 2024 Solution

Building an Image-Based Entity Extraction Model for E-commerce

Problem Overview

Data Structure

Evaluation Criteria

Written by Vishesh Rawal

Responses (1)