EXP-010 / Retail Analytics / Published

Market Basket Analysis

A retail analytics project that generates synthetic transactions, mines frequent itemsets with Apriori and FP-Growth, derives association rules, and exports item-support, rule, summary, and visualization artifacts.

EXP-010 / question

Technical question

Can association-rule mining uncover interpretable product co-purchase patterns for retail decisions?

EXP-010 / method

Method and workflow

  1. Generate synthetic retail transactions with realistic co-purchase bias.
  2. Mine frequent itemsets using Apriori or FP-Growth through mlxtend.
  3. Derive association rules and rank them by support, confidence, lift, leverage, and conviction.
  4. Export item support, frequent itemsets, association rules, summary JSON, and top-item visualizations.
  5. Interpret discovered rules as retail bundling, merchandising, and recommendation signals.
synthetic transactions one-hot baskets frequent itemsets association rules metrics charts retail insight

EXP-010 / skills demonstrated

Data scientist skills shown

Data mining

Uses frequent itemset mining and association rules to discover patterns rather than predict a target.

Retail analytics

Translates product co-occurrence metrics into bundling and recommendation insights.

Reproducible artifacts

Exports CSVs, JSON summaries, and figures for inspection and communication.

EXP-010 / evidence

Evidence of work

Association metrics

The README defines support, confidence, lift, leverage, and conviction as outputs for evaluating product relationships.

Algorithm comparison

Both Apriori and FP-Growth are available, demonstrating two classic approaches to frequent itemset mining.

Retail interpretation

Example rules show product co-purchase relationships such as laptop accessories appearing together more often than random.

EXP-010 / stack

Technical stack

PythonpandasmlxtendAprioriFP-GrowthMatplotlibCSVJSON
Open repository ↗

EXP-010 / limitations

Limitations and honesty check

  • The transaction data is synthetic, so business conclusions are illustrative rather than production-ready.
  • Association rules reveal correlation, not causation.
  • Large real-world catalogs would need careful threshold tuning and performance optimization.

EXP-010 / next

Next improvements

  • Run the workflow on a real public retail transaction dataset.
  • Add rule filtering by category, lift, confidence, and business value.
  • Add interactive visualizations for item networks and product bundles.
  • Compare rule-based recommendations with collaborative filtering baselines.