commodity-brief-v1 / commodity-brief-current

Commodity Research Brief v1

First actionable research brief for a Tableau-oriented project comparing coffee, tea, and cocoa, with lighter consideration of yerba mate and matcha only where they genuinely fit. This brief is designed to help the team choose sane source combinations and a clean data model.

1. Comparison dimensions

Layer A

Price

Primary comparison layer. Use price per kilogram or a directly comparable unit where possible.

Layer B

Production

Country-year production, harvested area, and related agricultural measures.

Layer C

Trade

Country-year exports/imports using well-defined commodity codes.

Removed from first-pass core: sales / market size as a primary layer.

2. Top 10 candidate datasets

DatasetBest forAuthorityTableau fitNotes
FAOSTAT Production (QCL)ProductionHighHighBest production backbone for coffee, tea, cocoa.
FAOSTAT Trade (TCL)TradeHighHighUseful for country-year export quantity/value within FAO system.
UN ComtradeTradeHighMedium-HighBest detailed trade source; requires HS-code discipline.
World Bank Pink SheetPriceHighHighStrong cross-commodity monthly price layer in comparable units.
FRED coffee price seriesCoffee price contextHighHighStrong coffee-only supporting price series.
ICO statisticsCoffee validation/contextHighMediumExcellent support source; some deeper data access may be limited.
International Tea CommitteeTea validation/contextHighLow-MediumRespected, but more publication-oriented and less frictionless to ingest.
ICCO statisticsCocoa validation/contextHighLow-MediumGreat for cocoa-specific support, weaker as a simple first-download source.
Our World in DataFast prototypingMedium-HighHighConvenient mirror/convenience source, often FAOSTAT-derived.
USDA PSD / FASSecondary validationHighMediumPotentially useful, but not the simplest first backbone compared with FAOSTAT + World Bank + Comtrade.

3. Top 5 composition strategies

  1. Safest: use World Bank Pink Sheet + FAOSTAT Production for price + production.
  2. Best full academic build: use World Bank Pink Sheet + FAOSTAT + UN Comtrade for price + production + trade.
  3. Keep price separate: monthly price data should stay in its own table, not forced into annual production/trade rows.
  4. Use a commodity lookup table: keep a small table defining coffee, tea, cocoa concepts and relevant codes.
  5. Validate with specialist sources: use ICO / ITC / ICCO for context, caveats, and cross-checking rather than building the first fact table directly from their reports.

4. Tableau-readiness comparison

SourceAuthorityScopeCleanlinessTableau-readinessReproducibility
FAOSTAT5/55/54/55/55/5
UN Comtrade5/55/53/54/55/5
World Bank Pink Sheet5/54/54/55/55/5
ICO5/53/53/53/54/5
ITC5/53/52/52/53/5
ICCO5/53/52/52/53/5
OWID4/53/55/55/54/5
USDA PSD4/53/53/53/54/5

5. Expert / institutional sources

  • ICO — best institutional coffee statistics/context source
  • International Tea Committee — best tea-specific institutional statistics/context source
  • ICCO — best cocoa-specific institutional statistics/context source
  • FAO / FAOSTAT — best official agricultural backbone
  • UN Comtrade / UNSD — best official global trade backbone
  • World Bank commodity markets / Pink Sheet — strong cross-commodity price context

These are better as trusted institutions than as interchangeable flat-file data sources.

6. Data model recommendation

Recommended Tableau architecture:

  • prices_monthly — commodity, date/month, unit, value
  • production_country_year — country, year, commodity, production/harvest area/yield
  • trade_country_year — reporter, partner(optional), year, commodity, import/export measure
  • commodity_lookup — commodity definitions and code mappings

Country-year is the preferred architecture for production and trade, but not a red line for price because price data may remain global-monthly.

7. Scope viability options

Safest

Coffee + tea + cocoa using price + production

Use World Bank Pink Sheet + FAOSTAT only. Cleanest and most Tableau-ready.

Richer

Coffee + tea + cocoa using price + production + trade

Use World Bank Pink Sheet + FAOSTAT + UN Comtrade. Strongest overall research build.

Ambitious

Add yerba / matcha fit analysis

Keep as a sidecar analysis, not the core model. Matcha should remain a tea subcategory if included.

Which source combinations are actually sane to build with?