Objective:

This assessment evaluates ETL platforms against LMNT’s actual volume profile and operational needs.

Note:

  • Analysis covers revenue-related sources; SPINS estimates still pending.
  • Excludes marketing, CX, support, finance systems, and other non-revenue sources.

Considerations:

  • ware2go connector is missing in both recommended tools.
  • Spins connector is missing in both recommended tools.

1. LMNT Data Ingestion Volume Overview

SourceEst. Monthly RowsEst. Annual RowsNotes
Emerson250k–260k~3MPOS + Omni; store-item & traits refreshes total ~36M static rows
Shopify DTC700k–1M8M–12MBased on 150–160k orders/month (estimate)
Shopify Wholesale20k–60k250k–700kDirectional
Amazon Marketplace1.5M–2M16M–20MBased on ~300k orders/month (estimate)
Ware2Go360k–1M700k–1.4MShipping spikes up to 12k/day
SpinsUnknownUnknownUnknown

Note: Shopify, Amazon, Wholesale, Ware2Go estimates based on order volume + typical connector schemas. Actual number may vary.


2. ETL Platform Comparison Matrix

CriteriaFivetranPolytomic (ETL)Polytomic (Reverse ETL)Dagster
ScalabilityHighHighHighMedium
ReliabilityVery HighHighHighVariable
MaintenanceVery LowLowLowHigh
Connector CoverageBroadGoodMarketing destinationsCustom
New Connector SpeedSlowFast (~1 week)N/AEngineering-built
CostHigh ($50 / 100k rows)Low (~6M rows ≈ $500)$400/connectorLow base + engineering
Best ForShopify, Amazon, Stripe, Walmart POSunsupported sources in Fivetran/High Volume for FivetranKlaviyo & Ads syncInternal APIs

3. Other Tools Considered

ToolWhy Not Selected for LMNT
AirbyteHigh maintenance; unstable for multi-million row retail datasets
Hevo DataLimited track record with complex omnichannel retail loads
MatillionHeavy ETL architecture; requires extensive engineering
Stitch / TalendDeprecated or low investment in new connectors
RudderstackStrong CDP, weaker ETL ingestion for LMNT’s needs

Primary ETL → Fivetran

Use for revenue-critical, daily ingestion:

  • Shopify DTC + Wholesale

  • Amazon

  • Stripe

  • Ads

  • Walmart POS + Omni

    Why: Strongest reliability & connector quality.


Supporting ETL → Polytomic (ETL)

Use for high-volume or unsupported datasets:

  • Ware2Go

  • SPINS (future)

  • Any connector missing in Fivetran

    Why: Cost-efficient, flexible, rapid connector turnaround.


Activation → Polytomic (Reverse ETL)

Use for syncing Snowflake →

  • Klaviyo

  • Meta Ads

  • Google Ads

  • GA4

  • CRM

    Price: ~$400 per connector.


Custom Pipelines → Dagster (Optional)

For internal APIs or pipelines where pre-built connectors aren’t available.


5. Final Recommendation

Fivetran + Polytomic is the optimal ETL architecture for LMNT.