Data Engineering & Modern Data Stack
A modern data stack for delivering reliable, scalable analytics platforms to enterprise teams.
Core Competencies
Six pillars of delivery
A structured approach aligned with enterprise expectations for governance, reliability, and business impact.
Data Transformation
- dbt & Dataform modeling
- Medallion architecture
- Documentation and lineage
Data Ingestion
- Airbyte, dlt, Fivetran
- Incremental loading
- API and SaaS integration
Python Automation
- Custom connectors
- Operational tooling
- Data validation scripts
Orchestration
- Airflow and Kestra
- Retries and SLAs
- Workflow observability
Data Quality
- dbt tests and monitors
- Freshness and volume guards
- Incident management
Cloud Warehousing
- BigQuery and Snowflake
- Cost optimization (FinOps)
- Performance tuning
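To make the "Freshness and volume guards" item concrete, here is a minimal Python sketch of what such checks might look like. The function names, the six-hour freshness window, and the 50% volume tolerance are illustrative assumptions, not part of any specific tool listed here; in practice these checks are often expressed as dbt tests or monitors instead.

```python
from datetime import datetime, timedelta, timezone


def check_freshness(last_loaded_at, max_age, now=None):
    """Return True when the newest load is no older than max_age."""
    # `now` is injectable so the check is deterministic in tests.
    now = now or datetime.now(timezone.utc)
    return now - last_loaded_at <= max_age


def check_volume(row_count, expected, tolerance=0.5):
    """Return True when row_count is within +/- tolerance of expected."""
    lower = expected * (1 - tolerance)
    upper = expected * (1 + tolerance)
    return lower <= row_count <= upper


# Hypothetical values for illustration only.
now = datetime(2024, 1, 2, 12, 0, tzinfo=timezone.utc)
recent_load = datetime(2024, 1, 2, 9, 0, tzinfo=timezone.utc)
old_load = datetime(2024, 1, 1, 9, 0, tzinfo=timezone.utc)

print(check_freshness(recent_load, timedelta(hours=6), now))  # loaded 3 hours ago
print(check_freshness(old_load, timedelta(hours=6), now))     # loaded 27 hours ago
print(check_volume(400, expected=1000))                       # far below the expected volume
```

A failed guard would typically block downstream models or raise an alert, which is where the incident management step picks up.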
Tools & Technologies
Modern data stack
A scalable toolkit for enterprise-grade data delivery.
dbt
Analytics engineering framework to transform, test, and document data models using SQL.
BigQuery
Serverless cloud data warehouse optimized for large-scale analytics workloads.
Airbyte
Open-source data ingestion platform to extract and sync data from APIs, databases, and SaaS tools.
Kestra
Event-driven orchestration engine designed for modern data workflows.
Python
Programming language used for automation, ingestion, and custom data processing.
dlt
Python-based ingestion framework for incrementally loading APIs and databases.
Looker Studio
Cloud-based reporting tool for client-facing dashboards.
Airflow
Workflow orchestrator to schedule, monitor, and manage data pipelines.
Google Cloud Platform
Cloud platform hosting data infrastructure, analytics workloads, and security services.
Fivetran
Fully managed connectors ensuring reliable, low-maintenance data replication into warehouses.
Dataform
SQL-based modeling and orchestration tool integrated with BigQuery.
Snowflake
Cloud data warehouse enabling scalable, governed analytics across teams.
SQL
Query language for data modeling, analytics, and business logic.
Power BI
Business intelligence platform for operational and executive reporting.
Tableau
Advanced data visualization tool for exploratory analysis and dashboards.
Streamlit
Python framework for building interactive data applications and prototypes.
PostgreSQL
Relational database used for transactional systems and trusted reporting sources.
Docker
Containerization platform ensuring consistent and reproducible environments.
Git
Version control system for managing analytics and infrastructure codebases.
Elementary
dbt package for data observability, monitoring test results, and anomaly detection.
MongoDB
Document-oriented NoSQL database for flexible, semi-structured application data.
Redis
In-memory data store for caching, queues, and low-latency access patterns.
Metabase
Self-service analytics tool enabling fast access to data insights.
Funnel.io
Marketing data ingestion platform consolidating advertising and analytics sources.
Cloud Functions
Serverless compute for event-driven data processing and lightweight backend logic.
Cloud Run
Container-based compute service for deploying scalable APIs and background workers.
Ready to Build Modern Data Platforms
For Data and Analytics Engineering engagements, in Toulouse or remote.
Get in touch