Data Lab Tech
From Data Science and Data Engineering to MLOps and DevOps, I talk about all things data, occasionally going off on tangents or into off-topic subjects as well. I try to stay away from the AI trend, as there is already an overflow of content on this topic, but I do cover it from time to time, usually as a secondary topic.
Here's a bit about me as well. I worked for many years as an academic researcher in computer science, and I hold a PhD in this area. I mainly focused on information retrieval (search, recommender systems) and network science (graphs, hypergraphs, network analysis), but I also worked on information extraction (entity and relation extraction, knowledge graph construction), and on data analysis and visualization. Before I started doing YouTube, I was a senior data scientist working on the whole data lifecycle, from collecting and organizing data into a data lakehouse, to training classifiers and doing MLOps to bring them into production systems.
Migrating DuckLake Catalog from SQLite to PostgreSQL
Data Lab Infra - Part 5: Retrospective & MLOps - Part 2: Model Deployment
Data Lab Infra - Part 4: Core Services
Data Lab Infra - Part 3: Platform Setup with Terraform
Data Lab Infra - Part 2: Bootstrapping with Terraform
Data Lab Infra - Part 1: Architecture Design
MLOps: A/B Testing with MLflow, Kafka, and DuckLake
Economic Competition Networks
GraphRAG with KùzuDB
Automated Semantic Releases on GitHub
Automating Hugo Blog and Social Media with GitHub Actions
Data Lakehouse with dbt and DuckLake
PostgreSQL Maximalism - Extensions for Every Use Case - Part 4
PostgreSQL Maximalism - Extensions for Every Use Case - Part 3
PostgreSQL Maximalism - Extensions for Every Use Case - Part 2
PostgreSQL Maximalism - Extensions for Every Use Case - Part 1