Footer

Logo

Resources

  • Rust Tiếng Việt
  • /archives
  • /series
  • /tags
  • Status

me@duyet.net

  • About
  • LinkedIn
  • Resume
  • Projects

© 2026 duyet.net | Sr. Data Engineer

DuyệtData Engineering
HomeAboutPhotosInsightsCV
Thu Jan 01 2026 00:00:00 GMT+0000 (Coordinated Universal Time)AI
Pushing Frontier AI to Its Limits

Reflect on what I'm thinking and doing in this LLM era

Fri Nov 01 2024 00:00:00 GMT+0000 (Coordinated Universal Time)Data
ClickHouse Rust UDFs

In Data Platform System with ClickHouse, rather than extracting data from ClickHouse for processing in external systems, we can perform transformations directly within ClickHouse itself. ClickHouse can call any external executable program or script to process data. My idea is using custom **User-Defined Functions (UDFs) written in Rust** to handle data transformations between tables.

Sun Jun 23 2024 00:00:00 GMT+0000 (Coordinated Universal Time)Data
ReplicatedReplacingMergeTree

Learn how to set up and manage ReplicatedReplacingMergeTree in ClickHouse on Kubernetes. This comprehensive guide covers cluster setup with ClickHouse Operator, data replication, performance tuning, and best practices for high availability deployments.

Sat Jun 01 2024 00:00:00 GMT+0000 (Coordinated Universal Time)Data
ReplacingMergeTree

My favorite ClickHouse table engine is `ReplacingMergeTree`. The main reason is that it is similar to `MergeTree` but can automatically deduplicate based on columns in the `ORDER BY` clause, which is very useful.

Fri May 31 2024 00:00:00 GMT+0000 (Coordinated Universal Time)Data
MergeTree

After starting this series ClickHouse on Kubernetes, you can now configure your first single-node ClickHouse server. Let's dive into creating your first table and understanding the basic concepts behind the ClickHouse engine, its data storage, and some cool features

Wed Mar 27 2024 00:00:00 GMT+0000 (Coordinated Universal Time)Data
Monitoring ClickHouse on Kubernetes

Complete guide to monitoring ClickHouse on Kubernetes. Learn about built-in dashboards, Prometheus + Grafana setup, powerful system tables for monitoring queries, and the ClickHouse Monitoring UI dashboard. Includes practical examples, essential monitoring queries, and best practices for production observability.

Tue Mar 26 2024 00:00:00 GMT+0000 (Coordinated Universal Time)Data
ClickHouse SELECT Advances

Dynamic column selection (also known as a `COLUMNS` expression) allows you to match some columns in a result with a re2 regular expression.

Wed Mar 13 2024 00:00:00 GMT+0000 (Coordinated Universal Time)Data
ClickHouse on Kubernetes

Complete guide to deploying ClickHouse on Kubernetes using the Altinity ClickHouse Operator. Learn how to set up your first single-node cluster, configure persistent storage, manage users, and customize ClickHouse versions. Includes practical examples and best practices from production experience managing clusters with trillions of rows.

Fri Feb 09 2024 00:00:00 GMT+0000 (Coordinated Universal Time)Story
2023 - A Year of Moving

So, I hadn't really planned on writing a summary post for this year, as lazy as I am, but somehow, here we are.

Sat Sep 09 2023 00:00:00 GMT+0000 (Coordinated Universal Time)Rust
Apache OpenDAL in Rust to Access Any Kind of Data Services

OpenDAL is a data access layer that allows users to easily and efficiently retrieve data from various storage services in a unified way such as S3, FTP, FS, Google Drive, HDFS, etc. They has been rewritten in Rust for the Core and have a binding from many various language like Python, Node.js, C, etc..

See more posts