leminhnguyen's blog

Apr 28, 2025 6 min read Speech, Speaker-Darization

Speaker Diarization: From Traditional Methods to the Modern Models

Speaker Diarization, the task of answering “Who spoken when?” - is an crucial component in many speech processing systems. From meeting transcription to customer service call analysis, diarization allows to segment signal by speakers, making down-stream tasks like speech-to-text, emotion analysis, or intent identification much more effective.

Apr 4, 2025 4 min read NLP, Speech, Machine Learning

Why Entropy Matters in Machine Learning?

Low vs High Entropy Entropy is a powerful and fundamental concept that quietly drives some of the most effective algorithms in machine learning. From decision trees to deep neural networks, entropy plays a central role in helping models navigate uncertainty and make better predictions.

Mar 15, 2025 3 min read Speech, Automatic Speech Recognition

LoRA-Whisper: A Scalable and Efficient Solution for Multilingual ASR

1. Background & Motivation Automatic Speech Recognition (ASR) has made significant strides in recent years, particularly with the rise of large-scale multilingual models like OpenAI’s Whisper, Google USM, and Meta’s MMS.

Feb 1, 2025 2 min read NLP, FlashAttention

Understanding FlashAttention: Inner vs Outer Loop Optimization

Understanding FlashAttention: Inner vs Outer Loop Optimization FlashAttention is a groundbreaking optimization technique for computing attention in Transformer models. It drastically improves performance by reducing memory bottlenecks and utilizing GPU memory more efficiently.

Jan 11, 2025 4 min read NLP, Large Language Models

Adversarial Attacks on Large Language Models (LLMs)

Adversarial Attacks on Large Language Models (LLMs) Adversarial attacks on large language models (LLMs) involve manipulating inputs to deceive the model into generating harmful, biased, or incorrect outputs. These attacks exploit the vulnerabilities of LLMs, which rely on patterns in training data to generate responses.

Minh Nguyen Le

AI Engineer

HUST | Vbee | CMC