NLP

Why Do Language Models Hallucinate?

An analysis of why language models hallucinate — hallucinations arise from statistical pressures in training and evaluation procedures that reward guessing over acknowledging …

Sep 7, 2025 • 5 min read

Machine Learning

Why Entropy Matters in Machine Learning?

Understanding entropy and why it's a core concept in decision trees, neural networks, and loss functions like cross-entropy.

Apr 4, 2025 • 3 min read

Deep Learning

Understanding FlashAttention: Inner vs Outer Loop Optimization

FlashAttention is a groundbreaking optimization technique for computing attention in Transformer models, drastically improving GPU memory efficiency through inner vs outer loop …

Feb 1, 2025 • 1 min read

Deep Learning

Adversarial Attacks on Large Language Models (LLMs)

An overview of adversarial attacks on large language models (LLMs) — how manipulated inputs can deceive models into generating harmful or incorrect outputs, covering key attack …

Jan 11, 2025 • 3 min read

GLiNER

GLiNER: A Generalist Model for Named Entity Recognition using Bidirectional Transformers

A detailed summary of the GLiNER paper, introducing a lightweight, scalable, and highly effective model for open-type named entity recognition using bidirectional transformers with …

Nov 2, 2024 • 3 min read

Tts

Comparing batch vs layer normalization

The purpose of this post is just to understand the key difference between two types of well-known normalization techniques.

admin

• Mar 9, 2022 • 1 min read

No results found

NLP