Tag: large language models
How Large Language Models Learn: Self-Supervised Training at Internet Scale
Large language models learn by predicting the next word in massive amounts of internet text. This self-supervised approach, powered by Transformer architectures, enables unprecedented scale and versatility, but it comes with costs, biases, and limitations that shape how these models are used today.
In-Context Learning Explained: How LLMs Learn from Prompts Without Training
In-context learning allows LLMs to adapt to new tasks using examples in the prompt, with no retraining needed. Discover how it works, its benefits and limitations, and its real-world applications in AI today.
Model Parallelism and Pipeline Parallelism in Large Generative AI Training
Pipeline parallelism enables training of massive generative AI models by splitting them across GPUs, overcoming memory limits. Learn how it works, why it's essential, and how it compares to other parallelization methods.
Emergent Abilities in NLP: When LLMs Start Reasoning Without Explicit Training
Large language models suddenly gain reasoning skills at certain model sizes, without being explicitly trained for them. This phenomenon, known as emergent abilities, is reshaping AI development and creating serious risks.
Red Teaming for Privacy: How to Test Large Language Models for Data Leakage
Learn how to test large language models for data leakage using red teaming techniques. Discover real-world risks, free tools like garak, legal requirements, and how companies are preventing privacy breaches.