Text Summarization
Definition
Text summarization produces a shorter version of a source document while retaining its core information. Two paradigms exist: extractive summarization selects and concatenates the most important sentences from the original text; abstractive summarization generates new sentences that may not appear verbatim in the source, similar to how a human would paraphrase. Modern abstractive summarizers use encoder-decoder transformer architectures (such as BART, T5, and Pegasus) fine-tuned on pairs of documents and human-written summaries. Evaluation metrics include ROUGE (n-gram overlap with reference summaries) and BERTScore (semantic similarity).
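To make the extractive paradigm concrete, here is a minimal sketch of a classic frequency-based extractive summarizer: each sentence is scored by how frequent its words are across the whole document, and the top-scoring sentences are returned in their original order. This is an illustrative heuristic, not how production transformer models work.

```python
import re
from collections import Counter

def extractive_summary(text, num_sentences=2):
    """Score each sentence by the average document-wide frequency of
    its words, then return the top-scoring sentences in original order."""
    sentences = re.split(r'(?<=[.!?])\s+', text.strip())
    freq = Counter(re.findall(r'[a-z]+', text.lower()))

    def score(sentence):
        tokens = re.findall(r'[a-z]+', sentence.lower())
        return sum(freq[t] for t in tokens) / (len(tokens) or 1)

    # Rank sentence indices by score, keep the best, restore document order.
    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]),
                    reverse=True)
    chosen = sorted(ranked[:num_sentences])
    return ' '.join(sentences[i] for i in chosen)

doc = ("The billing system charged the customer twice. "
       "The customer emailed support about the duplicate charge. "
       "Support confirmed the duplicate charge and issued a refund. "
       "The weather that day was sunny.")
print(extractive_summary(doc, num_sentences=2))
```

Note how the off-topic weather sentence is dropped, but the selected sentences are copied verbatim — the limitation that motivates abstractive approaches.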
Why It Matters
Summarization is critical for customer support operations where agents must quickly understand prior conversation history before responding. Auto-generated summaries of chat transcripts allow agents to grasp a 20-message conversation in 30 seconds instead of reading every message. For knowledge management, summarization condenses lengthy technical documents into concise articles, and for executive reporting, it distills thousands of customer feedback items into actionable themes.
How It Works
Abstractive summarizers use a sequence-to-sequence architecture: the encoder processes the full source document into a contextual representation, and the decoder generates the summary token-by-token using cross-attention to attend to relevant source segments. Models like BART pre-train with a denoising objective (reconstruct corrupted text) that develops strong text understanding and generation capabilities. At inference, beam search explores multiple generation paths to produce fluent, coherent summaries. Length penalties control summary verbosity.
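The decoding step above can be sketched in miniature. The toy "decoder" below is just a hard-coded table of next-token log-probabilities (in a real summarizer this would be a transformer forward pass), but the search logic — keeping the top-k partial hypotheses each step and normalizing finished hypotheses by length raised to a penalty exponent — mirrors how beam search with a length penalty works:

```python
import math

# Toy "decoder": maps a prefix to (next_token, log_prob) candidates.
# A real model would compute these with cross-attention over the source.
NEXT = {
    (): [("refund", math.log(0.6)), ("the", math.log(0.4))],
    ("refund",): [("issued", math.log(0.9)), ("<eos>", math.log(0.1))],
    ("the",): [("refund", math.log(0.7)), ("<eos>", math.log(0.3))],
    ("refund", "issued"): [("<eos>", math.log(1.0))],
    ("the", "refund"): [("<eos>", math.log(1.0))],
}

def beam_search(beam_width=2, max_len=4, length_penalty=1.0):
    beams = [((), 0.0)]          # (token tuple, cumulative log-prob)
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, lp in beams:
            for tok, tok_lp in NEXT.get(seq, [("<eos>", 0.0)]):
                if tok == "<eos>":
                    # Length-normalize so longer hypotheses are not
                    # unfairly punished for accumulating log-probs.
                    norm = (len(seq) or 1) ** length_penalty
                    finished.append((seq, (lp + tok_lp) / norm))
                else:
                    candidates.append((seq + (tok,), lp + tok_lp))
        # Keep only the top beam_width partial hypotheses.
        beams = sorted(candidates, key=lambda x: x[1], reverse=True)[:beam_width]
        if not beams:
            break
    return max(finished, key=lambda x: x[1])[0]

print(beam_search())  # → ('refund', 'issued')
```

Raising `length_penalty` above 1.0 favors longer summaries; lowering it favors terser ones — this is the knob the section refers to as controlling verbosity.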
[Diagram: Text Summarization — Extractive vs Abstractive, showing how each paradigm transforms a source document]
Real-World Example
A CRM platform automatically summarizes completed support conversations before closing each ticket. Agents reviewing escalated cases see a 3-sentence summary ('Customer reported billing discrepancy on 2026-02-15. Issue was a duplicate charge. Refund of $49.99 was processed on 2026-02-16.') before reading the full thread. This reduced case review time by 55% and enabled managers to review 3x more cases per day.
Common Mistakes
- ✕ Using extractive summarization for conversational text—extracted sentences often lose coherence without surrounding context
- ✕ Evaluating only with ROUGE scores—ROUGE misses semantic correctness and can reward fluent but inaccurate summaries
- ✕ Over-compressing long documents—crucial detail is lost when compression ratio exceeds practical limits
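The ROUGE pitfall above is easy to demonstrate. The sketch below computes ROUGE-1 recall (the fraction of reference unigrams that also appear in the candidate, with clipped counts); the example strings are invented for illustration. A summary that reverses the meaning of the reference still scores almost as high as a correct one, because the metric only counts word overlap:

```python
from collections import Counter

def rouge1_recall(candidate, reference):
    """ROUGE-1 recall: fraction of reference unigrams found in the
    candidate, with per-word counts clipped as in the standard metric."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum(min(cand[w], count) for w, count in ref.items())
    return overlap / sum(ref.values())

reference = "refund of 49.99 was processed for the duplicate charge"
accurate  = "a refund of 49.99 was processed for the duplicate charge"
wrong     = "refund of 49.99 was refused for the duplicate charge"  # opposite meaning

print(round(rouge1_recall(accurate, reference), 2))  # → 1.0
print(round(rouge1_recall(wrong, reference), 2))     # → 0.89
```

Despite stating that the refund was refused rather than processed, the wrong summary scores 0.89 — which is why ROUGE should be paired with semantic metrics like BERTScore or human factuality checks.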
Related Terms
Natural Language Generation (NLG)
Natural Language Generation (NLG) is the NLP subfield concerned with automatically producing coherent, fluent, and contextually appropriate text from data, structured inputs, or internal representations.
Reading Comprehension
Reading comprehension is the NLP task of answering questions about a given passage by locating or generating the answer from within the text, serving as the core capability behind document-grounded chatbots and RAG systems.
Text Classification
Text classification automatically assigns predefined labels to text documents—such as topic, urgency, language, or intent—enabling large-scale categorization of unstructured content without manual review.
Transformer Encoder
The transformer encoder is a neural network architecture that processes entire input sequences bidirectionally using self-attention, producing rich contextual representations of each token that power state-of-the-art NLP models.
BERT
BERT (Bidirectional Encoder Representations from Transformers) is a transformer-based language model pre-trained on massive text corpora that revolutionized NLP by providing rich contextual word representations that dramatically improved nearly every language task.