Learning Semi-Interpretable Prototypes for Multi-label Text Classification

[Submitted on 14 Oct 2025 (v1), last revised 18 Dec 2025 (this version, v3)]

ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification, by Utsav Kumar Nareti and 4 other authors

Abstract: The rapid growth of user-generated text across digital platforms has intensified the need for interpretable models capable of fine-grained text classification and explanation. Existing prototype-based models offer intuitive explanations but typically operate at coarse granularity (sentence or document level) and fail to address the multi-label nature of real-world text classification. We propose ProtoSiTex, a semi-interpretable framework designed for fine-grained multi-label text classification. ProtoSiTex employs a dual-phase alternate training strategy: an unsupervised prototype discovery phase that learns semantically coherent and diverse prototypes, and a supervised classification phase that maps these prototypes to class labels. A hierarchical loss function enforces consistency across subsentence, sentence, and document levels, enhancing interpretability and alignment. Unlike prior approaches, ProtoSiTex captures overlapping and conflicting semantics using adaptive prototypes and multi-head attention. We also introduce a benchmark dataset of hotel reviews annotated at the subsentence level with multiple labels. Experiments on this dataset and two public benchmarks (binary and multi-class) show that ProtoSiTex achieves state-of-the-art performance while delivering faithful, human-aligned explanations, establishing it as a robust solution for semi-interpretable multi-label text classification.
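
To make the idea of mapping prototypes to labels across subsentence, sentence, and document levels more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the cosine similarity, the linear prototype-to-label weights, the max-pooling between levels, and all names (`prototype_activations`, `label_scores`, `aggregate`, the toy vectors) are assumptions chosen for illustration. The dual-phase training and the hierarchical loss are not modeled here.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def prototype_activations(segment, prototypes):
    """Similarity of one subsentence embedding to every prototype."""
    return [cosine(segment, p) for p in prototypes]


def label_scores(activations, weights):
    """Map prototype activations to one independent probability per label.

    A sigmoid per label (rather than a softmax) lets several labels be
    active at once, which is what multi-label classification requires.
    """
    scores = []
    for w_row in weights:
        z = sum(w * a for w, a in zip(w_row, activations))
        scores.append(1.0 / (1.0 + math.exp(-z)))
    return scores


def aggregate(child_scores):
    """Max-pool per-label probabilities from one level up to the next (assumed rule)."""
    return [max(col) for col in zip(*child_scores)]


# Hypothetical toy data: 2 prototypes, 2 labels, 2-dimensional embeddings.
prototypes = [[1.0, 0.0], [0.0, 1.0]]
weights = [[4.0, -4.0], [-4.0, 4.0]]  # label 0 follows prototype 0, label 1 follows prototype 1

# A document is a list of sentences; a sentence is a list of subsentence embeddings.
document = [
    [[1.0, 0.0], [0.0, 1.0]],  # one sentence whose two subsentences touch different labels
    [[1.0, 0.1]],
]

sentence_scores = []
for sentence in document:
    sub_scores = [label_scores(prototype_activations(seg, prototypes), weights)
                  for seg in sentence]
    sentence_scores.append(aggregate(sub_scores))

doc_scores = aggregate(sentence_scores)
predicted = [i for i, s in enumerate(doc_scores) if s > 0.5]
print(predicted)
```

In this toy setup, the first sentence contains one subsentence close to each prototype, so both labels are active at the sentence and document level, while each subsentence on its own carries a single label. Tracing a document-level label back to the subsentence and prototype that produced its maximum is one simple way to read an explanation off such a model.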

Submission history

From: Utsav Nareti
[v1] Tue, 14 Oct 2025 13:59:28 UTC (1,531 KB)
[v2] Thu, 27 Nov 2025 19:20:01 UTC (965 KB)
[v3] Thu, 18 Dec 2025 11:14:07 UTC (966 KB)
