View a PDF of the paper titled Multi-Personality Generation of LLMs at Decoding-time, by Rongxin Chen and 4 other authors
View PDF
HTML (experimental)
Abstract:Multi-personality generation for LLMs, enabling simultaneous embodiment of multiple personalization attributes, is a fundamental challenge. Existing retraining-based approaches are costly and poorly scalable, while decoding-time methods often rely on external models or heuristics, limiting flexibility and robustness. In this paper, we propose a novel Multi-Personality Generation (MPG) framework under the decoding-time combination paradigm. It flexibly controls multi-personality without relying on scarce multi-dimensional models or extra training, leveraging implicit density ratios in single-dimensional models as a “free lunch” to reformulate the task as sampling from a target strategy aggregating these ratios. To implement MPG efficiently, we design Speculative Chunk-level based Rejection sampling (SCR), which generates responses in chunks and parallelly validates them via estimated thresholds within a sliding window. This significantly reduces computational overhead while maintaining high-quality generation. Experiments on MBTI personality and Role-Playing demonstrate the effectiveness of MPG, showing improvements up to 16%-18%. Code and data are available at this https URL .
Submission history
From: Rongxin Chen [view email]
[v1]
Mon, 27 Oct 2025 09:45:11 UTC (732 KB)
[v2]
Mon, 17 Nov 2025 07:41:03 UTC (737 KB)
[v3]
Tue, 13 Jan 2026 13:22:05 UTC (738 KB)
[v4]
Thu, 15 Jan 2026 09:29:50 UTC (738 KB)


![[2511.01891] Multi-Personality Generation of LLMs at Decoding-time Measuring Intelligence Efficiency of Local AI](https://skytik.cc/wp-content/uploads/2025/11/Measuring-Intelligence-Efficiency-of-Local-AI-768x448.png)