Hierarchical Generative Model Boosts Slate Recommendation Accuracy and Speed
Global: Hierarchical Generative Model Boosts Slate Recommendation Accuracy and Speed
Researchers announced a new framework called HiGR that aims to improve slate recommendation on online platforms by addressing semantic entanglement in item tokenization and the inefficiency of sequential decoding. The study, posted on arXiv in December 2025, describes how HiGR integrates hierarchical planning with listwise preference alignment to generate ranked lists of items more effectively. Evaluations were conducted on a large‑scale commercial media platform, where the model demonstrated notable gains in both offline metrics and live user engagement.
Background and Motivation
Slate recommendation systems typically present users with a simultaneous list of items, yet many existing autoregressive models struggle with tangled semantic representations and slow inference. According to the authors, these challenges limit the ability to plan holistically for the entire slate, prompting the development of a more structured approach.
Hierarchical Tokenization
HiGR employs an auto‑encoder that combines residual quantization with contrastive constraints to convert items into semantically organized identifiers. This tokenization strategy is intended to give the generative component finer control over item semantics, reducing the overlap that can occur in conventional token schemes.
Two‑Stage Generation Process
The framework separates generation into a list‑level planning stage, which establishes a global intent for the slate, followed by an item‑level decoding stage that selects specific items to fulfill that intent. By decoupling these steps, the authors claim the model can plan more coherently while maintaining fast inference.
Preference Alignment Objective
To directly optimize slate quality, HiGR introduces a listwise preference alignment objective that leverages implicit user feedback. This objective aligns the generated list with observed user preferences without requiring explicit labels for each item.
Experimental Evaluation
Offline tests reported that HiGR outperformed state‑of‑the‑art methods by more than 10% in recommendation quality while achieving a fivefold increase in inference speed. In live A/B experiments, the model delivered a 1.22% rise in average watch time and a 1.73% increase in average video views compared with the incumbent system.
Implications and Future Directions
The results suggest that hierarchical planning and structured tokenization can substantially enhance both the efficiency and effectiveness of slate recommendation engines. The authors note that further research will explore cross‑domain applicability and the integration of explicit user feedback to refine the alignment objective.
This report is based on information from arXiv, licensed under Academic Preprint / Open Access. Based on the abstract of the research paper. Full text available via ArXiv.
Ende der Übertragung