Boosting Slate Recommendation Accuracy and Speed with HiGR

Global: Hierarchical Generative Model Boosts Slate Recommendation Accuracy and Speed

Researchers announced a new framework called HiGR that aims to improve slate recommendation on online platforms by addressing semantic entanglement in item tokenization and the inefficiency of sequential decoding. The study, posted on arXiv in December 2025, describes how HiGR integrates hierarchical planning with listwise preference alignment to generate ranked lists of items more effectively. Evaluations were conducted on a large‑scale commercial media platform, where the model demonstrated notable gains in both offline metrics and live user engagement.

Background and Motivation

Slate recommendation systems typically present users with a simultaneous list of items, yet many existing autoregressive models struggle with tangled semantic representations and slow inference. According to the authors, these challenges limit the ability to plan holistically for the entire slate, prompting the development of a more structured approach.

Hierarchical Tokenization

HiGR employs an auto‑encoder that combines residual quantization with contrastive constraints to convert items into semantically organized identifiers. This tokenization strategy is intended to give the generative component finer control over item semantics, reducing the overlap that can occur in conventional token schemes.

Two‑Stage Generation Process

The framework separates generation into a list‑level planning stage, which establishes a global intent for the slate, followed by an item‑level decoding stage that selects specific items to fulfill that intent. By decoupling these steps, the authors claim the model can plan more coherently while maintaining fast inference.

Preference Alignment Objective

To directly optimize slate quality, HiGR introduces a listwise preference alignment objective that leverages implicit user feedback. This objective aligns the generated list with observed user preferences without requiring explicit labels for each item.

Experimental Evaluation

Offline tests reported that HiGR outperformed state‑of‑the‑art methods by more than 10% in recommendation quality while achieving a fivefold increase in inference speed. In live A/B experiments, the model delivered a 1.22% rise in average watch time and a 1.73% increase in average video views compared with the incumbent system.

Implications and Future Directions

The results suggest that hierarchical planning and structured tokenization can substantially enhance both the efficiency and effectiveness of slate recommendation engines. The authors note that further research will explore cross‑domain applicability and the integration of explicit user feedback to refine the alignment objective.

This report is based on information from arXiv, licensed under Academic Preprint / Open Access. Based on the abstract of the research paper. Full text available via ArXiv.

Hierarchical Generative Model Boosts Slate Recommendation Accuracy and Speed