From Text to Recommendations: LLM-Guided Interest Tokenization via T5 and Cross-Attention in Generative Models

Shanghai Jiao Tong University
AI3602 Data Mining Group5

*Indicates Equal Contribution

Demo of our proposed model.

Abstract

ETEGRec is a SOTA end-to-end generative recommender system that integrates item tokenization and recommendation generation. However, its latent item tokenization process fails to take into account the high-level semantic information of user interests, which can lead to suboptimal recommendations. To address this, we introduce a brand new interest fusion mechanism that leverages user interest text embeddings to enhance the recommendation process. We propose a recommendation framework that leverages a large language model (LLM) to extract structured user interests from purchase history and item metadata. A multi-layer cross-attention mechanism fuses interest embeddings with item sequences, enhancing semantic relevance modeling. Training uses cached batch processing for consistency, while inference employs asynchronous updates for real-time performance. This approach improves recommendation accuracy with efficient user interest integration.

Results

We evaluate our work on the Amazon Reviews'23 Industrial and Scientific dataset. The evaluation results of ETEGRec on the test set are as follows:

Result visualization