Rikiya Takehi
4th year Undergraduate Student at Waseda University

My research has revolved around IR, NLP, and ML. I am interested in building reliable AI systems, including LLMs, LLM Agents, RecSys, and retrievers.
I am an undergraduate student at Waseda University, supervised by Prof. Tetsuya Sakai. I am now also at Mixedbread. Previously, I was a guest researcher at NIST working with Dr. Ian Soboroff and Dr. Ellen Voorhees, and I have collaborated with Prof. Fernando Diaz of CMU LTI on retrieval rankings. I co-organize the Product Search and Recommendations Track at TREC.
My first two years of graduate studies will be fully funded by the Toyota PhD Fellowship.
News
scroll ↓
- Jan.2026: Co-first-authored full paper Retention-Driven Two-Sided Matching got accepted to ICLR 2026.
- Oct.2025: First authored full paper Diversity as Risk Minimization got accepted to WSDM 2026.
- Oct.2025: Released two open-source ColBERT models mxbai-edge-colbert-v0-17m and mxbai-edge-colbert-v0-32m. Tech report here.
- Aug.2025: My co-authored paper got accepted to CIKM 2025
- Aug.2025: Started research internship at Mixedbread.
- Jul.2025: Selected as a Toyota PhD Fellow: 2 yrs of full funding.
- Jul.2025: Gave an invited talk (w/ Prof. ChengXiang Zhai) at SIGIR 2025 eCOM Workshop invited by Dr. Tracy Holloway King.
- Jun.2025: Invited as a panelist at NTCIR 2025 with Prof. Maarten De Rijke, Prof. Mark Sanderson, Prof. Charles Clarke, & Prof. Ian Soboroff.
- Jun.2025: Gave an invited talk at EVIA 2025 about Using LLMs as Assistants for Building Test Collections invited by Prof. Charles Clarke, Prof. Noriko Kando, & Prof. Makoto Kato. Slides can be found here.
- Apr.2025: First authored full paper LLM-Assisted Relevance Assessments: When Should We Ask LLMs for Help? got accepted to SIGIR 2025!!
- Jan.2025: First authored full paper General Framework for Off-Policy Learning with Partially-Observed Reward got accepted to ICLR 2025.
- Nov.2024: Gave an invited talk at NII about Using LLMs as Assistants for Building Test Collections invited by Prof. Noriko Kando.
- Nov.2024: First authored paper LLM-Assisted Relevance Assessments: When Should We Ask LLMs for Help? preprint available on arXiv.
- Oct.2024: Started research internship at CyberAgent AI Lab. Algorithm Team.
- Aug.2024: Gave an invited talk at UMD College Park about Nugget-Based Evaluation and the Use of LLMs invited by Prof. Douglas Oard.
- Oct.2023: First-authored paper Open-Domain Dialogue Quality Evaluation: Deriving Nugget-level Scores from Turn-level Scores accepted to SIGIR AP 2023.
Publications
You can also find my articles on my Google Scholar profile.
- Diversification as Risk Minimization
Rikiya Takehi, Fernando Diaz, Tetsuya Sakai. 2025.
WSDM 2026.
arXiv - Beyond Match Maximization and Fairness: Retention-Objectified Two-Sided Matching
Rikiya Takehi*, Ren Kishimoto*, Koichi Tanaka, Masahiro Nomura, Riku Togashi, Yuta Saito. 2025.
ICLR 2026.
preprint - General Framework for Off-Policy Learning with Partially-Observed Reward
Rikiya Takehi, Kosuke Kawakami, Masahiro Asami, Yuta Saito. 2025.
ICLR 2025.
arXiv | OpenReview | presentation | poster - LLM‑Assisted Relevance Assessments: When Should We Ask LLMs for Help?
Rikiya Takehi, Ellen M. Voorhees, Tetsuya Sakai, and Ian Soboroff. 2025.
SIGIR 2025.
arXiv | slides | code - Open-Source LLM-based Relevance Assessment vs. Highly Reliable Manual Relevance Assessment: A Case Study
Tetsuya Sakai, Khant Myoe Rain, Rikiya Takehi, Sijie Tao, Youngin Song. 2025.
CIKM 2025.
proceedings - Open-Domain Dialogue Quality Evaluation: Deriving Nugget-level Scores from Turn-level Scores
Rikiya Takehi, Akihisa Watanabe, and Tetsuya Sakai. 2023.
SIGIR-AP 2023.
code | poster | slides | proceedings - Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report.
Rikiya Takehi, Benjamin Clavié, Sean Lee, Aamir Shakir. 2025.
Tech Report.
Tech Report | Blog | 17M ColBERT model | 32M ColBERT model - Objective-driven Calibrated Recommendations
Rikiya Takehi*, Koichi Tanaka*, Ren Kishimoto, Masahiro Nomura, Riku Togashi, Yuta Saito. 2025.
preprint
Experience
- Research Intern, Mixedbread
- Research Intern, CyberAgent AI Lab. Algorithm Team
- Guest Researcher, NIST Retrieval Group
- Research Intern, Hakuhodo Tech Inc.
Invited Talks
- SIGIR eCom Workshop 2025Product Search and Recommendations (with Prof. ChengXiang Zhai)
- EVIA 2025Using LLMs as Assistants for Building Large Test Collections
- National Institute of InformaticsUsing LLMs as Assistants for Building Test Collections (Trends and Problems of Test Collections)
- University of Maryland, College ParkNugget-Based Evaluation and the Use of LLMs
- NTCIR 2025 PanelistWith Profs. Maarten de Rijke (UvA), Mark Sanderson (RMIT), Charles Clarke (UWaterloo), and Ian Soboroff (NIST).