SIGIR-AP 2024 Tutorial: Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

About this tutorial

Retrieval-enhanced machine learning (REML) refers to the use of information retrieval methods to support reasoning and inference in machine learning tasks. Although relatively recent, these approaches can substantially improve model performance. Reported benefits include improved generalization, knowledge grounding, scalability, freshness, attribution, interpretability, and on-device learning. To date, despite being influenced by work in the information retrieval community, REML research has predominantly been presented at natural language processing (NLP) conferences.

Our tutorial addresses this disconnect by introducing core REML concepts and synthesizing the literature from various domains in machine learning (ML), including but not limited to NLP. What distinguishes our approach is its consistent notation, which gives researchers a unified and expandable framework. The tutorial will be delivered in lecture format and is based on an existing manuscript, "Retrieval-Enhanced Machine Learning: Synthesis and Opportunities".

Schedule

Our tutorial is scheduled for December 9th from 14:15 to 17:30 (GMT+9).
Combined Slides: [Slides]

Time	Section	Presenter	In Manuscript
14:15 – 14:35	Section 1: Introduction	Fernando Diaz	Chapters 1–2
14:35 – 14:55	Section 2: Querying	Alireza Salemi	Chapter 3
14:55 – 15:05	Section 3: Searching	Alireza Salemi	Chapter 4
15:05 – 15:35	Section 4: Presentation & Consumption	Andrew Drozdov	Chapter 5
15:35 – 15:45	Q & A	All
15:45 – 16:00	Coffee Break
16:00 – 16:30	Section 5: Storing	To Eun Kim	Chapter 6
16:30 – 16:50	Section 6: Optimization	Hamed Zamani	Chapter 7
16:50 – 17:05	Section 7: Evaluation	Fernando Diaz	Chapter 8
17:05 – 17:20	Section 8: Future Direction & Conclusion	Fernando Diaz	Chapters 9–10
17:20 – 17:30	Q & A	All

BibTeX (Manuscript)

      
        @misc{kim2024retrievalenhancedmachinelearning,
          title={Retrieval-Enhanced Machine Learning: Synthesis and Opportunities}, 
          author={To Eun Kim and Alireza Salemi and Andrew Drozdov and Fernando Diaz and Hamed Zamani},
          year={2024},
          eprint={2407.12982},
          archivePrefix={arXiv},
          primaryClass={cs.LG},
          url={https://arxiv.org/abs/2407.12982}, 
        }

BibTeX (Tutorial Proposal)

      
        @inproceedings{10.1145/3673791.3698439, 
          author = {Diaz, Fernando and Drozdov, Andrew and Kim, To Eun and Salemi, Alireza and Zamani, Hamed}, 
          title = {Retrieval-Enhanced Machine Learning: Synthesis and Opportunities}, 
          year = {2024}, 
          isbn = {9798400707247}, 
          publisher = {Association for Computing Machinery}, 
          address = {New York, NY, USA}, 
          url = {https://doi.org/10.1145/3673791.3698439}, 
          doi = {10.1145/3673791.3698439}, 
          booktitle = {Proceedings of the 2024 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region}, 
          pages = {299--302}, 
          numpages = {4}, 
          keywords = {information retrieval, machine learning}, 
          location = {Tokyo, Japan}, 
          series = {SIGIR-AP 2024} 
        }


Fernando Diaz¹	Andrew Drozdov²	To Eun Kim¹	Alireza Salemi³	Hamed Zamani³
