The SIGIR 2024 Tutorial:
Robust Information Retrieval


Yu-An Liu¹,	Ruqing Zhang¹,	Jiafeng Guo¹,	Maarten de Rijke²

¹CAS Key Lab of Network Data Science and Technology, ICT, CAS, University of Chinese Academy of Sciences, ²University of Amsterdam

Sunday July 14th 1:30 PM - 5:00 PM (EDT) @ South American A

About this tutorial

Beyond effectiveness, the robustness of an information retrieval (IR) system is increasingly attracting attention. When deployed, a critical technology such as IR should not only deliver strong performance on average but also have the ability to handle a variety of exceptional situations. In recent years, research into the robustness of IR has seen significant growth, with numerous researchers offering extensive analyses and proposing myriad strategies to address robustness challenges.

In this tutorial, we first provide background information covering the basics and a taxonomy of robustness in IR. Then, we examine adversarial robustness and out-of-distribution (OOD) robustness within IR-specific contexts, extensively reviewing recent progress in methods to enhance robustness. The tutorial concludes with a discussion on the robustness of IR in the context of large language models (LLMs), highlighting ongoing challenges and promising directions for future research. This tutorial aims to generate broader attention to robustness issues in IR, facilitate an understanding of the relevant literature, and lower the barrier to entry for interested researchers and practitioners.

Slides

Section 1: Introduction
Section 2: Preliminaries
Section 3: Adversarial robustness
Section 4: Out-of-distribution robustness
Section 5: Robust IR in the age of LLMs
Section 6: Challenges and future directions

Schedule

Time	Section	Presenter
13:30 - 13:50	Section 1: Introduction	Maarten de Rijke
13:50 - 14:10	Section 2: Preliminaries	Yu-An Liu
14:10 - 15:00	Section 3: Adversarial robustness	Yu-An Liu
15:00 - 15:30	30min coffee break
15:30 - 16:20	Section 4: Out-of-distribution robustness	Yu-An Liu
16:20 - 16:30	Section 5: Robust IR in the age of LLMs	Yu-An Liu
16:30 - 16:50	Section 6: Challenges and future directions	Maarten de Rijke
16:50 - 17:00	Q & A	All

Survey

Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective (liu et al., 2024)

Benchmark

BestIR: Benchmark of robust information retrieval (Liu et al. 2024, BestIR)
Beir: A heterogenous benchmark for zero-shot evaluation of information retrieval models (Thakur et al. 2021, BEIR)

Perspective Papers

The probability ranking principle in IR (Robertson et al. 1977, Probability ranking principle)
A Game Theoretic Analysis of the Adversarial Retrieval Setting (Basat et al. 2017, PRP is sub-optimal)
Competitive Search (Kurland et al. 2022, SIGIR, Competitive search)
Ranking-Incentivized Document Manipulations for Multiple Queries (Nachimovsky et al. 2024, ICTIR, Topic-oriented competitive search)

Reading List

A curated list of papers related to robustness in IR can be found at Awesome Robustness in Information Retrieval.

The tutorial extensively covers papers highlighted in bold.

Section 3: Adversarial robustness

3.1 Adversarial attacks

3.1.0 Classification of adversarial attack tasks

Adversarial retrieval attack

Black-box Adversarial Attacks against Dense Retrieval Models: A Multi-view Contrastive Learning Method (Liu et.al. 2023, black-box, dense retrieval attack)
Ranking-Incentivized Quality Preserving Content Modification (Gregory et.al. 2020)
Boosting Big Brother: Attacking Search Engines with Encodings (Boucher et.al. 2023, encoding attack)
MAWSEO: Adversarial Wiki Search Poisoning for Illicit Online Promotion (Lin et.al. 2024, adversarial revisions)
Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation (Long et.al. 2024, backdoor attack)
Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models (Parry et.al. 2024, attacking T5)

Adversarial ranking attack

PRADA: Practical Black-Box Adversarial Attacks against Neural Ranking Models (Wu et.al. 2022, black-box, word substitution)
Order-Disorder: Imitation Adversarial Attacks for Black-box Neural Ranking Models (Liu et.al. 2022, black-box, trigger)
One word at a time: adversarial attacks on retrieval models (Raval and Verma 2020, white-box)
Adversarial Semantic Collisions (Song et.al. 2020, white-box)
Bert rankers are brittle: A study using adversarial document perturbations (Wang et.al. 2022, white-box)
TRAttack: Text Rewriting Attack Against Text Retrieval (Song et.al. 2022, rewriting attack, matching model)
Towards Imperceptible Document Manipulations against Neural Ranking Models (Chen et.al. 2023, black-box, prompt)
Multi-granular Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2024, multi-granular attack)

Topic-oriented adversarial retrieval/ranking attack

Topic-oriented Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2023, black-box, TARA task)
Poisoning Retrieval Corpora by Injecting Adversarial Passages (Zhong et.al. 2023, dense retrieval attack)

3.1.1 Steal knowledge from black-box models

Surrogate model training

PRADA: Practical Black-Box Adversarial Attacks against Neural Ranking Models (Wu et.al. 2022)
Order-Disorder: Imitation Adversarial Attacks for Black-box Neural Ranking Models (Liu et.al. 2022)

3.1.2 Identify vulnerable positions in documents

Pre-defined position

Order-Disorder: Imitation Adversarial Attacks for Black-box Neural Ranking Models (Liu et.al. 2022)
Adversarial Semantic Collisions (Wu et.al. 2022)
Bert rankers are brittle: A study using adversarial document perturbations (Wang et.al. 2022, white-box)
Topic-oriented Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2023, black-box, TARA task)
Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models (Parry et.al. 2024, attacking T5)

Output-guided position

Towards Imperceptible Document Manipulations against Neural Ranking Models (Chen et.al. 2023, black-box, prompt)
One word at a time: adversarial attacks on retrieval models (Raval and Verma 2020, white-box)
TRAttack: Text Rewriting Attack Against Text Retrieval (Song et.al. 2022, rewriting attack, matching model)
Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation (Long et.al. 2024, backdoor attack)

Gradient-guided position

PRADA: Practical Black-Box Adversarial Attacks against Neural Ranking Models (Wu et.al. 2022)
Topic-oriented Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2023, black-box, TARA task)
Black-box Adversarial Attacks against Dense Retrieval Models: A Multi-view Contrastive Learning Method (Liu et.al. 2023, black-box, dense retrieval attack)
Poisoning Retrieval Corpora by Injecting Adversarial Passages (Zhong et.al. 2023, dense retrieval attack)
Multi-granular Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2024, multi-granular attack)

3.1.3 Add Perturbation to identified positions

3.1.3.1 Perturbation type

Word substitution

Black-box Adversarial Attacks against Dense Retrieval Models: A Multi-view Contrastive Learning Method (Liu et.al. 2023, black-box, dense retrieval attack)
PRADA: Practical Black-Box Adversarial Attacks against Neural Ranking Models (Wu et.al. 2022, black-box, word substitution)
Multi-granular Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2024, multi-granular attack)
One word at a time: adversarial attacks on retrieval models (Raval and Verma 2020, white-box)
Poisoning Retrieval Corpora by Injecting Adversarial Passages (Zhong et.al. 2023, dense retrieval attack)

Trigger sentence

Order-Disorder: Imitation Adversarial Attacks for Black-box Neural Ranking Models (Liu et.al. 2022, black-box, trigger)
Adversarial Semantic Collisions (Song et.al. 2020, white-box)
Bert rankers are brittle: A study using adversarial document perturbations (Wang et.al. 2022, white-box)
Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models (Parry et.al. 2024, attacking T5)

Multi-granular

Multi-granular Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2024, multi-granular attack)
Topic-oriented Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2023, black-box, TARA task)

Encoding error

Boosting Big Brother: Attacking Search Engines with Encodings (Boucher et.al. 2023, encoding attack)

Grammatical error

Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation (Long et.al. 2024, backdoor attack)

3.1.3.2 Perturb strategy

Static: greedy search

Poisoning Retrieval Corpora by Injecting Adversarial Passages (Zhong et.al. 2023, dense retrieval attack)
Adversarial Semantic Collisions (Song et.al. 2020, white-box)
PRADA: Practical Black-Box Adversarial Attacks against Neural Ranking Models (Wu et.al. 2022, black-box, word substitution)
Order-Disorder: Imitation Adversarial Attacks for Black-box Neural Ranking Models (Liu et.al. 2022, black-box, trigger)
Black-box Adversarial Attacks against Dense Retrieval Models: A Multi-view Contrastive Learning Method (Liu et.al. 2023, black-box, dense retrieval attack)

Dynamic: reinforcement learning

Topic-oriented Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2023, black-box, TARA task)
Multi-granular Adversarial Attacks against Black-box Neural Ranking Models (Liu et.al. 2024, multi-granular attack)

3.2 Adversarial defenses

3.2.1 Empirical defense

Data augmentation

Dealing with textual noise for robust and effective BERT re-ranking (Chen et al. 2023)

Traditional adversarial training

A Study on FGSM Adversarial Training for Neural Retrieval (Lupart and Clinchant 2023)
IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models (Wang et al. 2017, IRGAN)
Adversarial Sampling and Training for Semi-Supervised Information Retrieval (Park and Chang 2019, AdvIR)
Adversarial Retriever-Ranker for dense text retrieval (Zhang et al. 2022, AR2)
Towards Robust Ranker for Text Retrieval (Yucheng et al. 2022, R2ANKER)

Theory-guided adversarial training

Perturbation-Invariant Adversarial Training for Neural Ranking Models: Improving the Effectiveness-Robustness Trade-Off (Liu et.al. 2024, Perturbation-invariance theory)

3.2.2 Certified defense

Certified robustness

Certified Robustness to Word Substitution Ranking Attack for Neural Ranking Models (Wu et.al. 2022, Certified Top-K robustness)

3.2.3 Attack detection

Perplexity-based detection

Adversarial Semantic Collisions (Wu et.al. 2022)

Language-based detection

Order-Disorder: Imitation Adversarial Attacks for Black-box Neural Ranking Models (Liu et.al. 2022)

Learning-based detection

Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection (Chen et.al. 2023)

Section 4: Out-of-distribution robustness

4.1 OOD generalizability on unforeseen documents

4.1.1 Adaptation to new corpus

Data augmentation

GPL: Generative pseudo labeling for unsupervised domain adaptation of dense retrieval (Wang et al. 2022, GPL)
Unsupervised dense information retrieval with contrastive learning (Izacard et al. 2021, Contriever)
InPars: Data Augmentation for Information Retrieval using Large Language Models (Bonifacio et al. 2022, InPars)
Data augmentation for sample efficient and robust document ranking (Anand et al. 2023)
Data augmentation and transfer learning for brain tumor detection in magnetic resonance imaging (Anaya-Isaza et al. 2022)
HypeR: Multitask Hyper-Prompted Training Enables Large-Scale Retrieval Generalization (Cai et al. 2023)
DUQGen: Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation (Chandradevan et al. 2024)
Cross-domain augmentation networks for click-through rate prediction (Chen et al. 2023, CDAnet)
Promptagator: Few-shot dense retrieval from 8 examples (Dai et al. 2022, PROMPTAGATOR)
Augmenting zero-shot dense retrievers with plug-in mixture-of-memories (Ge et al. 2023, MoMA)
InRanker: Distilled Rankers for Zero-shot Information Retrieval (Laitz et al. 2024, InRanker)
Domain Adaptation for Dense Retrieval and Conversational Dense Retrieval through Self-Supervision by Meticulous Pseudo-Relevance Labeling (Li and Gaussier 2024, DoDress)
Embedding-based zero-shot retrieval through query generation (Liang et al. 2020)
Challenges in generalization in open domain question answering (Liu et al. 2022)
Zero-shot neural passage retrieval via domain-targeted synthetic question generation (Ma et al. 2021)
Text and code embeddings by contrastive pre-training (Neelakantan et al. 2022)
Data augmentation for neural machine translation using generative language model (Oh et al. 2023)
Learning to retrieve passages without supervision (Ram et al. 2022, Spider)
Towards robust neural retrieval models with synthetic pre-training (Reddy et al. 2021)
Questions are all you need to train a dense passage retriever (Sachan et al.)

Domain modeling

COCO-DR: Combating distribution shifts in zero-shot dense retrieval with contrastive and distributionally robust learning (Yu et al. 2022, COCO-DR)
Improving Retrieval in Theme-specific Applications using a Corpus Topical Taxonomy (Kang et al. 2024, ToTER)
Learning list-level domain-invariant representations for ranking (Xian et al. 2023)
Zero-shot dense retrieval with momentum adversarial domain invariant representations (Xin et al. 2022, MoDIR)
BERM: Training the balanced and extractable representation for matching to improve generalization ability of dense retrieval (Xu et al. 2023, BERM)
Disentangled modeling of domain and relevance for adaptable dense retrieval (Zhan et al., DDR)

Architectural modifications

DESIRE-ME: Domain-Enhanced Supervised Information Retrieval Using Mixture-of-Experts (Kasela et al. 2024, DESIRE-ME)
Out-of-domain semantics to the rescue! zero-shot hybrid retrieval models (Tao Chen et al. 2022)
From distillation to hard negative sampling: Making sparse neural ir models more effective (Formal et al. 2022)
Zero-shot retrieval with search agents and hybrid environments (Huebscher et al. 2022)
Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders (Lee et al. 2023)

Scaling up the model capacity

Large dual encoders are generalizable retrievers (Ni et al. 2022)
Ernie-search: Bridging cross-encoder with dual-encoder via self on-the-fly distillation for dense passage retrieval (Lu et al. 2022)

4.1.2 Updates to a corpus

Continual learning for dense retrieval

L2R: Lifelong Learning for First-stage Retrieval with Backward-Compatible Representations (Cai et al. 2023, L$^2$R)

Continual learning for generative retrieval

Continual learning for generative retrieval over dynamic corpora (Chen et al. 2023, CLEVER)
Corpusbrain++: A continual generative pre-training framework for knowledge-intensive language tasks (Guo et al. 2024, CorpusBrain++)
Incdsi: incrementally updatable document retrieval (Kishore et al. 2023, IncDSI)
DSI++: Updating transformer memory with new documents (Mehta et al. 2023, DSI++)
Continually Updating Generative Retrieval on Dynamic Corpora (Yoon et al. 2023)

4.2 OOD generalizability on unforeseen queries

4.2.1 Query variation

Self-teaching

CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos (Zhuang et al. 2022, CBST)
Contrastive fine-tuning improves robustness for neural rankers (Ma et al. 2021)
Towards Robust Dense Retrieval via Local Ranking Alignment (Chen et al. 2022, RoDR)
Typos-aware bottlenecked pre-training for robust dense retrieval (Zhuang et al. 2023, ToRoDer)

Contrastive learning

Analysing the Robustness of Dual Encoders for Dense Retrieval Against Misspellings (Sidiropoulos et al. 2022, DRCL)
MIRS: [MASK] Insertion Based Retrieval Stabilizer for Query Variations (Liu et al. 2023, MIRS)
Contrastive fine-tuning improves robustness for neural rankers (Ma et al. 2021)
Dealing with Typos for BERT-based Passage Retrieval and Ranking (Zhuang et al. 2021, DRTA)

Hybrid training

Typo-robust representation learning for dense retrieval (Tasawong et al. 2023, DST)
Towards Robust Neural Rankers with Large Language Model: A Contrastive Training Approach (Pan et al. 2023)
Noise-robust dense retrieval via contrastive alignment post training (Campos et al. 2023, CAPOT)
Improving the Robustness of Dense Retrievers Against Typos via Multi-Positive Contrastive Learning (Sidiropoulos et al. 2024)

4.2.2 Unseen query type

Are Neural Ranking Models Robust? (Wu et al. 2022)
Cross domain regularization for neural ranking models using adversarial learning (Cohen et al. 2018)
Learning to Jointly Transform and Rank Difficult Queries (Bigdeli et al. 2024)
Ms-shift: An analysis of ms marco distribution shifts on neural retrieval (Lupart et al. 2023, MS-Shift)
Contrastive fine-tuning improves robustness for neural rankers (Ma et al. 2021)
Simple entity-centric questions challenge dense retrievers (Sciavolino et al. 2021)

BibTeX

@inproceedings{liu2024robust,
author = {Liu, Yu-An and Zhang, Ruqing and Guo, Jiafeng and de Rijke, Maarten},
title = {Robust Information Retrieval},
year = {2024},
booktitle = {SIGIR},
}

The SIGIR 2024 Tutorial: Robust Information Retrieval