Papers
Enhancing Extreme Multi-Label Text Classification: Addressing Challenges in Model, Data, and Evaluation
We enhance multi-label classification models from the perspectives of model, data, and evaluation, using a BiEncoder-CrossEncoder-based label ranking model, active learning, and ChatGPT.
Dan Li, Zi Long Zhu, Janneke van de Loo, Agnés Masip Gómez, Vikrant Yadav, Georgios Tsatsaronis, Zubair Afzal
Extending Label Aggregation Models with a Gaussian Process to Denoise Crowdsourcing Labels
Label aggregation (LA) is the task of inferring a high-quality label for an example from multiple noisy labels generated by either …
Dan Li, Maarten de Rijke
PDF
Code
Slides
Unsupervised Dense Retrieval for Scientific Articles
We build a semantic search engine for scientific articles. The major challenge is that no labeled data is available for training or testing. We apply Generative Pseudo Labeling, a state-of-the-art unsupervised dense retrieval method that generates high-quality pseudo training labels.
Dan Li, Vikrant Yadav, Zubair Afzal, Georgios Tsatsaronis
PDF
Poster
Video
CrowdGP: A Gaussian Process Model for Inferring Relevance from Crowd Annotations
Test collection has been a crucial factor for developing information retrieval systems. Constructing a test collection requires …
Dan Li, Maarten de Rijke
PDF
Code
Slides
Video
Effective collection construction for information retrieval evaluation and optimization
The availability of test collections in Cranfield paradigm has significantly benefited the development of models, methods and tools in …
Dan Li
PDF
Source Document
APS: An active PubMed search system for technology assisted reviews
Systematic reviews constitute the cornerstone of Evidence-based Medicine. They can provide guidance to medical policy-making by …
Dan Li, Panos Zafeiriadis, Evangelos Kanoulas
PDF
Video
Query resolution for conversational search with limited supervision
In this work we focus on multi-turn passage retrieval as a crucial component of conversational search. One of the key challenges in …
Nikos Voskarides, Dan Li, Pengjie Ren, Maarten de Rijke
PDF
Code
Video
Bayesian optimization for optimizing retrieval systems
The effectiveness of information retrieval systems heavily depends on a large number of hyperparameters that need to be tuned. …
Dan Li, Evangelos Kanoulas
PDF
Slides
Studying topical relevance with evidence-based crowdsourcing
Information Retrieval systems rely on large test collections to measure their effectiveness in retrieving relevant documents. While the …
Oana Inel, Giannis Haralabopoulos, Dan Li, Christophe Van Gysel, Zoltán Szlávik, Elena Simperl, Evangelos Kanoulas
PDF
Code
Technology assisted reviews: Finding the last few relevant documents by asking yes/no questions to reviewers
The goal of a technology-assisted review is to achieve high recall with low human effort. Continuous active learning algorithms have …
Jie Zou, Dan Li, Evangelos Kanoulas
PDF