Skip to content

Runs - Conversational Assistance 2020

ASCFDA_baseline

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ASCFDA_baseline
  • Participant: ASCFDA
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: bbfbb6ae59ac12f900b2a614e8020dc1
  • Run description: Our reranking model, based on T5, is trained with MS MARCO dataset.

ASCFDA_d2q_emb

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ASCFDA_d2q_emb
  • Participant: ASCFDA
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 83ad133ce9caab9c45453798f3808a09
  • Run description: Our reranking model, based on T5, is trained with MS MARCO dataset. We use Doc2Query and BM25 to expand the raw utterances, re-retrieve the passages and rerank them by the T5-based reranking method.

ASCFDA_qa

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ASCFDA_qa
  • Participant: ASCFDA
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Type: manual
  • Task: primary
  • MD5: 72d5efbb2a40d6989fe5e8f8e971251c
  • Run description: Our reranking model, based on T5, is trained with MS MARCO dataset. We use both manual canonical results and topic-turns as inputs of T5-QA model, and use the previous output for query expansion.

ASCFDA_rm3

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ASCFDA_rm3
  • Participant: ASCFDA
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: ae393ff447b50158e074b8e180e823cf
  • Run description: Our reranking model, based on T5, is trained with MS MARCO dataset. We use RM3 to expand the raw utterances, re-retrieve the passages and rerank them by the T5-based reranking method.

AUTO_BERT100

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: AUTO_BERT100
  • Participant: nova-search
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 5a909880b123209f5ad18d632335cffb
  • Run description: To maintain the conversational context across turns we used the automatically rewritten queries from CAsT Y2. Initial retrieval is done with the rewritten query using a QL model with LMD smoothing. After this we perform reranking using a BERT model finetuned on the MS MARCO dataset for the passage reranking task.

AUTO_T5_RRF

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: AUTO_T5_RRF
  • Participant: nova-search
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 1c4edb482d4d9e20fb14bd39084007d1
  • Run description: To maintain the conversational context across turns we use two query rewriting methods. The first one is a T5 model finetuned on the conversational query rewriting task using the CANARD dataset. The input to the T5 model is the concatenation of the current query and all of the previous queries, and the output of the model is the query rewritten for that turn. Our second method uses the automatically rewritten queries from CAsT Y2. Initial retrieval is done with both rewritten queries using a QL model with LMD smoothing. After this we perform reranking using a BERT model finetuned on the MS MARCO dataset for the passage reranking task. To obtain the final ranking we use RRF to fuse the BERT reranked results from both queries.

baselineQR

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: baselineQR
  • Participant: UvA.ILPS
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: b8533fb775842868c9926b2732c6bda3
  • Run description: PR: BM25 Anserini + BERT for passage ranking combined with the score from Roberta fine-tuned on MRQA formulated as a binary classification task (paragraph contains answer or doesn't)

castur_albert

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: castur_albert
  • Participant: USI
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: bc3171b367ac69ee318338db5f26d3f1
  • Run description: First, we perform query expansion by concatenating relevant raw utterances from history with the current raw one, based on an output of a model trained on CAsTUR dataset (Aliannejadi et al., 2020). We perform initial retrieval with Anserini Query Likelihood with Dirichlet smoothing (default parameters). Then we re-rank the passages with Albert, which is pretrained on MS MARCO passage retrieval task.

duth

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: duth
  • Participant: DUTH
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: c3f4edc2c1c8b3d9e3b591bda511e04a
  • Run description: First, we run a linguistic analysis of the given turn (query) and perform coreference resolution (AllenNLP) from previous queries, POS tagging (SpaCy), and named entity recognition. Then we rewrite the queries and perform retrieval (indri) by using weights. Nouns, verbs, adjectives, and adverbs from the current query weight 1.0 and nouns, adjectives, and adverbs from the previous 2 turns (queries) weight 0.5 and 0.25 respectively. No reranking is performed.

duth_arq

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: duth_arq
  • Participant: DUTH
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 78ca7ce3c379c5cb969e4cbec53a240a
  • Run description: We used the automatically rewritten utterances (queries) from CAsT Y2. First, we run a linguistic analysis of the given turn (query) and perform coreference resolution (AllenNLP) from previous queries, POS tagging (SpaCy), and named entity recognition. Then we rewrite the queries and perform retrieval (indri) by using weights. Nouns, verbs, adjectives, and adverbs from the current query weight 1.0 and nouns, adjectives, and adverbs from the previous 2 turns (queries) weight 0.5 and 0.25 respectively. No reranking is performed.

duth_manual

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: duth_manual
  • Participant: DUTH
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Type: manual
  • Task: primary
  • MD5: 9e0bc5b9ad69e0dd6cb7b6adbf747a7d
  • Run description: We used the manually rewritten utterances from CAsT Y2. We rewrite the queries and perform retrieval (indri) by using weights. Nouns, verbs, adjectives, and adverbs from the current query weight 1.0 and nouns, adjectives, and adverbs from the previous 2 turns (queries) weight 0.5 and 0.25 respectively. No reranking is performed.

grill_bmDuo

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: grill_bmDuo
  • Participant: grill
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Type: manual
  • Task: primary
  • MD5: 753011a389d33600c4f56b5bac33ba95
  • Run description: This run uses the manually resolved queries as well as provided canonical results. No additional rewriting is performed. Base retrieval using a tuned BM25 model. The resulting first pass retrieval is reranked with Mono + Duo BERT trained on MS MARCO.

grill_ctxDuo

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: grill_ctxDuo
  • Participant: grill
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Type: manual
  • Task: primary
  • MD5: 62c7ad7752f32e3d34888a2285e4b4c6
  • Run description: This run uses the manually resolved queries as well as provided canonical results. No additional rewriting is performed. Base retrieval is performed using the sequential dependence model on the current turn combined with a weighted QL model (using first turn and previous turn) as well as feedback from the canonical result provided for the previous turn. The resulting first pass retrieval is reranked with Mono + Duo BERT trained on MS MARCO.

grill_duoBART

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: grill_duoBART
  • Participant: grill
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 0323ac5e0056f2f0c23cb1e84f3f3789
  • Run description: This run uses the automatic rewrites as well as provided canonical results. Additional rewriting is performed using a BART rewriter trained on Y1. BM25 turned for the first pass. The resulting first pass retrieval is reranked with Mono + Duo BERT trained on MS MARCO.

grill_fuseDuo

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: grill_fuseDuo
  • Participant: grill
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 2f4513dfacdb7865a9c62d85689db63f
  • Run description: This run uses the automatic rewrites as well as provided canonical results. Additional rewriting is performed using a BART rewriter trained on Y1 and a simple fusion (combination of both rewrites is used). Base retrieval is performed using the sequential dependence model on the current turn combined with a weighted QL model (using first turn and previous turn (with rewrites)) as well as feedback from the canonical result provided for the previous turn. The resulting first pass retrieval is reranked with Mono + Duo BERT trained on MS MARCO.

h2oloo_RUN1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: h2oloo_RUN1
  • Participant: h2oloo
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: c5cd69f8a3692640b02bbf81bf108027
  • Run description: For each turn, we use word matching rule-base to extract the sentence from its previous manual canonical response (passage). We then concat the extracted sentence and current query to conduct query reformulation. At next turn, we use the historical reformulated query as context combined with the extracted sentence from the last manual canonical response to conduct query reformulation. In this run, we also fuse the result between retrieval and rerank stage.

h2oloo_RUN2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: h2oloo_RUN2
  • Participant: h2oloo
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: c69a9576b4dcf76ffff4f1bbb694fb12
  • Run description: For each turn, we use word matching rule-base to extract the sentence from its previous manual canonical response (passage). We then concat the extracted sentence and current query to conduct query reformulation. At next turn, we use the historical reformulated query as context combined with the extracted sentence from the last manual canonical response to conduct query reformulation. As for retrieval, we combine BERT embedding search and BM25 as first-stage retrieval and use T5 3B for the second stage reranker.

h2oloo_RUN3

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: h2oloo_RUN3
  • Participant: h2oloo
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 8e1aa44e0df21a3f1961c53af9a94071
  • Run description: This run uses CANARD dataset to facilitate two tasks regarding conversational query reformulation. Specifically, for the first task, we use concated raw historical queries as context to infer human resolved queries. For the second task, we concated human resolved historical queries in each dialogue. After that, we facilitate Masked Language Model on these dialogue context. We finetune the T5 model one these two task jointly and conduct conversational query reformulation as the first task with CAsT's raw queries with historical context. As for retrieval, we combine BERT embedding search and BM25 as first-stage retrieval and use T5 3B for the second stage reranker.

h2oloo_RUN4

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: h2oloo_RUN4
  • Participant: h2oloo
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: f123fca5fb65b3560e63a83ee5aee3c0
  • Run description: This is our automatic baseline method for which we train T5-base to conduct query reformulation by concatenating historical raw queries on CANARD dataset. As for retrieval, we combine BERT embedding search and BM25 as first-stage retrieval and use T5 3B for the second stage reranker.

HBKU_t2_1v1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HBKU_t2_1v1
  • Participant: HBKU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 7a305d6c9e2a798026ea88e7c7404f14
  • Run description: The methods used for this submission aims to resolve context in two phases. The first phase is to incorporate historical query context by extracting keywords from previous queries in a method based on last year's winning team's algorithm. The second phase incorporates context from top-k retrieved passages through extracting top-k keywords from these passages using TF-IDF. After that, a pre-trained BERT passage re-ranker was used. This run adds context from both historical queries and top-3 keywords from top-3 retrieved passages of the current query. After that, the list was re-ranked using a pre-trained BERT re-ranker trained on MS Marco dataset. Both historical query and passage context was used in the re-ranking phase.

HBKU_t2_1v1_mnl

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HBKU_t2_1v1_mnl
  • Participant: HBKU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Type: manual
  • Task: primary
  • MD5: 77b29cf1edd0bf5a7ad6c03b67c5ba5b
  • Run description: The methods used for this submission aims to resolve context in two phases. The first phase is to incorporate historical query context by extracting keywords from previous queries in a method based on last year's winning team's algorithm. The second phase incorporates context from top-k retrieved passages through extracting top-k keywords from these passages using TF-IDF. After that, a pre-trained BERT passage re-ranker was used. This run adds context from both historical queries and top-3 keywords from top-3 retrieved passages of the current query. After that, the list was re-ranked using a pre-trained BERT re-ranker trained on MS Marco dataset. Both historical query and passage context was used in the re-ranking phase.

HBKU_t2_1v2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HBKU_t2_1v2
  • Participant: HBKU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 81b135a0340dd5c97bded66b6ed2f756
  • Run description: The methods used for this submission aims to resolve context in two phases. The first phase is to incorporate historical query context by extracting keywords from previous queries in a method based on last year's winning team's algorithm. The second phase incorporates context from top-k retrieved passages through extracting top-k keywords from these passages using TF-IDF. After that, a pre-trained BERT passage re-ranker was used. This run adds context from both historical queries and top-3 keywords from top-3 retrieved passages of the current query. After that, the list was re-ranked using a pre-trained BERT re-ranker trained on MS Marco dataset. Only historical query context was used in the re-ranking phase.

HBKU_t2_1v2_mnl

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HBKU_t2_1v2_mnl
  • Participant: HBKU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Type: manual
  • Task: primary
  • MD5: 7f71df0913be15fd31e37ef4963fad35
  • Run description: The methods used for this submission aims to resolve context in two phases. The first phase is to incorporate historical query context by extracting keywords from previous queries in a method based on last year's winning algorithm. The second phase incorporates context from top-k retrieved passages through extracting top-k keywords from these passages using TF-IDF. After that, a pre-trained BERT passage re-ranker was used. This run adds context from both historical queries and top-3 keywords from top-3 retrieved passages of the current query. After that, the list was re-ranked using a pre-trained BERT re-ranker trained on MS Marco dataset. Only historical query context was used in the re-ranking phase.

HBKU_t5_1v1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HBKU_t5_1v1
  • Participant: HBKU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: a0fa22f2157b4210188a73caa6cda5cf
  • Run description: The methods used for this submission aims to resolve context in two phases. The first phase is to incorporate historical query context by extracting keywords from previous queries in a method based on last year's winning team's algorithm. The second phase incorporates context from top-k retrieved passages through extracting top-k keywords from these passages using TF-IDF. After that, a pre-trained BERT passage re-ranker was used. This run first categorizes queries into two categories: explicit and implicit. Explicit queries are ones that contain no pronouns while implicit queries contain at least one pronoun. Explicit queries are assumed to contain sufficient information and so only historical query context is used to expand the query. Implicit queries contain at least one pronoun and are assumed to need more clarification, so both historical queries and passage context is added. After that, the list was re-ranked using a pre-trained BERT re-ranker trained on MS Marco dataset. Both historical query and passage context was used in the re-ranking phase.

HBKU_t5_1v1_mnl

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HBKU_t5_1v1_mnl
  • Participant: HBKU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Type: manual
  • Task: primary
  • MD5: a50d2fc3c0a91b8bce862190048fb002
  • Run description: The methods used for this submission aims to resolve context in two phases. The first phase is to incorporate historical query context by extracting keywords from previous queries in a method based on last year's winning team's algorithm. The second phase incorporates context from top-k retrieved passages through extracting top-k keywords from these passages using TF-IDF. After that, a pre-trained BERT passage re-ranker was used. This run first categorizes queries into two categories: explicit and implicit. Explicit queries are ones that contain no pronouns while implicit queries contain at least one pronoun. Explicit queries are assumed to contain sufficient information and so only historical query context is used to expand the query. Implicit queries contain at least one pronoun and are assumed to need more clarification, so both historical queries and passage context is added. After that, the list was re-ranked using a pre-trained BERT re-ranker trained on MS Marco dataset. Both historical query and passage context was used in the re-ranking phase.

HBKU_t5_1v2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HBKU_t5_1v2
  • Participant: HBKU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: b9cd235bf4248a7d79eefc24fa3cdcc1
  • Run description: The methods used for this submission aims to resolve context in two phases. The first phase is to incorporate historical query context by extracting keywords from previous queries in a method based on last year's winning team's algorithm. The second phase incorporates context from top-k retrieved passages through extracting top-k keywords from these passages using TF-IDF. After that, a pre-trained BERT passage re-ranker was used. This run first categorizes queries into two categories: explicit and implicit. Explicit queries are ones that contain no pronouns while implicit queries contain at least one pronoun. Explicit queries are assumed to contain sufficient information and so only historical query context is used to expand the query. Implicit queries contain at least one pronoun and are assumed to need more clarification, so both historical queries and passage context is added. After that, the list was re-ranked using a pre-trained BERT re-ranker trained on MS Marco dataset. only historical query context was used in the re-ranking phase.

HBKU_t5_1v2_mnl

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HBKU_t5_1v2_mnl
  • Participant: HBKU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Type: manual
  • Task: primary
  • MD5: 6287c98c6838f8d7bde98673220d90d1
  • Run description: The methods used for this submission aims to resolve context in two phases. The first phase is to incorporate historical query context by extracting keywords from previous queries in a method based on last year's winning team's algorithm. The second phase incorporates context from top-k retrieved passages through extracting top-k keywords from these passages using TF-IDF. After that, a pre-trained BERT passage re-ranker was used. This run first categorizes queries into two categories: explicit and implicit. Explicit queries are ones that contain no pronouns while implicit queries contain at least one pronoun. Explicit queries are assumed to contain sufficient information and so only historical query context is used to expand the query. Implicit queries contain at least one pronoun and are assumed to need more clarification, so both historical queries and passage context is added. After that, the list was re-ranked using a pre-trained BERT re-ranker trained on MS Marco dataset. Only historical query context was used in the re-ranking phase.

hist_attention

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: hist_attention
  • Participant: USI
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: ed112a2cdd88361ca88ee36308d0c812
  • Run description: Conversational query understanding is done by expanding the current query with previous relevant queries, which are predicted by a model trained on CAsTUR dataset. Initial ranking is done by Anserini's QL (default params), followed by a re-ranking model that encodes the queries and passages with Albert, pretrained on MS MARCO passage ranking task. The encodings of the current utterance are combined with previous relevant ones with self-attentive sum component. The summed representation is then forwarded through a classifier to predict passage relevance.

hist_concat

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: hist_concat
  • Participant: USI
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 4031b9094e0ccfc0316dd09f504b3663
  • Run description: Conversational query understanding is done by expanding the current query with previous relevant queries, which are predicted by a model trained on CAsTUR dataset. Initial ranking is done by Anserini's QL (default params), followed by a re-ranking model that encodes the queries and passages with Albert, pretrained on MS MARCO passage ranking task. The encodings of the current utterance are concatenated with the most relevant query and the corresponding automatic canonical response from history. The representation is then forwarded through a classifier to predict passage relevance.

HPCLab-CNR-run1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HPCLab-CNR-run1
  • Participant: HPCLab-CNR
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 42a865f51addf95cdf38db73aaac924a
  • Run description: For this run, we manually labeled the utterances as SE, FT, PT, PR (labels are described in the external resources). If the utterance label is SE, no rewriting is applied. If the utterance label is FT, the automatic approach rewrites the utterance by using the topic extracted from the first utterance of the conversation. When the label is PT, the rewriting is performed using the topic extracted from the previous utterance in the conversation. The topics are extracted from utterances using the noun chunks (objects or subjects). When the utterance label is PR the keywords of the automatic canonical response (after stopwords removal) of the previous utterance are added. Every utterance is rewritten using the automatically-rewritten utterance of the previous turn so that the context is propagated through the conversation. INDRI is used for indexing and querying the CAST-2020 dataset. In particular, we index the two datasets (MS-MARCO and TREC CAR) by removing stopwords and using the Krovetz stemmer. As a querying method, we use the INDRI language model based on Dirichlet smoothing with parameter \mu = 2500. We also apply pseudo-relevance feedback (PRF) based on the RM3 algorithm. The query enrichment is done with 20 keywords taken from the top-20 results with mixing parameter \gamma = 0.5. Lastly, the passage re-ranking is performed using a model that fine-tunes the BERT base pre-trained model for re-ranking on the MS-MARCO passage retrieval dataset. INDRI retrieves 1000 results and all of them are used for the re-ranking step.

HPCLab-CNR-run2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HPCLab-CNR-run2
  • Participant: HPCLab-CNR
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 36c6c79fc2e25e566e22f86dfb31c5d1
  • Run description: This run uses raw utterances. The approach automatically rewrites the utterance by using the topics extracted from all previous utterances of the conversation. The topics are extracted from utterances using the noun chunks (objects or subjects). We use a list of cue phrases to detect a topic shift, when a topic shift occurs the list of topics to propagate is updated. INDRI is used for indexing and querying the CAST-2020 dataset. In particular, we index the two datasets (MS-MARCO and TREC CAR) by removing stopwords and using the Krovetz stemmer. As a querying method, we use the INDRI language model based on Dirichlet smoothing with parameter \mu = 2500. We also apply pseudo-relevance feedback (PRF) based on the RM3 algorithm. The query enrichment is done with 20 keywords taken from the top-20 results with mixing parameter \gamma = 0.5. Lastly, the passage re-ranking is performed using a model that fine-tunes the BERT base pre-trained model for re-ranking on the MS-MARCO passage retrieval dataset. INDRI retrieves 1000 results and all of them are used for the re-ranking step.

HPCLab-CNR-run3

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HPCLab-CNR-run3
  • Participant: HPCLab-CNR
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: e2d305d05c0eb4254bea125a274d5cc8
  • Run description: This run uses the raw utterances and the candidate responses for the automatic utterances provided by CAST. It leverages utterance manual labels (i.e., SE, FT, PT, PR) which are described in the external resources. No rewriting is applied if the label is SE. When the label is FT, the utterance rewriting is preformed with the topic extracted from the first utterance, while when it is PT the topic from the previous utterance is used. For utterances labeled as PR, the topic extracted from the automatic canonical response of the previous utterance is added, too. The topics are extracted from utterances using the noun chunks (objects or subjects in the utterances) that are recognized as named entities by TagMe (with threshold = 0.1). The same approach is applied to extract the topic from the automatic canonical responses. Every utterance is rewritten using the automatically-rewritten utterance of the previous turn so that the context is propagated through the conversation. INDRI is used for indexing and querying the CAST-2020 dataset. In particular, we index the two datasets (MS-MARCO and TREC CAR) by removing stopwords and using the Krovetz stemmer. As a querying method, we use the INDRI language model based on Dirichlet smoothing with parameter \mu = 2500. We also apply pseudo-relevance feedback (PRF) based on the RM3 algorithm. The query enrichment is done with 20 keywords taken from the top-20 results with mixing parameter \gamma = 0.5. Lastly, the passage re-ranking is performed using a model that fine-tunes the BERT base pre-trained model for re-ranking on the MS-MARCO passage retrieval dataset. INDRI retrieves 1000 results and all of them are used for the re-ranking step.

HPCLab-CNR-run4

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: HPCLab-CNR-run4
  • Participant: HPCLab-CNR
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 8e347219320cebcb93c1563f6a60fd37
  • Run description: This run uses raw utterances. The approach automatically rewrites the utterance by using the current and previous topics. The topics are extracted from utterances using the noun chunks (objects or subjects) and full verbs. We use a list of cue phrases to detect a topic shift. At the beginning of the conversation, the current topic is the topic extracted from the first utterance, when a topic shift is detected the current topic is updated. INDRI is used for indexing and querying the CAST-2020 dataset. In particular, we index the two datasets (MS-MARCO and TREC CAR) by removing stopwords and using the Krovetz stemmer. As a querying method, we use the INDRI language model based on Dirichlet smoothing with parameter \mu = 2500. We also apply pseudo-relevance feedback (PRF) based on the RM3 algorithm. The query enrichment is done with 20 keywords taken from the top-20 results with mixing parameter \gamma = 0.5. Lastly, the passage re-ranking is performed using a model that fine-tunes the BERT base pre-trained model for re-ranking on the MS-MARCO passage retrieval dataset. INDRI retrieves 1000 results and all of them are used for the re-ranking step.

humanQR

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: humanQR
  • Participant: UvA.ILPS
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Type: manual
  • Task: primary
  • MD5: f31fe6172bee0e176645f4f6cb831a7c
  • Run description: PR BM25 Anserini + BERT for passage ranking combined with the score from Roberta fine-tuned on MRQA formulated as a binary classification task (paragraph contains answer or doesn't)

ielab-bertAQ

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ielab-bertAQ
  • Participant: ielab
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 4bdd2477f76ae881f9b90b70b0e8f9a0
  • Run description: BERT based re-ranking of BM25+RM3 baseline. Top k=10 documents only are re-ranked using BERT. For each topic, for utterances beyond the first, the first query is appended.

ielab-bertPRFAQ

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ielab-bertPRFAQ
  • Participant: ielab
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: df4cb673c3b367b24ba8d3f60135193e
  • Run description: BERT based re-ranking of BM25+RM3 baseline. Top k=10 documents only are re-ranked using BERT. When re-ranking, the query and the first p=3 passages are embedded using BERT to create the embedding for the query. For each topic, for utterances beyond the first, the first query is appended.

ielab-bm25AQ

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ielab-bm25AQ
  • Participant: ielab
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: bc22231e86fe09d17bf2943ba6a2ba04
  • Run description: BM25+RM3 baseline. For each topic, for utterances beyond the first, the first query is appended.

ielab-bm25T5QLM

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ielab-bm25T5QLM
  • Participant: ielab
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 5eea924c92da0a84c094fb10c337e9cd
  • Run description: Generative QLM based on T5 from initial baseline retrieval using BM25+RM3. For each topic, for utterances beyond the first, the first query is appended.

man_polyu1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: man_polyu1
  • Participant: POLYU_SOME
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Type: manual
  • Task: primary
  • MD5: 22879358a9c05a84e8063a0dae562fd6
  • Run description: BM25+BERT rerank

PAS_BQUERY_MER

Results | Participants | Input | Summary | Appendix

  • Run ID: PAS_BQUERY_MER
  • Participant: CMU-LTI
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 487e2867c09bb661c4211d89fb7c93dd
  • Run description: For a given query, a term selection classifier first selects important terms from the conversation history relevant to the given query. The query along with the selected terms is used to retrieve 30 passages. From these passages, 3 phrases are selected (using off-the-shelf coreference resolution) as additional contextual terms. The terms selected by the classifier and the 3 phrases and the initial query itself is used for first-round of passage retrieval which is done using Indri. Finally, BERT (fine-tuned on MS-MARCO) is used for reranking the top 500 initially retrieved passages. The input to BERT are the passages, the original query, and terms selected by the classifier and the top 3 phrases selected from passages.

PAS_RQUERY_MER

Results | Participants | Input | Summary | Appendix

  • Run ID: PAS_RQUERY_MER
  • Participant: CMU-LTI
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: ff05cfe310a3663c3e5fbc83b2e8c8bf
  • Run description: For a given query, a POS tagger first selects important terms (adjectives, nouns, verbs) from the conversation history relevant to the given query. The query along with the selected terms is used to retrieve 30 passages. From these passages, 3 phrases are selected (using off-the-shelf coreference resolution) as additional contextual terms. The terms selected by the classifier and the 3 phrases and the initial query itself is used for first-round of passage retrieval which is done using Indri. Finally, BERT (fine-tuned on MS-MARCO) is used for reranking. The input to BERT are the passages, the original query, and terms selected using the POS tagger and the top 3 phrases selected from passages.

PASO_BQUERO_MER

Results | Participants | Input | Summary | Appendix

  • Run ID: PASO_BQUERO_MER
  • Participant: CMU-LTI
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 236c8581f077a2e04736b5431af628d5
  • Run description: Two different initial retrievals are done for a given query. First, a term selection classifier is used for selecting important terms for the conversation history. The selected terms along with the input query is used for retrieving a set of passages. In the next step, the query is used to retrieve 30 passages. From these passages, 3 phrases are selected (using off-the-shelf coreference resolution) as additional contextual terms. Both the two initial retrievals are reranked using BERT (fine-tuned on MS-MARCO). The two reranked lists are then merged into one.

quretecNoRerank

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: quretecNoRerank
  • Participant: UvA.ILPS
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: c2f9b6b2dc1b62f7321a9c71c59c0e8c
  • Run description: PR: BM25 Anserini QR input: current turn query, the answer from the previous turn only taken from the automatic canonical answer and the previous queries

quretecQR

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: quretecQR
  • Participant: UvA.ILPS
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: c3414182c1f9a79cbeb57a3eae0aec3e
  • Run description: PR: BM25 Anserini + BERT for passage ranking combined with the score from Roberta fine-tuned on MRQA formulated as a binary classification task (paragraph contains answer or doesn't) QR input: current turn query, the answer from the previous turn only taken from the automatic canonical answer and the previous queries

raw_polyu1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: raw_polyu1
  • Participant: POLYU_SOME
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: ea9d5f36fb2f6a8512677bea10f345d1
  • Run description: query rewriting + BM25 +Bert rerank

rewrite_albert

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: rewrite_albert
  • Participant: USI
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 0de3ed7093ff261f9aa0aeebe8b2dacd
  • Run description: Initial retrieval is performed with Anserini QL with Dirichlet smoothing with automatically rewritten queries provided by TREC organizers. Re-ranking step utilizes Albert, which is pretrained on MS MARCO passage re-ranking task.

T5_BERT100

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: T5_BERT100
  • Participant: nova-search
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: d32f02aada2a30c797e0906c7643d295
  • Run description: To maintain the conversational context across turns we finetuned a T5 model on the conversational query rewriting task using the CANARD dataset. The input to the T5 model is the concatenation of the current query and all of the previous queries, and the output of the model is the query rewritten for that turn. Initial retrieval is done with the rewritten query using a QL model with LMD smoothing. After this we perform reranking using a BERT model finetuned on the MS MARCO dataset for the passage reranking task.

umass_4prev_rm3

Results | Participants | Input | Summary | Appendix

  • Run ID: umass_4prev_rm3
  • Participant: UMass_CIIR
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 45392c98b8d4cf0f6711156cabe22932
  • Run description: In this run, we expand the current turn of each topic using the expansion terms that are extrated from the feedback documnets of 4 previous turns. We use the automatic runs as the feedback documents.

umass_curr_rm3

Results | Participants | Input | Summary | Appendix

  • Run ID: umass_curr_rm3
  • Participant: UMass_CIIR
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Type: manual
  • Task: primary
  • MD5: 8dc18ea29db29ac493d4d86d0afa7644
  • Run description: In this run we expand the current turn of the conversation with the terms from top retrieved documents for that turn. We use the provided automatic baseline as the top documents to extract expansion terms.

WatACBase

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: WatACBase
  • Participant: WaterlooClarke
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: 50c1a4e0bf57288ad51470a4afce4fd7
  • Run description: Utterance augmented with the initial utterance of the conversation, with coreference resolution.

WatACBaseRe

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: WatACBaseRe
  • Participant: WaterlooClarke
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: d6d96811d78e59a6a128f776c2225913
  • Run description: Utterance augmented with the initial utterance of the conversation, with coreference resolution. Reranked with Bert Passage reranking.

WatACGPT2Re

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: WatACGPT2Re
  • Participant: WaterlooClarke
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 1771f61b6b94c8611e2ca7f6485ce1b4
  • Run description: automatically rewritten utterances, reranked with Bert Passage reranker

WatACReAll

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: WatACReAll
  • Participant: WaterlooClarke
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/20/2020
  • Task: primary
  • MD5: 04399d0045e30b66502daa9e4482c8b6
  • Run description: Reranking the retrieved documents by the following runs: WatACBase, WatACBaseRe and WatACGPT2Re

WLU_ManUttOnly

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: WLU_ManUttOnly
  • Participant: WLU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Type: manual
  • Task: primary
  • MD5: e88f3d3c9a123aa3b038ab6d63842e17
  • Run description: This run was generated by encoding the queries and documents using a pre-trained BERT 768 model. This data was used by our deep learning model (trained on CAST & MARCO data) to rerank documents.

WLU_RawUttOnly

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: WLU_RawUttOnly
  • Participant: WLU
  • Track: Conversational Assistance
  • Year: 2020
  • Submission: 8/19/2020
  • Task: primary
  • MD5: fdbc364cb5f1545b810ad4b700b80a62
  • Run description: This run was generated by encoding the queries and documents using a pre-trained BERT 768 model. This data was used by our deep learning model (trained on CAST & MARCO data) to rerank documents