Runs - Health Misinformation 2021¶
all_use_sup_cre¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: all_use_sup_cre
- Participant: DigiLab
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
52659ee5c32f45bd055be01d0bd6678f
- Run description: Run 7: This automatic run was created using a rank fusion based on RRF of the individual models used to create the i) usefulness (5 individual models), ii) supportiveness (3 individual models), and iii) credibility (2 individual models).
baselineBM25¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: baselineBM25
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: auto
- Task: main
- MD5:
eae994bd8547ab2af61f6372c1e4d95f
- Run description: Anserinis default BM25.
bm25¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: bm25
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: auto
- Task: main
- MD5:
4aaeeeaa7b75356abd4d4a48441a5b86
- Run description: Pyserini's Default BM25
bm25_rob_rf¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: bm25_rob_rf
- Participant: DigiLab
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
13a4087bb7894504bfaa8a28e34a9853
- Run description: Run 1: Baseline run. This automatic run was created using a rank fusion based on RRF of three models: i) usefulness, created using a default BM25, ii) supportiveness, created using a RoBERTa large model fine-tuned on the FEVER+SciFact corpus, and iii) credibility, created using a random forest model trained on the Microsoft Credibility dataset.
bow_sup_cred¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: bow_sup_cred
- Participant: DigiLab
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
891eec36e5c09a81ff21107fc3aea426
- Run description: Run 2: This automatic run was created using a rank fusion based on RRF of three models: i) usefulness, created using a combined default BM25 with a fine-tuned BM25 model using known item search with query and description been generated using transfer learning from language models, ii) supportiveness, created using a combined rank of three transformer-based models fine-tuned on the FEVER+SciFact corpus, and iii) credibility, created using a random forest model trained on the Microsoft Credibility dataset combined with a list of credible sites.
citius.r1¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r1
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: manual
- Task: main
- MD5:
eb4216d22a4bb77699dc838323a55815
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 documents using a hand-crafted expression generated from the description and the stance fields.
citius.r10¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r10
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: manual
- Task: main
- MD5:
bb81d2d704f795cb3dce910a6d6f4390
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 docs using a hand-crafted expression generated from the description and the stance field + sentence similarity between passages of the top 100 docs and the same hand-crafted expression using a RoBERTa Base model. Finally, we reordered the top 100 documents using CombSUM ranking fusion technique and all three scores.
citius.r2¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r2
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: manual
- Task: main
- MD5:
526f0f45180a3ab31cc497c221bc6f4b
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 docs using the query field + sentence similarity between passages of the top 100 docs and a hand-crafted expression generated from the description and the stance fields using a RoBERTa Large model. Finally, we reorder the top 100 documents using the CombSUM ranking fusion technique and all three scores.
citius.r3¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r3
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: manual
- Task: main
- MD5:
451b13e581a40c33d1189a14ec806d39
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 docs using the query field + sentence similarity between passages of the top 100 docs and a hand-crafted expression generated from the description and the stance fields using RoBERTa Base model. Finally, we reordered the top 100 documents using CombSUM ranking fusion technique and all three scores.
citius.r4¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r4
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: manual
- Task: main
- MD5:
63b5e82473e3ed28be9577031a38a5c3
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 docs using the query field + sentence similarity between passages of the top 100 docs and a hand-crafted expression generated from the description and the stance fields and its query variations using RoBERTa Large model. Finally, we reordered the top 100 documents using CombSUM ranking fusion technique and all three scores.
citius.r5¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r5
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: manual
- Task: main
- MD5:
d26f23444b291438bc86a0d8a3c95cc8
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 docs using the query field + sentence similarity between passages of the top 100 docs and a hand-crafted expression generated from the description and the stance fields and its query variations using RoBERTa Base model. Finally, we reordered the top 100 documents using CombSUM ranking fusion technique and all three scores.
citius.r6¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r6
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: manual
- Task: main
- MD5:
7a9caa3fc45c208b5d42c243488e9434
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 docs using the query field + sentence similarity between passages of the top 100 docs and a hand-crafted expression generated from the description and the stance field using RoBERTa Large model. Before performing the sentence similarity step, we apply a passage cleaning phase based on NLP unsupervised techniques. Finally, we reordered the top 100 documents using CombSUM ranking fusion technique and all three scores.
citius.r7¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r7
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: manual
- Task: main
- MD5:
5cc98a403480f8136906a62e86929fb2
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 docs using the query field + passage reliability classifier of the top 100 docs trained with 2019 and 2020 data in the form correct sentence + passage + label. Finally, we reordered the top 100 documents using the Borda Count ranking fusion technique and all three scores.
citius.r8¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r8
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
62acf8a73a2ee21eb6be92638b497c5f
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 docs using the query field + passage reliability classifier of the top 100 docs trained with 2019 and 2020 data in the form query + passage + label. Finally, we reordered the top 100 documents using the Borda Count ranking fusion technique and all three scores.
citius.r9¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: citius.r9
- Participant: CiTIUS
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: manual
- Task: main
- MD5:
5a581a5f1e272266fd9d90115e119338
- Run description: Initial BM25 retrieval based on the query field + passage reranking of the top 100 docs using a hand-crafted expression generated from the description and the stance field + sentence similarity between passages of the top 100 docs and the same hand-crafted expression using a RoBERTa Large model. Finally, we reordered the top 100 documents using CombSUM ranking fusion technique and all three scores.
lin_use_sup_rf¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: lin_use_sup_rf
- Participant: DigiLab
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
42728fbbf230774478932487f88678df
- Run description: Run 5: This automatic run was created using a rank fusion based on RRF of three models: i) usefulness, created using a combined BoW model with three transformed-based language models trained on the MS MARCO corpus, ii) supportiveness, created using a combined rank of three transformer-based models fine-tuned on the FEVER+SciFact corpus, and iii) credibility, create using a random forest model trained on the Microsoft Credibility dataset.
mdt5¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: mdt5
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: auto
- Task: main
- MD5:
e346edfa6eb6dd49e2cc14537b814000
- Run description: Pyserini's Default BM25. MedMonoT5/DuoT5 using Description only
mdt5_r¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: mdt5_r
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: manual
- Task: main
- MD5:
cca9961fdad60c3ff1e302a3909e9c0b
- Run description: Pyserini's Default BM25. MedMonoT5/DuoT5 using Description and Stance
mlm_sup_cred¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: mlm_sup_cred
- Participant: DigiLab
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
08e64a177aa31f73ba6b2560f98b77f2
- Run description: Run 3: This automatic run was created using a rank fusion based on RRF of three models: i) usefulness, created using a combination of three transformed-based language models trained on the MS MARCO corpus, ii) supportiveness, created using a combined rank of three transformer-based models fine-tuned on the FEVER+SciFact corpus, and iii) credibility, created using a random forest model trained on the Microsoft Credibility dataset combined with a list of credible sites.
mt5¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: mt5
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: auto
- Task: main
- MD5:
e4338585930a63484a87f590a3fb1442
- Run description: Pyserini's Default BM25. MedMonoT5 using Description only
mt5_r¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: mt5_r
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: manual
- Task: main
- MD5:
c855e8ad6635900e3cbb3d1afe73f438
- Run description: Pyserini's Default BM25. MedMonoT5 using Description and Stance
upv_bm25¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_bm25
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
549b3611315c7ebf9eb4d1e39805e346
- Run description: It is a baseline that is obtained by using pyserini bm25.
upv_fuse_10¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_fuse_10
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
aad92770286ac574baf690995c69d08b
- Run description: It is a fused of results by cos similarity by bio SBERT, bm25 and credibility Kullback divergence similarity from the base Roberta credibility classifier.
upv_fuse_2¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_fuse_2
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
f836bfe3963010fc08854bfce6c2ce56
- Run description: We calculate the cos similarity between documents (limited with 20 sentences) and the description by using Bio Sentence Transformer. The result is fused of bm25 and the cos similarity by using reciprocal rank fusion.
upv_fuse_3¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_fuse_3
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
5e697f4599e033073735ad4488222031
- Run description: We define a model that measures the credibility of an article which is trained with Roberta base model and we set up reference article as satisfying 4 criteria of credibility. We then calculate the cosine similarity between the document and reference vector. Lastly, we fused the results of bm-25 and the cos similarities.
upv_fuse_4¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_fuse_4
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
c6f9bd7393318abcd85b3033a6b2d353
- Run description: We define a model that measures the credibility of an article which is trained with Roberta base model and we set up reference article as satisfying 4 criteria of credibility. We then calculate the kullback-divergence score between the document and reference vector. Lastly, we fused the results of bm-25 and the cos similarities.
upv_fuse_5¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_fuse_5
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
af0e3cf82b98e6eebbba97d0532a506a
- Run description: We define a model that measures the credibility of an article which is trained with Roberta large model and we set up reference article as satisfying 4 criteria of credibility. We then calculate the cos similarity score between the document and reference vector. Lastly, we fused the results of bm-25 and the cos similarities.
upv_fuse_6¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_fuse_6
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
05995987ec18520d765fc3597adae05d
- Run description: We define a model that measures the credibility of an article which is trained with Roberta large model and we set up reference article as satisfying 4 criteria of credibility. We then calculate the Kullback Divergence score between the document and reference vector. Lastly, we fused the results of bm-25 and the kullback divergence score.
upv_fuse_7¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_fuse_7
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
28fb5d99694866f814a582a1e2f74c58
- Run description: It is a fused of results by cos similarity by bio SBERT, bm25 and credibility cos similarity from the large Roberta credibility classifier.
upv_fuse_8¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_fuse_8
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
2c59c03416503b4b560c827e07fbe88c
- Run description: It is a fused of results by cos similarity by bio SBERT, bm25 and credibility Kullback divergence similarity from the large Roberta credibility classifier.
upv_fuse_9¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: upv_fuse_9
- Participant: UPV
- Track: Health Misinformation
- Year: 2021
- Submission: 8/31/2021
- Type: auto
- Task: main
- MD5:
e47f4f91564c53a73b163f49c0ef1099
- Run description: It is a fused of results by cos similarity by bio SBERT, bm25 and credibility cos similarity from the base Roberta credibility classifier.
use_rob_cred¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: use_rob_cred
- Participant: DigiLab
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
92626b95faf276cf7dcc5c44882ed552
- Run description: Run 4: This automatic run was created using a rank fusion based on RRF of three models: i) usefulness, created using a combined BoW model with three transformed-based language models trained on the MS MARCO corpus, ii) supportiveness, created using a RoBERTa large model fine-tuned on the FEVER+SciFact corpus, and iii) credibility, created using a random forest model trained on the Microsoft Credibility dataset combined with a list of credible sites.
use_sup_cred¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: use_sup_cred
- Participant: DigiLab
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
804880792d09d7d31bb583aae6778055
- Run description: Run 6: This automatic run was created using a rank fusion based on RRF of three models: i) usefulness, created using a combined BoW model with three transformed-based language models trained on the MS MARCO corpus, ii) supportiveness, created using a combined rank of three transformer-based models fine-tuned on the FEVER+SciFact corpus, and iii) credibility, created using a random forest model trained on the Microsoft Credibility dataset combined with a list of credible sites.
vera0¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: vera0
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: manual
- Task: main
- MD5:
e39b7425e9b5756ebf2d51a4bee8c69e
- Run description: Pyserini's Default BM25. Vera - label prediction only
vera_mdt5_0.5¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: vera_mdt5_0.5
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: manual
- Task: main
- MD5:
d1bbd06f09411b658ff0fd285aef70a1
- Run description: Pyserini's Default BM25. Linear combination of mono-duo-T5 with Vera (duo1, 0.5)
vera_mdt5_0.95¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: vera_mdt5_0.95
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: manual
- Task: main
- MD5:
582abbf4859c8a7402d873d59826feff
- Run description: Pyserini's Default BM25. Linear combination of mono-duo-T5 with Vera (duo1, 0.95)
vera_mt5_0.5¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: vera_mt5_0.5
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: manual
- Task: main
- MD5:
a0d43e3e208ef172467d0dd17f2e4427
- Run description: Pyserini's Default BM25. Linear combination of mono-T5 with Vera (mono, 0.5)
vera_mt5_0.95¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: vera_mt5_0.95
- Participant: h2oloo
- Track: Health Misinformation
- Year: 2021
- Submission: 9/3/2021
- Type: manual
- Task: main
- MD5:
175f51667279d9ce992891471064ed1d
- Run description: Pyserini's Default BM25. Linear combination of mono-T5 with Vera (mono, 0.95)
watbm25¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: watbm25
- Participant: Waterloo_Cormack
- Track: Health Misinformation
- Year: 2021
- Submission: 8/30/2021
- Type: auto
- Task: main
- MD5:
51254f49ca93354638c5d30a48ba283e
- Run description: BM25, no relevance feedback.
watbm25p¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: watbm25p
- Participant: Waterloo_Cormack
- Track: Health Misinformation
- Year: 2021
- Submission: 8/30/2021
- Type: auto
- Task: main
- MD5:
59a878796b44f8ceddce89bd1489129b
- Run description: BM25, additional search term "pubmed" added; no relevance feedback.
watgoog¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: watgoog
- Participant: Waterloo_Cormack
- Track: Health Misinformation
- Year: 2021
- Submission: 8/30/2021
- Type: auto
- Task: main
- MD5:
2cd6a92a74bf8f3307f159603f9ea32d
- Run description: Logistic regression, top Google hits as training docs.
watgoogp¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: watgoogp
- Participant: Waterloo_Cormack
- Track: Health Misinformation
- Year: 2021
- Submission: 8/30/2021
- Type: auto
- Task: main
- MD5:
6dc2d9bfb3bd25c7fbb8e93350113ee2
- Run description: Logistic regression, top Google hits as training docs. "Pubmed" as additional search term.
watmed¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: watmed
- Participant: Waterloo_Cormack
- Track: Health Misinformation
- Year: 2021
- Submission: 8/30/2021
- Type: auto
- Task: main
- MD5:
a056990f607e319dbf9c3d06bb783d1b
- Run description: Logistic regression, top medline BM25 hits as training docs.
watrrfall¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: watrrfall
- Participant: Waterloo_Cormack
- Track: Health Misinformation
- Year: 2021
- Submission: 8/30/2021
- Type: auto
- Task: main
- MD5:
65d2a9b32c0133628d6b1df6def1359a
- Run description: Reciprocal rank fusion of everything
watrrfg¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: watrrfg
- Participant: Waterloo_Cormack
- Track: Health Misinformation
- Year: 2021
- Submission: 8/30/2021
- Type: auto
- Task: main
- MD5:
086a32cdd9118e564450dd0c1ead4a80
- Run description: Reciprocal rank fusion of two Google-seeded classifiers
watrrfm¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: watrrfm
- Participant: Waterloo_Cormack
- Track: Health Misinformation
- Year: 2021
- Submission: 8/30/2021
- Type: auto
- Task: main
- MD5:
b5efe126d6978c91447ddb0a52f1b557
- Run description: Fusion watbm25 and watmed
watrrfnp¶
Results
| Participants
| Input
| Summary
| Appendix
- Run ID: watrrfnp
- Participant: Waterloo_Cormack
- Track: Health Misinformation
- Year: 2021
- Submission: 8/30/2021
- Type: auto
- Task: main
- MD5:
5c9346ca48482140958581e584e4b506
- Run description: Reciprocal rank fusion of runs, minus pubmed seed.
WatSAE-BM25¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSAE-BM25
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
7eb278d49af0331957d5275f4eebe8fe
- Run description: We use the common crawl host graph to find domains the (HONCode + handpicked) domains have linked to. We perform pagerank on this subset of the host graph and take the top 10000 hosts. We filter c4/no.clean for these hosts documents. We filter these documents using a medical text classifier. With this collection we run BM25 with Anserini.
WatSAE-BM25-RR¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSAE-BM25-RR
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
a8e14e7afe999b9c93fc62522ef0f939
- Run description: We use the common crawl host graph to find domains the (HONCode + handpicked) domains have linked to. We perform pagerank on this subset of the host graph and take the top 10000 hosts. We filter c4/no.clean for these hosts documents. We filter these documents using a medical text classifier. With this collection we run BM25 with Anserini. We rank with query independent features including pagerank ratio of medical pages on the host, and manual url features.
WatSAE-BM25RM3¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSAE-BM25RM3
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
058194e6e64fa9eb30af4b5a5bbd6254
- Run description: We use the common crawl host graph to find domains the (HONCode + handpicked) domains have linked to. We perform pagerank on this subset of the host graph and take the top 10000 hosts. We filter c4/no.clean for these hosts documents. We filter these documents using a medical text classifier. With this collection we run BM25 plus RM3 with Anserini.
WatSAM-BM25¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSAM-BM25
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: auto
- Task: main
- MD5:
b0784fd5e374d5e7d4c476249582a8a4
- Run description: Anserini's BM25 on filtered collection (HONCode + handpicked) domains. This collection only includes documents with domains having an HONcode certification (see www.hon.ch for more details) or are part of small list of handpicked health related websites (e.g. health.harvard.edu).
WatSMC-CAL¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMC-CAL
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
ac633b766fc6c488e07fac826c784688
- Run description: Documents are scored using a Continuous Active Learning (Logistic Regression) model trained with two round of judging: 10 minutes per topic on filtered HONCode collection and 5 minutes per topic on HONCode+10kBM25 collection. Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed.
WatSMC-CALQA100¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMC-CALQA100
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
623c72dda2c396688beaa38deada9edb
- Run description: Paragraphs are scored using a Continuous Active Learning (Logistic Regression) model trained with two round of judging: 10 minutes per topic on filtered HONCode collection and 5 minutes per topic on HONCode+10kBM25 collection. Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed. Top 100 scoring paragraphs are reranked based on RoBERTa, fine tuned on BoolQ dataset, with the paragraph as context and topic's description as the yes/no question. Reranking is done to match the topic's stance field.
WatSMC-CALQAAll¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMC-CALQAAll
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
8a60fd145feedcfcc95ed6846bcf228f
- Run description: Paragraphs are scored using a Continuous Active Learning (Logistic Regression) model trained with two round of judging: 10 minutes per topic on filtered HONCode collection and 5 minutes per topic on HONCode+10kBM25 collection. Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed. All 1000 paragraphs are reranked based on RoBERTa, fine tuned on BoolQ dataset, with the paragraph as context and topic's description as the yes/no question. Reranking is done to match the topic's stance field.
WatSMC-CALQAHC1¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMC-CALQAHC1
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
a4b41b7fa5e195e7f55e3c9740cc1053
- Run description: Paragraphs are scored using a Continuous Active Learning (Logistic Regression) model trained with two round of judging: 10 minutes per topic on filtered HONCode collection and 5 minutes per topic on HONCode+10kBM25 collection. Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed. All 1000 paragraphs are reranked based on RoBERTa, fine tuned on BoolQ dataset, with the paragraph as context and topic's description as the yes/no question. Reranking is done based the topic's stance field and harmonic centrality.
WatSMC-CALQAHC2¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMC-CALQAHC2
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
5745a60f7e1caa62e8a453a9d4568990
- Run description: Paragraphs are scored using a Continuous Active Learning (Logistic Regression) model trained with two round of judging: 10 minutes per topic on filtered HONCode collection and 5 minutes per topic on HONCode+10kBM25 collection. Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed. All 1000 paragraphs are reranked based on RoBERTa, fine tuned on BoolQ dataset, with the paragraph as context and topic's description as the yes/no question. Aggressive reranking is done based the topic's stance field and harmonic centrality.
WatSMC-Correct¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMC-Correct
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
1addced4013566793af13e9792c8c3bc
- Run description: Correct documents are manually found using search and continuous active learning. Correct documents are placed first, followed by documents returned by a continuous active learning model trained using correct judgments only.
WatSMM-CAL¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMM-CAL
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
b54cd092a78c834843be7b9909b79f67
- Run description: Documents are scored using a Continuous Active Learning (Logistic Regression) model trained with one round of judging (maximum of 10 minutes per topic). Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed. Used filtered collection (HONCode + handpicked) domains.
WatSMM-CALHC¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMM-CALHC
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
25c853b2b65abd0bba443c572a70f731
- Run description: Documents are scored using a Continuous Active Learning (Logistic Regression) model trained with one round of judging (maximum of 10 minutes per topic). Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed. Used filtered collection (HONCode + handpicked) domains. Top 50 documents are reranked based on model score and normalized harmonic centrality score.
WatSMM-CALPR¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMM-CALPR
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
d1fac02651e334d03c5d6d0cbaf0c014
- Run description: Documents are scored using a Continuous Active Learning (Logistic Regression) model trained with one round of judging (maximum of 10 minutes per topic). Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed. Used filtered collection (HONCode + handpicked) domains. Top 50 documents are reranked based on model score and normalized pagerank score.
WatSMM-CALQA100¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMM-CALQA100
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
f15e5aa3d45ceb4f42b773bed5cef971
- Run description: Paragraphs are scored using a Continuous Active Learning (Logistic Regression) model trained with one round of judging: 10 minutes per topic on filtered HONCode collection. Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed. Top 100 scoring paragraphs are reranked based on RoBERTa, fine tuned on BoolQ dataset, with the paragraph as context and topic's description as the yes/no question. Reranking is done to match the topic's stance field.
WatSMM-CALQAAll¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMM-CALQAAll
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
c2de0fff7c7f1bce74a14b9cbc2fcae9
- Run description: Paragraphs are scored using a Continuous Active Learning (Logistic Regression) model trained with one round of judging: 10 minutes per topic on filtered HONCode collection. Training was focused on usefulness only. Each topic was initialized with the query as seed judgment. Interactive Search and Judging was allowed. All 1000 paragraphs are reranked based on RoBERTa, fine tuned on BoolQ dataset, with the paragraph as context and topic's description as the yes/no question. Reranking is done to match the topic's stance field.
WatSMM-Fused¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMM-Fused
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/1/2021
- Type: manual
- Task: main
- MD5:
2e260a14432a0feb7b05fbf33ff40905
- Run description: Reciprocal rank fusion on runs WatSMM-CAL, WatSMM-CALHC, and WatSMM-CALPR.
WatSMT-SD-S1¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMT-SD-S1
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: manual
- Task: main
- MD5:
ede6c9947300b70ca3820e82b1eacda7
- Run description: We fine-tune the T5-large model on a balanced subset of 2019 qrels to predict the stance of each document ("helpful" or "unhelpful"). We apply the stance detection model to re-rank the top 3k results from the BM25 baseline. To combine the BM25 scores and the stance prediction, we use the following fusion strategy: BM25_score * e^(probability - 0.5), where probability = helpful_probability if topic.stance == "helpful" else unhelpful_probability.
WatSMT-SD-S2¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: WatSMT-SD-S2
- Participant: UWaterlooMDS
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: manual
- Task: main
- MD5:
fafd396c8dc2c75c2729b92643176137
- Run description: We fine-tune the T5-large model on a balanced subset of 2019 qrels to predict the stance of each document ("helpful" or "unhelpful"). We apply the stance detection model to re-rank the top 3k results from the BM25 baseline. To combine the BM25 scores and the stance prediction, we use the following fusion strategy: if probability > 0.75 then BM25_score * 10, else if probability < 0.25 then BM25_socre * -1, where probability = helpful_probability if topic.stance == "helpful" else unhelpful_probability.
webis-bm25¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: webis-bm25
- Participant: Webis
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
e3bf2564d6e39ba733443a2eda5fbafe
- Run description: We retrieve the top 1000 documents with Anserini using BM25 (k1=0.9 and b=0.4), processing documents and queries with the porter stemmer and stopword removal.
webis-bm25-ax1¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: webis-bm25-ax1
- Participant: Webis
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
122205f6500824419eef86506da4f7db
- Run description: We re-rank top 20 documents initially retrieved with Anserini BM25 (k1=0.9 and b=0.4) using three argumentative axioms, where the axiom weights are chosen such that either one or two axioms decide to swap document positions.
webis-bm25-ax3¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: webis-bm25-ax3
- Participant: Webis
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
e29c63a84bf6de4dd3deacfe46886b05
- Run description: We re-rank top 20 documents initially retrieved with Anserini BM25 (k1=0.9 and b=0.4) using three argumentative axioms, where the axiom weights are chosen such that all three axioms decide to swap document positions.
webis-t5¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: webis-t5
- Participant: Webis
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
d0b50189e46970bfb73a752f01fd86ce
- Run description: We rerank the top 100 results of webis-bm25 with the MonoT5 model "castorini/monot5-base-msmarco" available on Hugging Face with PyGaggle.
webis-t5-ax1¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: webis-t5-ax1
- Participant: Webis
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
2850d40b78be05232d2d726ccec7aa96
- Run description: We re-rank top 20 documents of the webis-t5.txt run using three argumentative axioms, where the axiom weights are chosen such that either one or two axioms decide to swap document positions.
webis-t5-ax3¶
Results
| Participants
| Proceedings
| Input
| Summary
| Appendix
- Run ID: webis-t5-ax3
- Participant: Webis
- Track: Health Misinformation
- Year: 2021
- Submission: 9/2/2021
- Type: auto
- Task: main
- MD5:
d4330120b5164639297c617735241979
- Run description: We re-rank top 20 documents of the webis-t5.txt run using three argumentative axioms, where the axiom weights are chosen such that all three axioms decide to swap document positions.