Runs - Dynamic Domain 2016¶
FifthIterBaseline¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: FifthIterBaseline
- Participant: georgetown
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/9/2016
- Type: automatic
- Task: main
- Run description: Baseline for integration of feedback information
- Code: https://github.com/NeginR/DD16
FirstIterBaseline¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: FirstIterBaseline
- Participant: georgetown
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/8/2016
- Type: automatic
- Task: main
- Run description: Baseline results
- Code: https://github.com/NeginR/DD16
LDA_Indri73¶
Participants
| Input
| Appendix
- Run ID: LDA_Indri73
- Participant: IAPLab
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/7/2016
- Type: automatic
- Task: main
- Run description: We use Indri and LDA to access the first iteration of the first query. Then we use the MDP model which we has modified and get the next iteration.During the MDP model,we use the Indri to help to search and then get the final result.
- Code: https://github.com/TrecDD2016/trec_dd/
rmit_lm_nqe¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: rmit_lm_nqe
- Participant: RMIT
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/8/2016
- Type: automatic
- Task: main
- Run description: In this method, we used the Language modeling approach as implemented in Apache Solr using Dirichlet smoothing and default parameters. We leveraged Solr's edismax query parser that scores documents by the similarity score between the page content and the sum of bi-gram and uni-gram queries. No query expansion (nqe) was applied.
rmit_lm_psg.max¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: rmit_lm_psg.max
- Participant: RMIT
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/8/2016
- Type: automatic
- Task: main
- Run description: We split documents into half overlapped passages with a passage size of 200 words and index them as documents alongside their parent documents in Apache Solr. We then use Solr's block join query to score documents based on the maximum of their passage level relevance scores. The method scores passages using the sum of the passage language model score for a unigram query and a bigram based phrase query.
rmit_lm_rocchio.Rp.NRd.10¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: rmit_lm_rocchio.Rp.NRd.10
- Participant: RMIT
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/9/2016
- Type: automatic
- Task: main
- Run description: We use the content of documents to build a content language model and get the top 5 documents. We then use the Rocchio algorithm to reformulate the current iteration query using the feedback provided by JIG. To represent relevant documents, we concatenate relevant passages from relevant documents into a pseudo relevant passages (Rp) whereas we use the content of the non relevant documents as the non relevant units of Rocchio (NRd). Lastly, we use the top 10 non negative terms from the new query vector generated by Rocchio to build the new query. In addition, we set Rocchio parameters to alpha=1, beta=0.75 and gamma=0.25.
rmit_oracle.lm.1000¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: rmit_oracle.lm.1000
- Participant: RMIT
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/8/2016
- Type: manual
- Task: main
- Run description: We run Solr with the content language model to get the first 1000 documents, then we use the ground truth to remove non relevant documents from the initial list of documents. For each iteration, we return the next 5 relevant documents from the initial list. A document is relevant if it was found in the topic's list of judged documents. The motive is to estimate an upper bound of the task and understand if the first 1000 documents are enough to get all relevant documents.
SecondIterationBaseline¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: SecondIterationBaseline
- Participant: georgetown
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/9/2016
- Type: automatic
- Task: main
- Run description: Baseline for using feedback
- Code: https://github.com/NeginR/DD16
TenthIterBaseline¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: TenthIterBaseline
- Participant: georgetown
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/9/2016
- Type: automatic
- Task: main
- Run description: Baseline for tenth iteration
- Code: https://github.com/NeginR/DD16
ufmgHM2¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: ufmgHM2
- Participant: ufmg
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/6/2016
- Type: automatic
- Task: main
- Run description: Hierarchical diversification with multi-source subtopics and cumulative stopping condition.
ufmgHM3¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: ufmgHM3
- Participant: ufmg
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/6/2016
- Type: automatic
- Task: main
- Run description: Hierarchical diversification with multi-source subtopics and window-based stopping condition.
ufmgHS2¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: ufmgHS2
- Participant: ufmg
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/6/2016
- Type: automatic
- Task: main
- Run description: Hierarchical diversification with single-source subtopics and cumulative stopping condition.
ufmgXM2¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: ufmgXM2
- Participant: ufmg
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/6/2016
- Type: automatic
- Task: main
- Run description: Flat diversification with multi-source subtopics and cumulative stopping condition.
ufmgXS2¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: ufmgXS2
- Participant: ufmg
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/6/2016
- Type: automatic
- Task: main
- Run description: Flat diversification with single-source subtopics and cumulative stopping condition.
UL_BM25¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: UL_BM25
- Participant: LavalLakehead
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/8/2016
- Type: automatic
- Task: main
- Run description: BM25 similarity
UL_Kmeans¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: UL_Kmeans
- Participant: LavalLakehead
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/8/2016
- Type: automatic
- Task: main
- Run description: Kmeans applied on a subset of documents retrieved by Solr, best document of each cluster is returned to the user.
UL_LDA_200¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: UL_LDA_200
- Participant: LavalLakehead
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/8/2016
- Type: automatic
- Task: main
- Run description: LDA is used to create 5 different topics from documents. We take 100 results from Solr, we remove documents which are too similar to others documents, then we fill the dataset with other documents to have 100 document to run LDA over the sample.
UL_LDA_NE¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: UL_LDA_NE
- Participant: LavalLakehead
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/8/2016
- Type: automatic
- Task: main
- Run description: LDA used on a corpus of 25 documents from solr, Oriented for NE topics by reducing the text from each document to sentences which contains a part of the NE.
UL_LDA_Psum¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: UL_LDA_Psum
- Participant: LavalLakehead
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/8/2016
- Type: automatic
- Task: main
- Run description: Probability for each document to be assigned to each topic multiplied by the global probability of each topic to obtain the document which covers the maximum of topic information.
UPD_IA_BiQBDiJ¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: UPD_IA_BiQBDiJ
- Participant: UPD_IA
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/9/2016
- Type: automatic
- Task: main
- Run description: BM25 followed by max 5 iterations of feedback based on an algorithm inspired by Quantum Detection (QB) that exploits binary representation for documents. Feedback consists in re-ranking the (residual) top 1000 documents. When relevant documents are present in the feedback set explicit feedback is performed; when no relevant documents are present, residual collection is re-ranked by PRF on the top 100 documents. Description selection is based on WPQ; top 35 terms + topic terms are used. After two PRF-based re-ranking no additional iterations are performed.
UPD_IA_BiQBFi¶
Participants
| Proceedings
| Input
| Appendix
- Run ID: UPD_IA_BiQBFi
- Participant: UPD_IA
- Track: Dynamic Domain
- Year: 2016
- Submission: 9/7/2016
- Type: automatic
- Task: main
- Run description: BM25 followed by 5 iterations of feedback based on an algorithm inspired by Quantum Detection (QB) that exploits binary representation for documents. Feedback consists in re-ranking the (residual) top 1000 documents. When relevant documents are present in the feedback set explicit feedback is performed; when no relevant documents are present, residual collection is re-ranked by PRF on the top 100 documents. Description selection is based on WPQ; top 35 terms + topic terms are used.