
Runs - Dynamic Domain 2016

FifthIterBaseline

Participants | Proceedings | Input | Appendix

  • Run ID: FifthIterBaseline
  • Participant: georgetown
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/9/2016
  • Type: automatic
  • Task: main
  • Run description: Baseline for integration of feedback information
  • Code: https://github.com/NeginR/DD16

FirstIterBaseline

Participants | Proceedings | Input | Appendix

  • Run ID: FirstIterBaseline
  • Participant: georgetown
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/8/2016
  • Type: automatic
  • Task: main
  • Run description: Baseline results
  • Code: https://github.com/NeginR/DD16

LDA_Indri73

Participants | Input | Appendix

  • Run ID: LDA_Indri73
  • Participant: IAPLab
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/7/2016
  • Type: automatic
  • Task: main
  • Run description: We use Indri and LDA to retrieve the first iteration for the first query. We then use an MDP model, which we have modified, to generate the next iteration; within the MDP model, Indri is used to perform the search and produce the final result.
  • Code: https://github.com/TrecDD2016/trec_dd/

rmit_lm_nqe

Participants | Proceedings | Input | Appendix

  • Run ID: rmit_lm_nqe
  • Participant: RMIT
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/8/2016
  • Type: automatic
  • Task: main
  • Run description: In this method, we used the language modeling approach as implemented in Apache Solr, with Dirichlet smoothing and default parameters. We leveraged Solr's edismax query parser, which scores documents by the similarity between the page content and the sum of bigram and unigram queries. No query expansion (nqe) was applied.
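The Dirichlet-smoothed query likelihood scoring this run relies on can be sketched as follows. This is a minimal toy illustration of the standard formula, not RMIT's or Solr's actual implementation; the tokenized-document and collection-statistics inputs are assumptions for the example.

```python
import math
from collections import Counter

def dirichlet_lm_score(query_terms, doc_terms, collection_tf, collection_len, mu=2000):
    """Log query likelihood with Dirichlet smoothing:
    p(t|d) = (tf(t,d) + mu * p(t|C)) / (|d| + mu)."""
    tf = Counter(doc_terms)
    dlen = len(doc_terms)
    score = 0.0
    for t in query_terms:
        p_c = collection_tf.get(t, 0) / collection_len
        if p_c == 0:
            continue  # skip terms never seen in the collection
        score += math.log((tf[t] + mu * p_c) / (dlen + mu))
    return score
```

With mu=2000 (a common default), documents that actually contain a query term score higher than same-length documents that only get the smoothed background probability.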

rmit_lm_psg.max

Participants | Proceedings | Input | Appendix

  • Run ID: rmit_lm_psg.max
  • Participant: RMIT
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/8/2016
  • Type: automatic
  • Task: main
  • Run description: We split documents into half-overlapping passages of 200 words and index them as documents alongside their parent documents in Apache Solr. We then use Solr's block join query to score documents by the maximum of their passage-level relevance scores. The method scores passages using the sum of the passage language model score for a unigram query and a bigram-based phrase query.
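The half-overlapping passage split and max-passage document scoring described above can be sketched like this. It is a simplified stand-in for the Solr block-join pipeline: `score_fn` is a hypothetical placeholder for whatever passage scorer is plugged in.

```python
def passages(tokens, size=200):
    """Split a token list into half-overlapping passages of `size` words
    (each passage starts size//2 words after the previous one)."""
    step = size // 2
    return [tokens[i:i + size] for i in range(0, max(len(tokens) - step, 1), step)]

def max_passage_score(query, tokens, score_fn, size=200):
    """Score a document by the maximum relevance score of its passages."""
    return max(score_fn(query, p) for p in passages(tokens, size))
```

For example, with `size=4` a 10-token document yields passages starting at offsets 0, 2, 4, 6, and the document inherits its best passage's score.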

rmit_lm_rocchio.Rp.NRd.10

Participants | Proceedings | Input | Appendix

  • Run ID: rmit_lm_rocchio.Rp.NRd.10
  • Participant: RMIT
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/9/2016
  • Type: automatic
  • Task: main
  • Run description: We use the content of documents to build a content language model and retrieve the top 5 documents. We then use the Rocchio algorithm to reformulate the current iteration's query using the feedback provided by JIG. To represent relevant documents, we concatenate relevant passages from relevant documents into a pseudo-relevant passage (Rp), whereas we use the content of the non-relevant documents as the non-relevant units of Rocchio (NRd). Lastly, we use the top 10 non-negative terms from the new query vector generated by Rocchio to build the new query. We set the Rocchio parameters to alpha=1, beta=0.75 and gamma=0.25.
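The Rocchio reformulation above, with the stated alpha/beta/gamma weights and top-10 positive-term selection, can be sketched roughly as follows. This is a toy illustration under the assumption that queries, Rp, and NRd are represented as term-weight dicts; it is not RMIT's code (their run links no repository).

```python
def rocchio(query_vec, rel_vec, nonrel_vec, alpha=1.0, beta=0.75, gamma=0.25, k=10):
    """Rocchio reformulation: q' = alpha*q + beta*Rp - gamma*NRd.
    Returns the top-k terms with strictly positive weight as the new query."""
    terms = set(query_vec) | set(rel_vec) | set(nonrel_vec)
    new_vec = {t: alpha * query_vec.get(t, 0.0)
                  + beta * rel_vec.get(t, 0.0)
                  - gamma * nonrel_vec.get(t, 0.0)
               for t in terms}
    positive = {t: w for t, w in new_vec.items() if w > 0}
    return sorted(positive, key=positive.get, reverse=True)[:k]
```

Terms pushed negative by the non-relevant vector (gamma term) drop out of the reformulated query, which matches the "non-negative terms" filter in the description.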

rmit_oracle.lm.1000

Participants | Proceedings | Input | Appendix

  • Run ID: rmit_oracle.lm.1000
  • Participant: RMIT
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/8/2016
  • Type: manual
  • Task: main
  • Run description: We run Solr with the content language model to get the first 1000 documents, then use the ground truth to remove non-relevant documents from the initial list. For each iteration, we return the next 5 relevant documents from the initial list. A document is relevant if it was found in the topic's list of judged documents. The motivation is to estimate an upper bound for the task and to understand whether the first 1000 documents are enough to retrieve all relevant documents.
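The oracle procedure above (filter the initial ranking by the ground truth, then emit 5 relevant documents per iteration) reduces to a few lines; this sketch assumes document IDs and a set of judged-relevant IDs as inputs.

```python
def oracle_iterations(ranked_docs, judged_relevant, per_iter=5):
    """Keep only judged-relevant documents from an initial ranking,
    then return them in batches of `per_iter` per feedback iteration."""
    relevant = [d for d in ranked_docs if d in judged_relevant]
    return [relevant[i:i + per_iter] for i in range(0, len(relevant), per_iter)]
```

Because the oracle never leaves the initial top-1000 list, its score caps what any feedback strategy over that list could achieve, which is exactly the upper-bound question the run is probing.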

SecondIterationBaseline

Participants | Proceedings | Input | Appendix

  • Run ID: SecondIterationBaseline
  • Participant: georgetown
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/9/2016
  • Type: automatic
  • Task: main
  • Run description: Baseline for using feedback
  • Code: https://github.com/NeginR/DD16

TenthIterBaseline

Participants | Proceedings | Input | Appendix

  • Run ID: TenthIterBaseline
  • Participant: georgetown
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/9/2016
  • Type: automatic
  • Task: main
  • Run description: Baseline for tenth iteration
  • Code: https://github.com/NeginR/DD16

ufmgHM2

Participants | Proceedings | Input | Appendix

  • Run ID: ufmgHM2
  • Participant: ufmg
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/6/2016
  • Type: automatic
  • Task: main
  • Run description: Hierarchical diversification with multi-source subtopics and cumulative stopping condition.

ufmgHM3

Participants | Proceedings | Input | Appendix

  • Run ID: ufmgHM3
  • Participant: ufmg
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/6/2016
  • Type: automatic
  • Task: main
  • Run description: Hierarchical diversification with multi-source subtopics and window-based stopping condition.

ufmgHS2

Participants | Proceedings | Input | Appendix

  • Run ID: ufmgHS2
  • Participant: ufmg
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/6/2016
  • Type: automatic
  • Task: main
  • Run description: Hierarchical diversification with single-source subtopics and cumulative stopping condition.

ufmgXM2

Participants | Proceedings | Input | Appendix

  • Run ID: ufmgXM2
  • Participant: ufmg
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/6/2016
  • Type: automatic
  • Task: main
  • Run description: Flat diversification with multi-source subtopics and cumulative stopping condition.

ufmgXS2

Participants | Proceedings | Input | Appendix

  • Run ID: ufmgXS2
  • Participant: ufmg
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/6/2016
  • Type: automatic
  • Task: main
  • Run description: Flat diversification with single-source subtopics and cumulative stopping condition.

UL_BM25

Participants | Proceedings | Input | Appendix

  • Run ID: UL_BM25
  • Participant: LavalLakehead
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/8/2016
  • Type: automatic
  • Task: main
  • Run description: BM25 similarity
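Since the description is just "BM25 similarity", for reference here is a minimal self-contained scorer in the standard Okapi BM25 formulation with common k1/b defaults. It is an illustration of the ranking function, not the LavalLakehead code, and the document-frequency/collection-statistics inputs are assumptions for the example.

```python
import math
from collections import Counter

def bm25_score(query_terms, doc_terms, df, n_docs, avg_dlen, k1=1.2, b=0.75):
    """Okapi BM25: sum over query terms of idf(t) * tf saturation term."""
    tf = Counter(doc_terms)
    dlen = len(doc_terms)
    score = 0.0
    for t in query_terms:
        if t not in tf:
            continue  # terms absent from the document contribute nothing
        idf = math.log(1 + (n_docs - df.get(t, 0) + 0.5) / (df.get(t, 0) + 0.5))
        norm = tf[t] * (k1 + 1) / (tf[t] + k1 * (1 - b + b * dlen / avg_dlen))
        score += idf * norm
    return score
```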

UL_Kmeans

Participants | Proceedings | Input | Appendix

  • Run ID: UL_Kmeans
  • Participant: LavalLakehead
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/8/2016
  • Type: automatic
  • Task: main
  • Run description: K-means is applied to a subset of documents retrieved by Solr; the best document of each cluster is returned to the user.
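The cluster-then-pick-best strategy above can be sketched as follows. The description does not say how "best" is decided, so this sketch assumes a per-document relevance score (e.g. the Solr retrieval score) and takes the argmax within each cluster; the k-means itself is a plain Lloyd's iteration, not the participants' implementation.

```python
import numpy as np

def kmeans_best_docs(doc_vecs, rel_scores, k=5, iters=20, seed=0):
    """Cluster document vectors with Lloyd's k-means, then return the
    index of the highest-scoring document in each non-empty cluster."""
    rng = np.random.default_rng(seed)
    X = np.asarray(doc_vecs, dtype=float)
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        # assign each document to its nearest centroid (Euclidean distance)
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for c in range(k):
            if (labels == c).any():
                centroids[c] = X[labels == c].mean(axis=0)
    best = []
    for c in range(k):
        members = np.flatnonzero(labels == c)
        if members.size:
            best.append(int(members[np.argmax(np.asarray(rel_scores)[members])]))
    return best
```

Returning one representative per cluster trades raw rank order for coverage of distinct aspects, which fits the Dynamic Domain goal of surfacing diverse relevant material per iteration.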

UL_LDA_200

Participants | Proceedings | Input | Appendix

  • Run ID: UL_LDA_200
  • Participant: LavalLakehead
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/8/2016
  • Type: automatic
  • Task: main
  • Run description: LDA is used to create 5 topics from the documents. We take 100 results from Solr, remove documents that are too similar to other documents, then pad the set with further documents so that LDA is run over a sample of 100 documents.

UL_LDA_NE

Participants | Proceedings | Input | Appendix

  • Run ID: UL_LDA_NE
  • Participant: LavalLakehead
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/8/2016
  • Type: automatic
  • Task: main
  • Run description: LDA applied to a corpus of 25 documents from Solr, oriented toward named-entity (NE) topics by reducing the text of each document to the sentences that contain part of the NE.

UL_LDA_Psum

Participants | Proceedings | Input | Appendix

  • Run ID: UL_LDA_Psum
  • Participant: LavalLakehead
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/8/2016
  • Type: automatic
  • Task: main
  • Run description: The probability of each document being assigned to each topic is multiplied by the global probability of that topic; the document that covers the maximum of topic information is selected.
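Reading the description as score(d) = sum over topics t of p(t|d) * p(t), the selection rule is a single weighted sum per document. This is an interpretation sketched under that assumption, with the LDA document-topic matrix and topic prior taken as given inputs.

```python
import numpy as np

def psum_best_doc(doc_topic, topic_prior):
    """Score each document by sum_t p(t|d) * p(t) and return the index of
    the document covering the most topic mass, plus all scores."""
    scores = np.asarray(doc_topic) @ np.asarray(topic_prior)
    return int(np.argmax(scores)), scores
```

A document concentrated on the globally dominant topics outranks one spread over rare topics, so the rule favors coverage of the high-mass topic information.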

UPD_IA_BiQBDiJ

Participants | Proceedings | Input | Appendix

  • Run ID: UPD_IA_BiQBDiJ
  • Participant: UPD_IA
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/9/2016
  • Type: automatic
  • Task: main
  • Run description: BM25 followed by at most 5 iterations of feedback based on an algorithm inspired by Quantum Detection (QB) that exploits a binary representation of documents. Feedback consists of re-ranking the (residual) top 1000 documents. When relevant documents are present in the feedback set, explicit feedback is performed; when no relevant documents are present, the residual collection is re-ranked by PRF on the top 100 documents. Term selection is based on WPQ; the top 35 terms plus the topic terms are used. After two PRF-based re-rankings, no additional iterations are performed.

UPD_IA_BiQBFi

Participants | Proceedings | Input | Appendix

  • Run ID: UPD_IA_BiQBFi
  • Participant: UPD_IA
  • Track: Dynamic Domain
  • Year: 2016
  • Submission: 9/7/2016
  • Type: automatic
  • Task: main
  • Run description: BM25 followed by 5 iterations of feedback based on an algorithm inspired by Quantum Detection (QB) that exploits a binary representation of documents. Feedback consists of re-ranking the (residual) top 1000 documents. When relevant documents are present in the feedback set, explicit feedback is performed; when no relevant documents are present, the residual collection is re-ranked by PRF on the top 100 documents. Term selection is based on WPQ; the top 35 terms plus the topic terms are used.