Skip to content

Runs - Genomics 2004

akoike

Results | Participants | Input | Summary | Appendix

  • Run ID: akoike
  • Participant: u.tokyo
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: manual
  • Task: adhoc
  • Run description: Strong heuristic

akoyama

Results | Participants | Input | Summary | Appendix

  • Run ID: akoyama
  • Participant: u.tokyo
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: manual
  • Task: adhoc
  • Run description: Initial result screening

aliasiBase

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: aliasiBase
  • Participant: alias-i
  • Track: Genomics
  • Year: 2004
  • Submission: 7/7/2004
  • Type: automatic
  • Task: adhoc
  • Run description: SYSTEM Jakarta Lucene 1.3 (jakarta.apache.org/lucene) LINGUISTICS Alias-i LingPipe (www.aliasi.com/lingpipe) INDEXING Tokenized with LingPipe IndoEuropeanTokenizer, lowercased, without a stop list. Combined document text into single field with boosts 4title, 1abstract, 2ordinary MESH, 4starred MESH, 1chemical names. Took 5 hours, producing 9GB index, including storing original titles, abstracts, MESH terms and chemical names. SEARCH Tokenized with LingPipe IndoEuropeanTokenizer, lowercased, removed stop terms. Used fields with boosts 4title, 2need, 1context. Lucene's standard TF/IDF implementation TF is square root of raw frequency; IDF is log(num docs/doc freq + 1) + 1; doc and query normalized to unit length. Took 15 minutes per result set of 50 topics with 1000 results/topic.

aliasiTerms

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: aliasiTerms
  • Participant: alias-i
  • Track: Genomics
  • Year: 2004
  • Submission: 7/7/2004
  • Type: automatic
  • Task: adhoc
  • Run description: SYSTEM Jakarta Lucene 1.3 (jakarta.apache.org/lucene) LINGUISTICS Alias-i LingPipe (www.aliasi.com/lingpipe) INDEXING Tokenized with LingPipe IndoEuropeanTokenizer, lowercased, without a stop list. Combined document text into single field with boosts 4title, 1abstract, 2ordinary MESH, 4starred MESH, 1chemical names. Took 5 hours, producing 9GB index, including storing original titles, abstracts, MESH terms and chemical names. SEARCH Tokenized with LingPipe IndoEuropeanTokenizer, lowercased, removed stop terms. Used fields with boosts 4title, 2need, 1context. Lucene's standard TF/IDF implementation TF is square root of raw frequency; IDF is log(num docs/doc freq + 1) + 1; doc and query normalized to unit length. Took 15 minutes per result set of 50 topics with 1000 results/topic. TERMS Added all terms recognized by LingPipe's GENIA-corpus trained Genomics named entity annotator as quoted search terms with a boost of 4 (multiplied by field boost).

biotext1trge

Results | Participants | Input | Summary | Appendix

  • Run ID: biotext1trge
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Threshold change

BIOTEXT21

Results | Participants | Input | Summary | Appendix

  • Run ID: BIOTEXT21
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: annhi
  • Run description: GO code extraction, MGI before 2003

BIOTEXT22

Results | Participants | Input | Summary | Appendix

  • Run ID: BIOTEXT22
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: annhi
  • Run description: GO code extraction

BIOTEXT23

Results | Participants | Input | Summary | Appendix

  • Run ID: BIOTEXT23
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: annhi
  • Run description: GO identification + classification (SVM) + human

BIOTEXT24

Results | Participants | Input | Summary | Appendix

  • Run ID: BIOTEXT24
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: annhi
  • Run description: GO identification + classification (SVM)

BIOTEXT25

Results | Participants | Input | Summary | Appendix

  • Run ID: BIOTEXT25
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: annhi
  • Run description: GO identification + classification (SVM); window of 50

biotext2trge

Results | Participants | Input | Summary | Appendix

  • Run ID: biotext2trge
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Threshold change

biotext3trge

Results | Participants | Input | Summary | Appendix

  • Run ID: biotext3trge
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Threshold change

biotext4trge

Results | Participants | Input | Summary | Appendix

  • Run ID: biotext4trge
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Threshold change

biotext5trge

Results | Participants | Input | Summary | Appendix

  • Run ID: biotext5trge
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Threshold change

BioTextAdHoc

Results | Participants | Input | Summary | Appendix

  • Run ID: BioTextAdHoc
  • Participant: u.cberkeley.hearst
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Features include MeSH descriptors, gene mappings, and gene families.

ConversAuto

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ConversAuto
  • Participant: converspeech
  • Track: Genomics
  • Year: 2004
  • Submission: 7/14/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Automatic Query terms and filter terms are constructed from the topics using automated term extraction. Query terms are combined to form queries using a federated biomedical and English-language ontology. Filter terms order and score the results of the searches, which are implemented using PubMed e-utilities.

ConversManu

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: ConversManu
  • Participant: converspeech
  • Track: Genomics
  • Year: 2004
  • Submission: 7/14/2004
  • Type: manual
  • Task: adhoc
  • Run description: Manual Queries and filter terms generated automatically were modified in consultation with a biologist based on an understanding of the intended search for each topic. Three kinds of general modifications were made.

csusm

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: csusm
  • Participant: u.sanmarcos
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: manual
  • Task: adhoc
  • Run description: Removed stop words from XML Medline files and topics file. Query constructed using title field, in some cases information from the need field. Word-based search, each term in the query is assigned an equivalent weight. If there is a match add term weight to totalweight. If totalweight is >= total# terms for query * .5 * weight then document is retrieved.

cuhkrun1

Results | Participants | Input | Summary | Appendix

  • Run ID: cuhkrun1
  • Participant: chinese.u.hongkong
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: annhi
  • Run description: SVM classification is used with term weighting scheme of tf-idf; and with cost ration automatically tuned.

cuhkrun2

Results | Participants | Input | Summary | Appendix

  • Run ID: cuhkrun2
  • Participant: chinese.u.hongkong
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: annhi
  • Run description: Target gene identification is conducted which SVM classification is used with term weight considering distance between the term and target gene. Also, with cost ration automatically tuned.

cuhkrun3

Results | Participants | Input | Summary | Appendix

  • Run ID: cuhkrun3
  • Participant: chinese.u.hongkong
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: annhi
  • Run description: Using SVM with term weighting tf-idf and with different set of features. Also, automatically tuning the cost ration.

DCUma

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: DCUma
  • Participant: dubblincity.u
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: manual
  • Task: adhoc
  • Run description: We built a MeSH term index and it is the only index used by our system for this run. With the help of an expert, we manually build mesh queries from the topics.

DCUmatn1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: DCUmatn1
  • Participant: dubblincity.u
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: manual
  • Task: adhoc
  • Run description: Two indices were built for this run, one with MeSH descriptors and one with title/abstract terms. We used the title and need fields to automatically generate abstract/title query and produced a ranking with the BM25 algorithm. We then queried the MeSH index using MeSH queries generated manually from the topics with the help of an expert and obtained a second ranking. The two ranking were combined and only the top 1000 documents were retained for each topic.

dimacsAabsw1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsAabsw1
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: annhi
  • Run description: Title, subject and abstract of the article were used to represent document-gene pair. Text representation stemmed, logtf-idf, cosine normalization. A classifier, Bayesian Binary Regression (BBR) with Gaussian prior, was trained separately for each domain. The test results across domains were then combined into one file.

dimacsAg3mh

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsAg3mh
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/29/2004
  • Type: automatic
  • Task: annhi
  • Run description: MeSH terms from MEDLINE abstracts were used to represent document-gene pairs. Each MeSH term was represented as a token. Text representation binary. A classifier, Bayesian Binary Regression (BBR) with Gaussian prior was trained separately for each domain. The test results across domains were then combined into one file.

dimacsAl3w

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsAl3w
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/29/2004
  • Type: automatic
  • Task: annhi
  • Run description: Fulltext taken from journal articles' subject, title, abstract, and body fields were used to represent document-gene pairs. Text representation stemmed, logtf-idf and cosine normalization. A classifier, Bayesian Binary Regression (BBR) with Laplace prior was trained separately for each domain. The test results across domains were then combined into one file.

dimacsAp5w5

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsAp5w5
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: annhi
  • Run description: Paragraphs containing gene symbol/description (taken from gtrain and from LocusLink) were used to represent document-gene pair. Text representation stemmed, logtf-idf, cosine normalization. Positive instances in the training were given weight 5. A classifier, Bayesian Binary Regression (BBR) with Gaussian prior, was trained separately for each domain. The test results across domains were then combined into one file.

dimacsAw20w5

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsAw20w5
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: annhi
  • Run description: Join of windows of half-size 20 around gene symbol/description (taken from gtrain and from LocusLink) were used to represent document-gene pair. Text representation stemmed, logtf-idf, cosine normalization. Positive instances in the training were given weight 5. A classifier, Bayesian Binary Regression (BBR) with Gaussian prior, was trained separately for each domain. The test results across domains were then combined into one file.

dimacsTfl9d

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsTfl9d
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: triage
  • Run description: Training and test set documents were prefiltered to obtain documents with MeSH term Mice. Prefiltered training set was used to construct a classifier. Article title, abstract and MEDLINE MeSH terms were used for document representation. Text representation stemmed, logtf-idf, cosine normalization. A classifier, Bayesian Binary Regression (BBR) with Laplace prior, was trained, and used to make predictions with maximum expected effectiveness threshold tuning.

dimacsTfl9w

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsTfl9w
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: triage
  • Run description: Training and test set documents were prefiltered to obtain documents with MeSH term Mice. Prefiltered training set was used to construct a classifier. Fulltext obtained from journal articles' title, abstract and body fields were used for document representation. Text representation stemmed, logtf-idf, cosine normalization. A classifier, Bayesian Binary Regression (BBR) with Laplace prior, was trained, and used to make predictions with maximum expected effectiveness threshold tuning.

dimacsTl9md

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsTl9md
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: triage
  • Run description: Article title, abstract and MEDLINE MeSH terms were used for document representation. Text representation stemmed, logtf-idf, cosine normalization. A classifier, Bayesian Binary Regression (BBR) with Laplace prior, was trained and used to make predictions with maximum expected effectiveness threshold tuning.

dimacsTl9mhg

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsTl9mhg
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: triage
  • Run description: For each document, MeSH terms and GENBANK references were taken from the corresponding MEDLINE abstracts, and each MeSH term and organism from the corresponding GENBANK entry was represented as a token. Text representation binary. A classifier, Bayesian Binary Regression (BBR) with Laplace prior, was trained and used to make predictions with maximum expected effectiveness threshold tuning.

dimacsTl9w

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: dimacsTl9w
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: triage
  • Run description: Fulltext obtained from journal articles' subject, title, abstract and body fields were used for document representation. Text representation stemmed, logtf-idf, cosine normalization. A classifier, Bayesian Binary Regression (BBR) with Laplace prior, was trained, and used to make predictions with maximum expected effectiveness threshold tuning.

edinauto2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: edinauto2
  • Participant: u.edinburgh.sinclair
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Lucene retrieval engine; queries weighted according to Web counts and expanded with noun phrases and synonyms

edinauto5

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: edinauto5
  • Participant: u.edinburgh.sinclair
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Lucene retrieval engine; queries weighted according to Web counts and expanded with noun phrases and synonyms

edis2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: edis2
  • Participant: u.edinburgh.sinclair
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Mice relevance based on Discussion sections and MH and RN headings, with triage decision based on Discussion section only.

eint2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: eint2
  • Participant: u.edinburgh.sinclair
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Based on introduction and MH and RN headings only

EMCTNOT1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: EMCTNOT1
  • Participant: tno.kraaij
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Documents were indexed for three different ontologies using concept recognition (Collexis). MesH terms were also encoded using semantic groups. This resulted in 4 different feature representations for which 5 different classifiers were trained. The output of these classifiers was combined in a meta-classifier.

emet2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: emet2
  • Participant: u.edinburgh.sinclair
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Based on Methods sections and MH and RN headings only

epub2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: epub2
  • Participant: u.edinburgh.sinclair
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Based on title, abstract and MH and RN headings only

eres2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: eres2
  • Participant: u.edinburgh.sinclair
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Mice relevance based on Results sections and MH and RN headings, with triage decision based on Results section only.

geneteam1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: geneteam1
  • Participant: u.hospital.geneva
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Triage with bigrams only

geneteam2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: geneteam2
  • Participant: u.hospital.geneva
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Triage with bigrams and monograms

geneteam3

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: geneteam3
  • Participant: u.hospital.geneva
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Triage with only monograms

geneteamA1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: geneteamA1
  • Participant: u.hospital.geneva
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: sentence selection V1 categorizer "categMatcher" threshold selection

geneteamA2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: geneteamA2
  • Participant: u.hospital.geneva
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: sentence selection V1 categorizer "GOMAP" threshold selection

geneteamA3

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: geneteamA3
  • Participant: u.hospital.geneva
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: sentence selection V2 categorizer "categMatcher" threshold selection

geneteamA4

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: geneteamA4
  • Participant: u.hospital.geneva
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: sentence selection V2 categorizer "GOMAP" threshold selection

geneteamA5

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: geneteamA5
  • Participant: u.hospital.geneva
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: Use of naive bayes info gain feature selection sentence selection V2

GUCbase

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUCbase
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: Baseline run all possible combinations generated

GUCir30

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUCir30
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: Lemur Text segments with gene names only Threshold for + Okapi score 30 w/ Backup

GUCir50

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUCir50
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: Lemur Text segments with gene names only Threshold for + Okapi score 50 w/ Backup

GUClin1260

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUClin1260
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Linear SVM model top 1260 returned documents

GUClin1700

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUClin1700
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Linear SVM model top 1700 returned documents

GUCply1260

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUCply1260
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Polynomial SVM model top 1260 returned documents captions only

GUCply1700

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUCply1700
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Polynomial SVM model top 1700 returned documents captions only

GUCsvm0

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUCsvm0
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: Linear SVM Text segments with gene names only Threshold for + 0.0

GUCsvm5

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUCsvm5
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: Linear SVM Text segments with gene names only Threshold for + -5.0

GUCwdply2000

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: GUCwdply2000
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Polynomial SVM model top 2000 returned documents full documents

IBMIRLver1

Results | Participants | Input | Summary | Appendix

  • Run ID: IBMIRLver1
  • Participant: ibm.india
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: used bioannotator, followed by classifier

iowarun1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: iowarun1
  • Participant: u.iowa
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: Iowa team's first run 390 lines

iowarun2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: iowarun2
  • Participant: u.iowa
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: Iowa's team run 2 305 lines

iowarun3

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: iowarun3
  • Participant: u.iowa
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: Iowa's team run 3 447 lines

iowarun4

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: iowarun4
  • Participant: u.iowa
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: annhi
  • Run description: Used a retrieval approach based on citation sentences.

KoikeyaHi1

Results | Participants | Input | Summary | Appendix

  • Run ID: KoikeyaHi1
  • Participant: u.tokyo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: manual
  • Task: annhi
  • Run description: Efficient structural analyses of sentences.

KoikeyaHiev1

Results | Participants | Input | Summary | Appendix

  • Run ID: KoikeyaHiev1
  • Participant: u.tokyo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: manual
  • Task: annhiev
  • Run description: Efficient structural analyses of sentences.

KoikeyaTri1

Results | Participants | Input | Summary | Appendix

  • Run ID: KoikeyaTri1
  • Participant: u.tokyo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: manual
  • Task: triage
  • Run description: Efficient structural analyses of sentences.

KoikeyaTri2

Results | Participants | Input | Summary | Appendix

  • Run ID: KoikeyaTri2
  • Participant: u.tokyo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: manual
  • Task: triage
  • Run description: Efficient structural analyses of sentences. Extended version.

KoikeyaTri3

Results | Participants | Input | Summary | Appendix

  • Run ID: KoikeyaTri3
  • Participant: u.tokyo
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: manual
  • Task: triage
  • Run description: Efficient structural analyses of sentences. Extended and high-rank documents version.

lga1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: lga1
  • Participant: indiana.u.seki
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: automatic
  • Task: adhoc
  • Run description: query expansion and pseudo-relevance feedback

lga2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: lga2
  • Participant: indiana.u.seki
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: automatic
  • Task: adhoc
  • Run description: query expansion

lgcab1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: lgcab1
  • Participant: indiana.u.seki
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhiev
  • Run description: knn+svd

lgcab2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: lgcab2
  • Participant: indiana.u.seki
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhiev
  • Run description: knn

lgcad1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: lgcad1
  • Participant: indiana.u.seki
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: knn+svd

lgcad2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: lgcad2
  • Participant: indiana.u.seki
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: knn

lgct1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: lgct1
  • Participant: indiana.u.seki
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Naive Bayes Classifier Gene Filter Mesh Terms

lgct2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: lgct2
  • Participant: indiana.u.seki
  • Track: Genomics
  • Year: 2004
  • Submission: 8/31/2004
  • Type: automatic
  • Task: triage
  • Run description: Naive Bayes Classifier Gene Filter Mesh Terms

LHCUMDSE

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: LHCUMDSE
  • Participant: nlm.umd.ul
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Search engine (SE) queries were formulated by extracting information from the task queries gene names recognized by ABGene, the corresponding MeSH terms for the gene names, phrases/concepts identified by MetaMap, organisms extracted using the NCBI Taxonomy, and other useful words extracted based on their form and TFIDF score. The extracted information was translated into SE syntax and weighted, and the results were filtered to those identified as likely genetically related by Journal Descriptor Indexing (JDI).

MeijiHilG

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: MeijiHilG
  • Participant: meiji.u
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: query expansion depended on context

NLMA1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NLMA1
  • Participant: nlm.umd.ul
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: A machine learning method which is a variation of decision list learning. NLMA1 is obtained using the whole training set (T1=0.5, T2=50)

NLMA2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NLMA2
  • Participant: nlm.umd.ul
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: annhi
  • Run description: A machine learning method which is a variation of decision list learning. NLMA2 is obtained with an additional bootstrapping step, for (PMID, gene) pairs with a summation that is larger than T3 (i.e., 100), add them into the training set, and redo the categorization (with a threshold T1=0.8, T2=80).

NLMT21

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NLMT21
  • Participant: nlm.umd.ul
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Using all training data, create theme in + with background + or - Query test set with theme Keep if score > 0

NLMT22

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NLMT22
  • Participant: nlm.umd.ul
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Split into three sets, JBC,JCB,PNAS Create theme for each set in + with background + or - Split test set by journal Query each set by its theme Combine sco > 0 for all 3 journals

NLMT2ADA

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NLMT2ADA
  • Participant: nlm.umd.ul
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: ADA machine learning algorithm using 3-fold cross validation on train

NLMT2BAYES

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NLMT2BAYES
  • Participant: nlm.umd.ul
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: BAYES machine learning algorithm using 3-fold cross validation on train

NLMT2SVM

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NLMT2SVM
  • Participant: nlm.umd.ul
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM machine learning algorithm using 3-fold cross validation on train

NTU2v3N1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NTU2v3N1
  • Participant: ntu.chen
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: triage
  • Run description: title + abstract + fig/tbl captions

NTU3v3N1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NTU3v3N1
  • Participant: ntu.chen
  • Track: Genomics
  • Year: 2004
  • Submission: 8/29/2004
  • Type: automatic
  • Task: triage
  • Run description: atl + abs + fig/tbl captions + body(if no abs)

NTU3v3N1c2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NTU3v3N1c2
  • Participant: ntu.chen
  • Track: Genomics
  • Year: 2004
  • Submission: 8/29/2004
  • Type: automatic
  • Task: triage
  • Run description: atl + abs + fig/tbl captions + body(if no abs) different parameters

NTU4v3N1416

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: NTU4v3N1416
  • Participant: ntu.chen
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: atl + abs + fig/tbl captions + BODY (if no abs) or result/discussion/conclusion Sections in BODY (if has abs)

nusbird2004a

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: nusbird2004a
  • Participant: mlg.nus
  • Track: Genomics
  • Year: 2004
  • Submission: 8/26/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM

nusbird2004b

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: nusbird2004b
  • Participant: mlg.nus
  • Track: Genomics
  • Year: 2004
  • Submission: 8/29/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM

nusbird2004c

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: nusbird2004c
  • Participant: mlg.nus
  • Track: Genomics
  • Year: 2004
  • Submission: 8/29/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM, with additional positive training examples.

nusbird2004d

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: nusbird2004d
  • Participant: mlg.nus
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM

nusbird2004e

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: nusbird2004e
  • Participant: mlg.nus
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM

OHSUAll

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: OHSUAll
  • Participant: ohsu.hersh
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Automatic query generation, processed by Lucene using TF-IDF without normalization

OHSUNBAYES

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: OHSUNBAYES
  • Participant: ohsu.hersh
  • Track: Genomics
  • Year: 2004
  • Submission: 8/24/2004
  • Type: automatic
  • Task: triage
  • Run description: CHI SQUARE 0.025 FULL FEATURE SET NAIVE BAYES CLASSIFIER

OHSUNeeds

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: OHSUNeeds
  • Participant: ohsu.hersh
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Automatic query generation, processed by Lucene using TF-IDF without normalization

OHSUSVMJ20

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: OHSUSVMJ20
  • Participant: ohsu.hersh
  • Track: Genomics
  • Year: 2004
  • Submission: 8/24/2004
  • Type: automatic
  • Task: triage
  • Run description: CHI SQUARE 0.025 FULL FEATURE SET SVMLIGHT j=20.0 CLASSIFIER

OHSUVP

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: OHSUVP
  • Participant: ohsu.hersh
  • Track: Genomics
  • Year: 2004
  • Submission: 8/24/2004
  • Type: automatic
  • Task: triage
  • Run description: CHI SQUARE 0.025 FULL FEATURE SET VOTING PERCEPTRON CLASSIFIER LINEAR KERNEL FN LEARNING RATE 20.0 FP LEARNING RATE 1.0

PD50501

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: PD50501
  • Participant: u.padova
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: MySQL has been employed as IR engine. PD50501 has been obtained by using a new query expansion technique. After retrieving 20 documents, all the symbols are selected together with the keywords occurring in the 10-word window built around each symbol. The 50 most frequent keywords are selected and linked to the symbols co-occurring in the text windows. This way, a bi-partite graph is generated where symbols "point-to" frequent and close keywords.SPLIT (see PDTNsmp4 description) is then performed to disclose the mutual reinforcement relationship between symbols and keywords. The most "authoritative" keywords are frequently pointed-to by the symbols which frequently point-to "authoritative" keywords. The most "authoritative" keywords are added to expand the query which is in turn used to retrieve 1000 documents.

PDTNsmp4

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: PDTNsmp4
  • Participant: u.padova
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: MySQL has been employed as IR engine. PDTNsmp4 has been obtained by indexing the collection using SPLIT, a stemming algorithm proposed and tested for some European languages. SPLIT infers directly the word formation rules from the corpus of words in the collection by exploiting a mutual reinforcement between the prefixes and suffixes of words. The hypothesis that SPLIT is effective has been tested in this domain-specific context, in which word formation rules can be different from those of common English.

pllsgen4a1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: pllsgen4a1
  • Participant: patolis.fujita
  • Track: Genomics
  • Year: 2004
  • Submission: 7/14/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Word based indexing, no stemmer, Inquery stopwords, TF*IDF,BM25TF, MESH expansion,LocusLink Expansion, Pseudo feedback.

pllsgen4a2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: pllsgen4a2
  • Participant: patolis.fujita
  • Track: Genomics
  • Year: 2004
  • Submission: 7/14/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Word based indexing, porter stemmer, Inquery stopwords, TF*IDF,BM25TF, MESH expansion,LocusLink Expansion, Pseudo feedback.

pllsgen4t1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: pllsgen4t1
  • Participant: patolis.fujita
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM Classifier using features from fulltext words, MeSH terms, Gene names, weighted by log-TF.

pllsgen4t2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: pllsgen4t2
  • Participant: patolis.fujita
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM Classifier using features from fulltext words, MeSH terms, Gene names, weighted by log-TF.

pllsgen4t3

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: pllsgen4t3
  • Participant: patolis.fujita
  • Track: Genomics
  • Year: 2004
  • Submission: 8/27/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM Classifier using features from fulltext words, MeSH terms, Gene names, weighted by log-TF.

pllsgen4t4

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: pllsgen4t4
  • Participant: patolis.fujita
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM classifier using features from fulltext words, MeSH terms and Gene names, weighted by log-TF

pllsgen4t5

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: pllsgen4t5
  • Participant: patolis.fujita
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: SVM classifier using features from fulltext words, MeSH terms and Gene names, weighted by log-TF

PSE

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: PSE
  • Participant: german.u.cairo
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: automatic
  • Task: adhoc
  • Run description: no query expansion, no-stemming, hyphenated expressions, OKAPI-BM25, PSE

RMITa

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: RMITa
  • Participant: rmit.scholer
  • Track: Genomics
  • Year: 2004
  • Submission: 7/14/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Plain text search using 'title', and 'need' fields from the query, and all fields from the Medline data. Searches were conducted using indexes from the Zettaire search system.

RMITb

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: RMITb
  • Participant: rmit.scholer
  • Track: Genomics
  • Year: 2004
  • Submission: 7/14/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Novel technique using o 'title', and 'need' fields from the queries o all fields from Medline data o term expansion o species-based search-space partitioning Searches were conducted using indexes from the Zettaire search system.

run1

Results | Participants | Input | Summary | Appendix

  • Run ID: run1
  • Participant: utwente
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: manual
  • Task: adhoc
  • Run description: We just recently joined the genomics track, because of this our system couldnt run all queries yet. the ones it could run are included in our results. The titles and abstracts from the medline records were used.

rutgersGAH1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: rutgersGAH1
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 7/14/2004
  • Type: automatic
  • Task: adhoc
  • Run description: We used MG retrieval system to index the MEDLINE article collection. We used title, abstract, chemical names and mesh terms sections of the articles to index the collection. We constructed queries from topic titles and needs. We performed the retrieval as a two-stage process. First we identifed the relevant documents by MG, and then we performed boolean full-text on the results using MySQL.

rutgersGAH2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: rutgersGAH2
  • Participant: rutgers.dayanik
  • Track: Genomics
  • Year: 2004
  • Submission: 7/14/2004
  • Type: automatic
  • Task: adhoc
  • Run description: We used MG retrieval system to index the MEDLINE article collection. We used title, abstract, chemical names and mesh terms sections of the articles to index the collection. We constructed queries from topic titles and needs. We computed a relevance score for ranking using the frequency and inverse document frequency of query terms as well as the proximity of topic title query terms.

shefauto1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: shefauto1
  • Participant: u.sheffield.gaizauskas
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Use the UMLS to extend synonyms for each keywords. Use LT chunker to find noun phrases and abbrievations.

shefauto2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: shefauto2
  • Participant: u.sheffield.gaizauskas
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: No synonyms are used. Use LT chunker to find noun phrases and abbrievations.

tgnNecaux

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: tgnNecaux
  • Participant: tarragon
  • Track: Genomics
  • Year: 2004
  • Submission: 7/12/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Variant run. Eliminates noise phrases, then uses on all fields and combines using , with CONTEXT field as the auxiliary.

tgnSplit

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: tgnSplit
  • Participant: tarragon
  • Track: Genomics
  • Year: 2004
  • Submission: 7/12/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Base run. Eliminates noise phrases, then uses on all fields and combines using .

THIRcat01

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: THIRcat01
  • Participant: tsinghua.ma
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: balanced between +/- subsets

THIRcat02

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: THIRcat02
  • Participant: tsinghua.ma
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: kmeans clustering, low threshold

THIRcat03

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: THIRcat03
  • Participant: tsinghua.ma
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: using GO-item related words to expand the feature directory

THIRcat04

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: THIRcat04
  • Participant: tsinghua.ma
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: kmeans clustering, high threshold

THIRcat05

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: THIRcat05
  • Participant: tsinghua.ma
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: bool weight instead of TFIDF weight

THUIRgen01

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: THUIRgen01
  • Participant: tsinghua.ma
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: manual
  • Task: adhoc
  • Run description: Use our IR systerm named TMiner.Recognize term in non_text fields,in particular MeSH,Other Term (OT),Gene Symbol(GS) and substance name(RN).Give different weights to weights the field of title(TI) and MeSH.Use relevance feedback technology. All above processes are done autaomatiacally. Manually select gene names and expend queries by an abbreviations dictionary.

THUIRgen02

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: THUIRgen02
  • Participant: tsinghua.ma
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Use our IR systerm named TMiner Recognize term in non_text fields,in particular MeSH,Other Term (OT),Gene Symbol(GS) and substance name(RN).Give different weights to weights the fields of title(TI) and MeSH.Use relevance feedback technology.Query Expansion (QE) uses an abbreviation dictionary. All processes are done automatically.

tnog2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: tnog2
  • Participant: tno.kraaij
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Baseline run, using only title and abstract fields. LM based retrieval model

tnog3

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: tnog3
  • Participant: tno.kraaij
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Combination run baseline run tnog2 interpolated with a run based on MeSH queries. MeSH queries were automatically derived from the top 3 documents in the tnog2 run.

tq0

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: tq0
  • Participant: nlm.umd.ul
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: TexTool, a text neighboring program based on TFIDF vector retrieval, was applied to the queries after removing common stopwords (the, we, etc.) from a list of 313, entering the title twice and the need and context once, to find 1,000 similar abstracts.

TRICSUSM

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: TRICSUSM
  • Participant: u.sanmarcos
  • Track: Genomics
  • Year: 2004
  • Submission: 8/26/2004
  • Type: manual
  • Task: triage
  • Run description: We have designed a decision tree learning algorithm for the text categorization. The set of atributes is based on the keywords, the contents of the glosref tag, and font-change tags. We searched in the abstract for such attributes to determine whether to select the text or not.

UBgtNormJM1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: UBgtNormJM1
  • Participant: suny.buffalo
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: Document indexing Language models using linear interpolation smoothing, Query construction topic expanded with NLM's MetaMap, common phrases discarded before retrieval.

UIowaGN1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: UIowaGN1
  • Participant: u.iowa
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: automatic
  • Task: adhoc
  • Run description: We use all fields in the topics. We automatically expand synonyms using LocusLink. We then conduct retrieval using Lucy/Zettair search engine and rank documents by relevance score. Phrases are in the queries.

utaauto

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: utaauto
  • Participant: u.tampere
  • Track: Genomics
  • Year: 2004
  • Submission: 7/7/2004
  • Type: automatic
  • Task: adhoc
  • Run description: -InQuery retrieval system -Title, Abstract, MH, RN, and GS fields indexed -Text- and MH-field based retrieval -Removal of non-topic words from the topics using average term frequency statistics -Identification of phrases in topics using a collocation method -Pseudorelevance feedback MH terms added to the first queries -Weighting of query keys whose document frequency is low (structural weighting)

utamanu

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: utamanu
  • Participant: u.tampere
  • Track: Genomics
  • Year: 2004
  • Submission: 7/10/2004
  • Type: manual
  • Task: adhoc
  • Run description: -InQuery retrieval system -Title, Abstract, MH, RN, and GS fields indexed -Text- and MH-field based retrieval -The formulation of Boolean queries -New terms from MeSH, genome databases, and dictionaries -Queries were reformulated using MH terms from the top documents of the initial search

uwmtDg04n

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: uwmtDg04n
  • Participant: u.waterloo.clarke
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: From the initial topic, after the removal of stopwords, we have created a plain query (without any expansions) and an expanded query that used data from the Medstract project, LocusLink and the EuGenes database to find acronym expansions and aliases/synonyms for gene names and proteins. We have run these two queries and analyzed the top documents returned in order to validate the possible expansions found in the databases and to find additional terms resulting from permutations of terms (such as "nf.kappa.b" and "kappa.b.nf"). In the second pass, we have run the resulting query (validated expansions/aliases) and scored the documents using Okapi. For the third pass, we computed additional expansion terms based on statistical feedback (obtained by analyzing the top documents returned) and Google feedback (analyzing the passages for the top 20 documents returned by Google for the unexpanded "need-only" query). We then executed the query (expanded by the feedback terms) and scored the documents using Okapi. In the last pass, we used the species names found in the topic to re-rank the documents. In general, if, for instance, "mice" occurs in the need of a topic, all documents containing "mouse", "mice", or "mus.musculus" are getting a better rank.

uwmtDg04tn

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: uwmtDg04tn
  • Participant: u.waterloo.clarke
  • Track: Genomics
  • Year: 2004
  • Submission: 7/15/2004
  • Type: automatic
  • Task: adhoc
  • Run description: From the initial topic, after the removal of stopwords, we have created a plain query (without any expansions) and an expanded query that used data from the Medstract project, LocusLink and the EuGenes database to find acronym expansions and aliases/synonyms for gene names and proteins. We have run these two queries and analyzed the top documents returned in order to validate the possible expansions found in the databases and to find additional terms resulting from permutations of terms (such as "nf.kappa.b" and "kappa.b.nf"). In the second pass, we have run the resulting query (validated expansions/aliases) and scored the documents using Okapi. For the third pass, we computed additional expansion terms based on statistical feedback (obtained by analyzing the top documents returned) and Google feedback (analyzing the passages for the top 20 documents returned by Google for the unexpanded "need-only" query). We then executed the query (expanded by the feedback terms) and scored the documents using Okapi. In the last pass, we used the species names found in the topic to re-rank the documents. In general, if, for instance, "mice" occurs in the need of a topic, all documents containing "mouse", "mice", or "mus.musculus" are getting a better rank.

wdtriage1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: wdtriage1
  • Participant: indiana.u.yang
  • Track: Genomics
  • Year: 2004
  • Submission: 8/30/2004
  • Type: automatic
  • Task: triage
  • Run description: Categorizing by Rainbow with Bayes

wdvqlx1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: wdvqlx1
  • Participant: indiana.u.yang
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: automatic
  • Task: adhoc
  • Run description: query expansion with nouns and phrases

wdvqlxa1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: wdvqlxa1
  • Participant: indiana.u.yang
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: automatic
  • Task: adhoc
  • Run description: query expansion with Gene Ontology synonyms, nouns, and phrases

wiscW

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: wiscW
  • Participant: u.wisconsin
  • Track: Genomics
  • Year: 2004
  • Submission: 8/20/2004
  • Type: automatic
  • Task: annhi
  • Run description: Statistical model using words only as features.

wiscWR

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: wiscWR
  • Participant: u.wisconsin
  • Track: Genomics
  • Year: 2004
  • Submission: 8/20/2004
  • Type: automatic
  • Task: annhi
  • Run description: Statistical model using words plus learned syntactic/contextual rules as features.

wiscWRT

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: wiscWRT
  • Participant: u.wisconsin
  • Track: Genomics
  • Year: 2004
  • Submission: 8/20/2004
  • Type: automatic
  • Task: annhi
  • Run description: Statistical model using words, learned syntactic rules, and induced GO-specific terms as features.

wiscWT

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: wiscWT
  • Participant: u.wisconsin
  • Track: Genomics
  • Year: 2004
  • Submission: 8/20/2004
  • Type: automatic
  • Task: annhi
  • Run description: Statistical model using words plus induced GO-specific terms as features.

york04g1

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: york04g1
  • Participant: york.u
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: automatic
  • Task: adhoc
  • Run description: 1. Retrieval using probability model 2. Weighting using BM25 3. Indexing using Okapi2.31 4. Relevance feedback using Okapi 5. Using machine learning techniques for query expansion

york04g2

Results | Participants | Proceedings | Input | Summary | Appendix

  • Run ID: york04g2
  • Participant: york.u
  • Track: Genomics
  • Year: 2004
  • Submission: 7/16/2004
  • Type: manual
  • Task: adhoc
  • Run description: 1. Retireval using probabilistic model 2. Weighting using BM25 3. Indexing using Okapi2.31 4. Relevance feedback using Okapi 5. Using machine learning techniques for query expansion