Experimental Setup

4.2 Experimental Setup

We aim to compare the effectiveness of the models listed in Table 4.1, each of which was introduced in either the preceding section or in Chapter 2. For all models, we use the resulting query model as estimated query part, ${\hat{θ}}_{Q}$ , in Eq. 2.10. All of the models have a number of parameters in common. In this chapter, we focus on varying these parameters and observing the effect on retrieval effectiveness. We consider the following parameters: $| V_{Q} |$ , $| R |$ , and $λ_{Q}$ . See Section 3.4 for their descriptions. Some of the feedback models under investigation require additional parameter settings. For MBF, and NLLR (cf. Eqs. 2.16 and 4.3) we set $λ_{R} = 0.15$ and $λ_{R} = 0.5$ respectively. For PRM (cf. Eq. 2.27), we set $μ = 0$ , which effectively results in RM-0 estimated on parsimonious document models: $\begin{array}{rcl} P (t_{1}, \dots, t_{| V |} | θ Q) & \propto & \prod_{i = 1}^{| V |} \frac{1}{| R |} \sum_{D \in R} P (t_{i} | {\hat{θ}}_{D}) . & (4.7) \end{array}$

In essence, Eq. 4.7 takes the middle ground between RM and MBF; it combines the estimation method of MBF with the document independence assumption of RM. For evaluation, we use the following diverse set of test collections

TREC Robust 2004 (TREC-ROB-04),
TREC Relevance Feedback 2008 (TREC-RF-2008 and TREC-PRF-08), and
TREC Web 2009, using the Category B subset (TREC-WEB-09).

These collections were introduced in Section 3.3. The percentages and significance tests in the result tables in this chapter indicate the difference with respect to the baseline—we use a ‘*’ to indicate a significant difference, as detailed in Section 3.2.2. In the next section we consider retrieval effectiveness using pseudo relevance feedback and in Section 4.4 we turn to explicit relevance feedback.

PrevTail

Front