A hybrid scavenger grid approach to intranet search

dc.contributor.advisorSuleman, Husseinen_ZA
dc.contributor.authorNakashole, Ndapandulaen_ZA
dc.date.accessioned2014-10-30T13:54:10Z
dc.date.available2014-10-30T13:54:10Z
dc.date.issued2009en_ZA
dc.descriptionIncludes bibliographical references (p. 93-98).en_ZA
dc.description.abstractAccording to a 2007 global survey of 178 organisational intranets, 3 out of 5 organisations are not satisfied with their intranet search services. However, as intranet data collections become large, effective full-text intranet search services are needed more than ever before. To provide an effective full-text search service based on current information retrieval algorithms, organisations have to deal with the need for greater computational power. Hardware architectures that can scale to large data collections and can be obtained and maintained at a reasonable cost are needed. Web search engines address scalability and cost-effectiveness by using large-scale centralised cluster architectures. The scalability of cluster architectures is evident in the ability of Web search engines to respond to millions of queries within a few seconds while searching very large data collections. Though more cost-effective than high-end supercomputers, cluster architectures still have relatively high acquisition and maintenance costs. Where information retrieval is not the core business of an organisation, a cluster-based approach may not be economically viable. A hybrid scavenger grid is proposed as an alternative architecture - it consists of a combination of dedicated and dynamic resources in the form of idle desktop workstations. From the dedicated resources, the architecture gets predictability and reliability whereas from the dynamic resources it gets scalability. An experimental search engine was deployed on a hybrid scavenger grid and evaluated. Test results showed that the resources of the grid can be organised to deliver the best performance by using the optimal number of machines and scheduling the optimal combination of tasks that the machines perform. A system-efficiency and cost-effectiveness comparison of a grid and a multi-core machine showed that for workloads of modest to large sizes, the grid architecture delivers better throughput per unit cost than the multi-core, at a system efficiency that is comparable to that of the multi-core. The study has shown that a hybrid scavenger grid is a feasible search engine architecture that is cost-effective and scales to medium- to large-scale data collections.en_ZA
dc.identifier.apacitationNakashole, N. (2009). <i>A hybrid scavenger grid approach to intranet search</i>. (Thesis). University of Cape Town ,Faculty of Science ,Department of Computer Science. Retrieved from http://hdl.handle.net/11427/8977en_ZA
dc.identifier.chicagocitationNakashole, Ndapandula. <i>"A hybrid scavenger grid approach to intranet search."</i> Thesis., University of Cape Town ,Faculty of Science ,Department of Computer Science, 2009. http://hdl.handle.net/11427/8977en_ZA
dc.identifier.citationNakashole, N. 2009. A hybrid scavenger grid approach to intranet search. University of Cape Town.en_ZA
dc.identifier.ris TY - Thesis / Dissertation AU - Nakashole, Ndapandula AB - According to a 2007 global survey of 178 organisational intranets, 3 out of 5 organisations are not satisfied with their intranet search services. However, as intranet data collections become large, effective full-text intranet search services are needed more than ever before. To provide an effective full-text search service based on current information retrieval algorithms, organisations have to deal with the need for greater computational power. Hardware architectures that can scale to large data collections and can be obtained and maintained at a reasonable cost are needed. Web search engines address scalability and cost-effectiveness by using large-scale centralised cluster architectures. The scalability of cluster architectures is evident in the ability of Web search engines to respond to millions of queries within a few seconds while searching very large data collections. Though more cost-effective than high-end supercomputers, cluster architectures still have relatively high acquisition and maintenance costs. Where information retrieval is not the core business of an organisation, a cluster-based approach may not be economically viable. A hybrid scavenger grid is proposed as an alternative architecture - it consists of a combination of dedicated and dynamic resources in the form of idle desktop workstations. From the dedicated resources, the architecture gets predictability and reliability whereas from the dynamic resources it gets scalability. An experimental search engine was deployed on a hybrid scavenger grid and evaluated. Test results showed that the resources of the grid can be organised to deliver the best performance by using the optimal number of machines and scheduling the optimal combination of tasks that the machines perform. A system-efficiency and cost-effectiveness comparison of a grid and a multi-core machine showed that for workloads of modest to large sizes, the grid architecture delivers better throughput per unit cost than the multi-core, at a system efficiency that is comparable to that of the multi-core. The study has shown that a hybrid scavenger grid is a feasible search engine architecture that is cost-effective and scales to medium- to large-scale data collections. DA - 2009 DB - OpenUCT DP - University of Cape Town LK - https://open.uct.ac.za PB - University of Cape Town PY - 2009 T1 - A hybrid scavenger grid approach to intranet search TI - A hybrid scavenger grid approach to intranet search UR - http://hdl.handle.net/11427/8977 ER - en_ZA
dc.identifier.urihttp://hdl.handle.net/11427/8977
dc.identifier.vancouvercitationNakashole N. A hybrid scavenger grid approach to intranet search. [Thesis]. University of Cape Town ,Faculty of Science ,Department of Computer Science, 2009 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/8977en_ZA
dc.language.isoengen_ZA
dc.publisher.departmentDepartment of Computer Scienceen_ZA
dc.publisher.facultyFaculty of Scienceen_ZA
dc.publisher.institutionUniversity of Cape Town
dc.subject.otherComputer Scienceen_ZA
dc.titleA hybrid scavenger grid approach to intranet searchen_ZA
dc.typeMaster Thesis
dc.type.qualificationlevelMasters
dc.type.qualificationnameMScen_ZA
uct.type.filetypeText
uct.type.filetypeImage
uct.type.publicationResearchen_ZA
uct.type.resourceThesisen_ZA
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_sci_2009_nakashole_n.pdf
Size:
5.32 MB
Format:
Adobe Portable Document Format
Description:
Collections