Analyzing storage system workloads
| dc.contributor.advisor | Kritzinger, Pieter | en_ZA |
| dc.contributor.author | Sikalinda, Paul | en_ZA |
| dc.date.accessioned | 2014-08-13T19:36:41Z | |
| dc.date.available | 2014-08-13T19:36:41Z | |
| dc.date.issued | 2006 | en_ZA |
| dc.description | Word processed copy. | en_ZA |
| dc.description | Includes bibliographical references. | en_ZA |
| dc.description.abstract | Analysis of storage system workloads is important for a number of reasons. The analysis might be performed to understand the usage patterns of existing storage systems. It is very important for the architects to understand the usage patterns when designing and developing a new, or improving upon the existing design of a storage system. It is also important for a system administrator to understand the usage patterns when configuring and tuning a storage system. The analysis might also be performed to determine the relationship between any two given workloads. Before a decision is taken to pool storage resources to increase the throughput, there is need to establish whether the different workloads involved are correlated or not. Furthermore, the analysis of storage system workloads can be done to monitor the usage and to understand the storage requirements and behavior of system and application software. Another very important reason for analyzing storage system workloads, is the need to come up with correct workload models for storage system evaluation. For the evaluation, based on simulations or otherwise, to be reliable, one has to analyze, understand and correctly model the workloads. In our work we have developed a general too, called ESSWA (Enterprise Storage System Workload Analyzer) for analyzing storage system workloads, which has a number of advantages over other storage system workload analyzers described in literature. Given a storage system workload in the form of an I/O trace file containing data for the workload parameters, ESSWA gives statistics of the data. From the statistics one can derive mathematical models in the form of probability distribution functions for the workload parameters. The statistics and mathematical models describe only the particular workload for which they are produced. This is because storage system workload characteristics are sensitive to the file system and buffer pool design and implementation, so that the results of any analysis are less broadly applicable. We experimented with ESSWA by analyzing storage system workloads represented by three sets of I/O traces at our disposal. Our results, among other things show that: I/O request sizes are influenced by the operating system in use; the start addresses of I/O requests are somewhat influenced by the application; and the exponential probability density function, which is often used in simulation of storage systems to generate inter-arrival times of I/O requests, is not the best model for that purpose in the workloads that we analyzed. We found the Weibull, lognormal and beta probability density functions to be better models. | en_ZA |
| dc.identifier.apacitation | Sikalinda, P. (2006). <i>Analyzing storage system workloads</i>. (Thesis). University of Cape Town ,Faculty of Science ,Department of Computer Science. Retrieved from http://hdl.handle.net/11427/6440 | en_ZA |
| dc.identifier.chicagocitation | Sikalinda, Paul. <i>"Analyzing storage system workloads."</i> Thesis., University of Cape Town ,Faculty of Science ,Department of Computer Science, 2006. http://hdl.handle.net/11427/6440 | en_ZA |
| dc.identifier.citation | Sikalinda, P. 2006. Analyzing storage system workloads. University of Cape Town. | en_ZA |
| dc.identifier.ris | TY - Thesis / Dissertation AU - Sikalinda, Paul AB - Analysis of storage system workloads is important for a number of reasons. The analysis might be performed to understand the usage patterns of existing storage systems. It is very important for the architects to understand the usage patterns when designing and developing a new, or improving upon the existing design of a storage system. It is also important for a system administrator to understand the usage patterns when configuring and tuning a storage system. The analysis might also be performed to determine the relationship between any two given workloads. Before a decision is taken to pool storage resources to increase the throughput, there is need to establish whether the different workloads involved are correlated or not. Furthermore, the analysis of storage system workloads can be done to monitor the usage and to understand the storage requirements and behavior of system and application software. Another very important reason for analyzing storage system workloads, is the need to come up with correct workload models for storage system evaluation. For the evaluation, based on simulations or otherwise, to be reliable, one has to analyze, understand and correctly model the workloads. In our work we have developed a general too, called ESSWA (Enterprise Storage System Workload Analyzer) for analyzing storage system workloads, which has a number of advantages over other storage system workload analyzers described in literature. Given a storage system workload in the form of an I/O trace file containing data for the workload parameters, ESSWA gives statistics of the data. From the statistics one can derive mathematical models in the form of probability distribution functions for the workload parameters. The statistics and mathematical models describe only the particular workload for which they are produced. This is because storage system workload characteristics are sensitive to the file system and buffer pool design and implementation, so that the results of any analysis are less broadly applicable. We experimented with ESSWA by analyzing storage system workloads represented by three sets of I/O traces at our disposal. Our results, among other things show that: I/O request sizes are influenced by the operating system in use; the start addresses of I/O requests are somewhat influenced by the application; and the exponential probability density function, which is often used in simulation of storage systems to generate inter-arrival times of I/O requests, is not the best model for that purpose in the workloads that we analyzed. We found the Weibull, lognormal and beta probability density functions to be better models. DA - 2006 DB - OpenUCT DP - University of Cape Town LK - https://open.uct.ac.za PB - University of Cape Town PY - 2006 T1 - Analyzing storage system workloads TI - Analyzing storage system workloads UR - http://hdl.handle.net/11427/6440 ER - | en_ZA |
| dc.identifier.uri | http://hdl.handle.net/11427/6440 | |
| dc.identifier.vancouvercitation | Sikalinda P. Analyzing storage system workloads. [Thesis]. University of Cape Town ,Faculty of Science ,Department of Computer Science, 2006 [cited yyyy month dd]. Available from: http://hdl.handle.net/11427/6440 | en_ZA |
| dc.language.iso | eng | en_ZA |
| dc.publisher.department | Department of Computer Science | en_ZA |
| dc.publisher.faculty | Faculty of Science | en_ZA |
| dc.publisher.institution | University of Cape Town | |
| dc.subject.other | Computer Science | en_ZA |
| dc.title | Analyzing storage system workloads | en_ZA |
| dc.type | Master Thesis | |
| dc.type.qualificationlevel | Masters | |
| dc.type.qualificationname | MSc | en_ZA |
| uct.type.filetype | Text | |
| uct.type.filetype | Image | |
| uct.type.publication | Research | en_ZA |
| uct.type.resource | Thesis | en_ZA |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- thesis_sci_2006_sikalinda_p.pdf
- Size:
- 5.84 MB
- Format:
- Adobe Portable Document Format
- Description: