dc.contributor.author | Nikolaou, Panagiota | en |
dc.contributor.author | Sazeides, Yiannakis | en |
dc.contributor.author | Ndreu, L. | en |
dc.contributor.author | Kleanthous, Marios M. | en |
dc.creator | Nikolaou, Panagiota | en |
dc.creator | Sazeides, Yiannakis | en |
dc.creator | Ndreu, L. | en |
dc.creator | Kleanthous, Marios | en |
dc.date.accessioned | 2019-11-13T10:41:32Z | |
dc.date.available | 2019-11-13T10:41:32Z | |
dc.date.issued | 2015 | |
dc.identifier.isbn | 978-1-4503-4034-2 | |
dc.identifier.uri | http://gnosis.library.ucy.ac.cy/handle/7/54643 | |
dc.description.abstract | Total Cost of Ownership (TCO) is a key optimization metric for the design of a datacenter. This paper proposes, for the first time, a framework for modeling the implications of DRAM failures and DRAM error protection techniques on the TCO of a datacenter. The framework captures the Effects and interactions of several key parameters including: the choice of DRAM protection technique (e.g. single vs dual channel Chipkill), device width (x4 or x8), memory size, power, FITs for various failure modes, the performance, power and temperature overheads of a protection technique for a given service and mixes of collocated services. The usefulness of the proposed framework is demonstrated through several case studies that identify the best DRAM protection technique in each case, in terms of TCO. Interestingly, our analysis reveals that among the three DRAM protection techniques considered, there is no one that is always superior to all the others. Moreover, each technique is better than the others for some cases. This underlines the importance and the need of the proposed framework for making optimal memory protection datacenter design decisions. As part of this work, we analyze and report the performance and power with single channel and dual channel Chipkill on real hardware when running a web search benchmark alone and collocated with benchmarks of varying memory intensity. This analysis reveals that the choice of memory protection can have serious performance and TCO ramifications depending on the memory characteristics of collocated services. Other analysis reveals that, for the datacenter and services assumed in this study, when using Chipkill protection it can be beneficial for TCO to use DRAM with 100x the failure rate of a baseline DRAM as long as the cost per DIMM is at least a dollar less compared to the baseline. © 2015 ACM. | en |
dc.publisher | IEEE Computer Society | en |
dc.source | Proceedings of the Annual International Symposium on Microarchitecture, MICRO | en |
dc.source | 48th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2015 | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84959862033&doi=10.1145%2f2830772.2830804&partnerID=40&md5=2516446146e8ea4bf27a0063ac38d887 | |
dc.subject | World Wide Web | en |
dc.subject | Computer architecture | en |
dc.subject | reliability | en |
dc.subject | Failure analysis | en |
dc.subject | Outages | en |
dc.subject | Costs | en |
dc.subject | Cost benefit analysis | en |
dc.subject | Benchmarking | en |
dc.subject | Program processors | en |
dc.subject | total cost of ownership | en |
dc.subject | Integrated circuit design | en |
dc.subject | Data centers | en |
dc.subject | Dynamic random access storage | en |
dc.subject | co-running services | en |
dc.subject | Datacenter designs | en |
dc.subject | datacenters | en |
dc.subject | DRAM | en |
dc.subject | Memory protection | en |
dc.subject | Offline services | en |
dc.subject | online and offline services | en |
dc.subject | Optimal memory | en |
dc.subject | Protection techniques | en |
dc.subject | Single channels | en |
dc.title | Modeling the implications of DRAM failures and protection techniques on datacenter TCO | en |
dc.type | info:eu-repo/semantics/conferenceObject | |
dc.identifier.doi | 10.1145/2830772.2830804 | |
dc.description.volume | 05-09-December-2015 | en |
dc.description.startingpage | 572 | |
dc.description.endingpage | 584 | |
dc.author.faculty | 002 Σχολή Θετικών και Εφαρμοσμένων Επιστημών / Faculty of Pure and Applied Sciences | |
dc.author.department | Τμήμα Πληροφορικής / Department of Computer Science | |
dc.type.uhtype | Conference Object | en |
dc.description.notes | <p>Sponsors: ARM | en |
dc.description.notes | et al. | en |
dc.description.notes | IBM | en |
dc.description.notes | Intel | en |
dc.description.notes | Microsoft | en |
dc.description.notes | NetApp | en |
dc.description.notes | Conference code: 119360 | en |
dc.description.notes | Cited By :2</p> | en |