Система будет остановлена для регулярного обслуживания. Пожалуйста, сохраните рабочие данные и выйдите из системы.
dc.contributor.author | Zhakiyev, Daniyar![]() |
|
dc.date.accessioned | 2024-05-20T14:24:47Z | |
dc.date.available | 2024-05-20T14:24:47Z | |
dc.date.issued | 2024-04-25 | |
dc.identifier.citation | Zhakiyev, Daniyar (2024) Evaluation Of ML Inference Workloads On Reduced Refresh Rate Dram. Nazarbayev University School of Engineering and Digital Sciences | en_US |
dc.identifier.uri | http://nur.nu.edu.kz/handle/123456789/7693 | |
dc.description.abstract | This thesis investigates the potential benefits of reducing DRAM refresh rates to improve the performance of Machine learning and Neural Network inference workloads. The increasing integration of ML and NN models in various industries makes it necessary to optimize these models to efficiently use computing resources, particularly in devices with limited capabilities. To maintain data integrity, DRAM requires periodic refresh cycles, which has a substantial impact on power consumption and system efficiency. Thus, DRAM refresh rates can be lowered for performance purposes. While this study does not add any additional components to the memory controller, other proposed approaches had hardware or software overhead. Preliminary findings indicate that NNs have a remarkable tolerance to data loss, caused by reduced refresh rates. Results show that DRAM refresh rates can be reduced by up to 15-150 times the usual refresh rate without significant impact on NN accuracy. Additionally, NNs showed 2.7% faster inference and consumed 5.6% less power at refresh rate of 1 second. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Nazarbayev University School of Engineering and Digital Sciences | en_US |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/us/ | * |
dc.subject | DRAM | en_US |
dc.subject | Refresh Rate | en_US |
dc.subject | Neural Network Inference | en_US |
dc.subject | Type of access: Open Access | en_US |
dc.title | EVALUATION OF ML INFERENCE WORKLOADS ON REDUCED REFRESH RATE DRAM | en_US |
dc.type | Master's thesis | en_US |
workflow.import.source | science |
The following license files are associated with this item: