- Timestamp:
- 06/03/13 13:55:25 (12 years ago)
- Location:
- papers/SMPaT-2012_DCWoRMS
- Files:
-
- 5 edited
Legend:
- Unmodified
- Added
- Removed
-
papers/SMPaT-2012_DCWoRMS/elsarticle-DCWoRMS.aux
r1068 r1069 8 8 \citation{fit4green} 9 9 \citation{networks} 10 \citation{fit4green_carbon_scheduler} 10 11 \citation{sgi} 11 12 \citation{colt} 12 13 \citation{ecocooling} 14 \citation{SimGrid} 13 15 \citation{ff} 14 16 \citation{DCD_Romonet} … … 24 26 \@writefile{toc}{\contentsline {section}{\numberline {3}DCworms}{5}} 25 27 \citation{GSSIM} 28 \@writefile{lof}{\contentsline {figure}{\numberline {1}{\ignorespaces DCworms architecture}}{6}} 29 \newlabel{fig:arch}{{1}{6}} 30 \@writefile{toc}{\contentsline {subsection}{\numberline {3.1}Architecture}{6}} 26 31 \citation{GWF} 27 32 \citation{SLURM} 28 33 \citation{TORQUE} 29 \@writefile{lof}{\contentsline {figure}{\numberline {1}{\ignorespaces DCworms architecture}}{6}}30 \newlabel{fig:arch}{{1}{6}}31 \@writefile{toc}{\contentsline {subsection}{\numberline {3.1}Architecture}{6}}32 34 \citation{Ghislain} 33 35 \@writefile{toc}{\contentsline {subsection}{\numberline {3.2}Workload modeling}{7}} 34 \@writefile{toc}{\contentsline {subsection}{\numberline {3.3}Resource modeling}{7}}35 36 \@writefile{lof}{\contentsline {figure}{\numberline {2}{\ignorespaces Levels of information about jobs}}{8}} 36 37 \newlabel{fig:jobsStructure}{{2}{8}} 38 \@writefile{toc}{\contentsline {subsection}{\numberline {3.3}Resource modeling}{8}} 37 39 \citation{d2.2} 40 \@writefile{toc}{\contentsline {subsection}{\numberline {3.4}Energy management concept in DCworms}{9}} 38 41 \citation{GSSIM_Energy} 39 \@writefile{toc}{\contentsline {subsection}{\numberline {3.4}Energy management concept in DCworms}{9}}40 42 \@writefile{toc}{\contentsline {subsubsection}{\numberline {3.4.1}Power management}{10}} 41 43 \@writefile{toc}{\contentsline {paragraph}{\textbf {Power profile}}{10}} … … 46 48 \@writefile{toc}{\contentsline {subsection}{\numberline {3.5}Application performance modeling}{11}} 47 49 \newlabel{sec:apps}{{3.5}{11}} 50 \citation{GSSIM} 48 51 \citation{e2dc13} 49 52 \citation{d2.2} … … 54 57 \@writefile{toc}{\contentsline {subsection}{\numberline {4.1}Power consumption models}{13}} 55 58 \newlabel{sec:power}{{4.1}{13}} 56 \newlabel{eq:ohm-law}{{2}{13}}57 59 \@writefile{lof}{\contentsline {figure}{\numberline {5}{\ignorespaces Power in time for the highest frequency}}{14}} 58 60 \newlabel{fig:fans_P}{{5}{14}} 61 \newlabel{eq:ohm-law}{{2}{14}} 59 62 \@writefile{toc}{\contentsline {subsection}{\numberline {4.2}Static approach}{14}} 60 \newlabel{eq:static}{{3}{14}}61 63 \citation{fit4green_scheduler} 64 \newlabel{eq:static}{{3}{15}} 62 65 \@writefile{toc}{\contentsline {subsection}{\numberline {4.3}Resource load}{15}} 63 66 \newlabel{eq:dynamic}{{4}{15}} 64 67 \newlabel{eq:model}{{7}{16}} 65 68 \@writefile{toc}{\contentsline {subsection}{\numberline {4.4}Application specific}{16}} 66 \newlabel{eq:app}{{8}{16}}67 69 \citation{e2dc12} 68 70 \citation{abinit} 69 \ citation{cray}71 \newlabel{eq:app}{{8}{17}} 70 72 \@writefile{toc}{\contentsline {section}{\numberline {5}Experiments and evaluation}{17}} 71 73 \newlabel{sec:experiments}{{5}{17}} … … 73 75 \@writefile{lot}{\contentsline {table}{\numberline {1}{\ignorespaces RECS system configuration}}{17}} 74 76 \newlabel{testBed}{{1}{17}} 75 \ @writefile{toc}{\contentsline {subsection}{\numberline {5.2}Evaluated applications}{17}}77 \citation{cray} 76 78 \citation{linpack} 77 79 \citation{tar} 78 80 \citation{fft} 79 \@writefile{toc}{\contentsline {subsection}{\numberline {5.3}Models}{18}} 80 \newlabel{sec:models}{{5.3}{18}} 81 \@writefile{lot}{\contentsline {table}{\numberline {2}{\ignorespaces $P_{cpubase}$ values in Watts}}{18}} 82 \newlabel{nodeBasePowerUsage}{{2}{18}} 83 \@writefile{lot}{\contentsline {table}{\numberline {3}{\ignorespaces $P_{app}$ values in Watts}}{19}} 84 \newlabel{appPowerUsage}{{3}{19}} 85 \@writefile{lot}{\contentsline {table}{\numberline {4}{\ignorespaces Power models error in \%}}{19}} 86 \newlabel{expPowerModels}{{4}{19}} 87 \@writefile{toc}{\contentsline {subsection}{\numberline {5.4}Methodology}{19}} 88 \@writefile{toc}{\contentsline {subsection}{\numberline {5.5}Computational analysis}{20}} 89 \@writefile{lot}{\contentsline {table}{\numberline {5}{\ignorespaces Workload characteristics}}{21}} 90 \newlabel{workloadCharacteristics}{{5}{21}} 81 \@writefile{toc}{\contentsline {subsection}{\numberline {5.2}Evaluated applications}{18}} 82 \@writefile{toc}{\contentsline {subsection}{\numberline {5.3}Methodology}{18}} 83 \@writefile{lot}{\contentsline {table}{\numberline {2}{\ignorespaces Workload characteristics}}{19}} 84 \newlabel{workloadCharacteristics}{{2}{19}} 85 \@writefile{toc}{\contentsline {subsection}{\numberline {5.4}Models}{20}} 86 \newlabel{sec:models}{{5.4}{20}} 87 \@writefile{lot}{\contentsline {table}{\numberline {3}{\ignorespaces $P_{cpubase}$ values in Watts}}{20}} 88 \newlabel{nodeBasePowerUsage}{{3}{20}} 89 \@writefile{lot}{\contentsline {table}{\numberline {4}{\ignorespaces $P_{app}$ values in Watts}}{20}} 90 \newlabel{appPowerUsage}{{4}{20}} 91 \@writefile{lot}{\contentsline {table}{\numberline {5}{\ignorespaces Power models error in \%}}{21}} 92 \newlabel{expPowerModels}{{5}{21}} 93 \@writefile{toc}{\contentsline {subsection}{\numberline {5.5}Resource management policies evaluation}{21}} 91 94 \@writefile{toc}{\contentsline {subsubsection}{\numberline {5.5.1}Random approach}{21}} 92 95 \@writefile{lof}{\contentsline {figure}{\numberline {6}{\ignorespaces Comparison of energy usage for Random (left) and Random + switching off unused nodes strategy (right)}}{22}} 93 96 \newlabel{fig:70r_rnpm}{{6}{22}} 94 97 \@writefile{toc}{\contentsline {subsubsection}{\numberline {5.5.2}Energy optimization}{22}} 95 \@writefile{lot}{\contentsline {table}{\numberline {6}{\ignorespaces Measured power of testbed nodes in idle state}}{2 2}}96 \newlabel{idlePower}{{6}{2 2}}98 \@writefile{lot}{\contentsline {table}{\numberline {6}{\ignorespaces Measured power of testbed nodes in idle state}}{23}} 99 \newlabel{idlePower}{{6}{23}} 97 100 \@writefile{lof}{\contentsline {figure}{\numberline {7}{\ignorespaces Energy usage optimization strategy}}{23}} 98 101 \newlabel{fig:70eo}{{7}{23}} … … 102 105 \@writefile{lof}{\contentsline {figure}{\numberline {9}{\ignorespaces Frequency downgrading strategy}}{25}} 103 106 \newlabel{fig:70dfs}{{9}{25}} 104 \@writefile{toc}{\contentsline {subsection}{\numberline {5.6}Discussion}{25}}105 107 \@writefile{lof}{\contentsline {figure}{\numberline {10}{\ignorespaces Schedules obtained for Random strategy (left) and Random + lowest frequency strategy (right) for 10\% of system load}}{26}} 106 108 \newlabel{fig:dfsComp}{{10}{26}} 109 \@writefile{toc}{\contentsline {subsubsection}{\numberline {5.5.4}Summary}{26}} 107 110 \@writefile{lot}{\contentsline {table}{\numberline {7}{\ignorespaces Energy usage [kWh] for different level of system load. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency}}{26}} 108 111 \newlabel{loadEnergy}{{7}{26}} 109 \@writefile{lot}{\contentsline {table}{\numberline {8}{\ignorespaces Makespan [s] for different level of system load. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency}}{26}} 110 \newlabel{loadMakespan}{{8}{26}} 111 \@writefile{toc}{\contentsline {section}{\numberline {6}Conclusions and future work}{27}} 112 \@writefile{lot}{\contentsline {table}{\numberline {8}{\ignorespaces Makespan [s] for different level of system load. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency}}{27}} 113 \newlabel{loadMakespan}{{8}{27}} 114 \@writefile{toc}{\contentsline {subsection}{\numberline {5.6}Verification of models}{27}} 115 \newlabel{eq:modelLoad}{{9}{28}} 116 \@writefile{lot}{\contentsline {table}{\numberline {9}{\ignorespaces Comparison of energy usage estimations [kWh] obtained for two power consumption models. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency}}{28}} 117 \newlabel{modelAccuracy}{{9}{28}} 118 \@writefile{toc}{\contentsline {section}{\numberline {6}Conclusions and future work}{28}} 112 119 \bibcite{fit4green}{{1}{}{{}}{{}}} 113 120 \bibcite{e2dc13}{{2}{}{{}}{{}}} 114 121 \bibcite{e2dc12}{{3}{}{{}}{{}}} 115 122 \bibcite{CloudSim}{{4}{}{{}}{{}}} 116 \newlabel{}{{6}{28}}117 123 \bibcite{DCSG}{{5}{}{{}}{{}}} 118 \bibcite{DCD_Romonet}{{6}{}{{}}{{}}} 119 \bibcite{networks}{{7}{}{{}}{{}}} 120 \bibcite{Ghislain}{{8}{}{{}}{{}}} 121 \bibcite{games}{{9}{}{{}}{{}}} 122 \bibcite{GreenCloud}{{10}{}{{}}{{}}} 123 \bibcite{GSSIM}{{11}{}{{}}{{}}} 124 \bibcite{GSSIM_Energy}{{12}{}{{}}{{}}} 125 \bibcite{koomey}{{13}{}{{}}{{}}} 126 \bibcite{fit4green_scheduler}{{14}{}{{}}{{}}} 127 \bibcite{d2.2}{{15}{}{{}}{{}}} 128 \bibcite{abinit}{{16}{}{{}}{{}}} 129 \bibcite{cray}{{17}{}{{}}{{}}} 130 \bibcite{fft}{{18}{}{{}}{{}}} 131 \bibcite{linpack}{{19}{}{{}}{{}}} 132 \bibcite{tar}{{20}{}{{}}{{}}} 133 \bibcite{GWF}{{21}{}{{}}{{}}} 134 \bibcite{SLURM}{{22}{}{{}}{{}}} 135 \bibcite{SWF}{{23}{}{{}}{{}}} 136 \bibcite{TORQUE}{{24}{}{{}}{{}}} 137 \bibcite{colt}{{25}{}{{}}{{}}} 138 \bibcite{coolemall}{{26}{}{{}}{{}}} 139 \bibcite{ecocooling}{{27}{}{{}}{{}}} 140 \bibcite{ff}{{28}{}{{}}{{}}} 141 \bibcite{pue}{{29}{}{{}}{{}}} 142 \bibcite{sgi}{{30}{}{{}}{{}}} 124 \bibcite{SimGrid}{{6}{}{{}}{{}}} 125 \bibcite{DCD_Romonet}{{7}{}{{}}{{}}} 126 \bibcite{networks}{{8}{}{{}}{{}}} 127 \bibcite{Ghislain}{{9}{}{{}}{{}}} 128 \newlabel{}{{6}{30}} 129 \bibcite{games}{{10}{}{{}}{{}}} 130 \bibcite{GreenCloud}{{11}{}{{}}{{}}} 131 \bibcite{GSSIM}{{12}{}{{}}{{}}} 132 \bibcite{GSSIM_Energy}{{13}{}{{}}{{}}} 133 \bibcite{koomey}{{14}{}{{}}{{}}} 134 \bibcite{fit4green_scheduler}{{15}{}{{}}{{}}} 135 \bibcite{fit4green_carbon_scheduler}{{16}{}{{}}{{}}} 136 \bibcite{d2.2}{{17}{}{{}}{{}}} 137 \bibcite{abinit}{{18}{}{{}}{{}}} 138 \bibcite{cray}{{19}{}{{}}{{}}} 139 \bibcite{fft}{{20}{}{{}}{{}}} 140 \bibcite{linpack}{{21}{}{{}}{{}}} 141 \bibcite{tar}{{22}{}{{}}{{}}} 142 \bibcite{GWF}{{23}{}{{}}{{}}} 143 \bibcite{SLURM}{{24}{}{{}}{{}}} 144 \bibcite{SWF}{{25}{}{{}}{{}}} 145 \bibcite{TORQUE}{{26}{}{{}}{{}}} 146 \bibcite{colt}{{27}{}{{}}{{}}} 147 \bibcite{coolemall}{{28}{}{{}}{{}}} 148 \bibcite{ecocooling}{{29}{}{{}}{{}}} 149 \bibcite{ff}{{30}{}{{}}{{}}} 150 \bibcite{pue}{{31}{}{{}}{{}}} 151 \bibcite{sgi}{{32}{}{{}}{{}}} 143 152 \providecommand\NAT@force@numbers{}\NAT@force@numbers -
papers/SMPaT-2012_DCWoRMS/elsarticle-DCWoRMS.fdb_latexmk
r1068 r1069 1 1 # Fdb version 2 2 ["pdflatex"] 1370 008374"elsarticle-DCWoRMS.tex" "elsarticle-DCWoRMS.pdf" "elsarticle-DCWoRMS"2 ["pdflatex"] 1370260512 "elsarticle-DCWoRMS.tex" "elsarticle-DCWoRMS.pdf" "elsarticle-DCWoRMS" 3 3 "/usr/local/texlive/2010/texmf-dist/tex/context/base/supp-pdf.mkii" 1251025892 71625 fad1c4b52151c234b6873a255b0ad6b3 "" 4 4 "/usr/local/texlive/2010/texmf-dist/tex/generic/oberdiek/etexcmds.sty" 1267408169 5670 cacb018555825cfe95cd1e1317d82c1d "" … … 30 30 "/usr/local/texlive/2010/texmf-dist/tex/latex/psnfss/upsy.fd" 1137110629 148 2da0acd77cba348f34823f44cabf0058 "" 31 31 "/usr/local/texlive/2010/texmf-dist/tex/latex/psnfss/upzd.fd" 1137110629 148 b2a94082cb802f90d3daf6dd0c7188a0 "" 32 "elsarticle-DCWoRMS.aux" 1370 008377 7677 f71d3a69833156dd0cd55997caa941bf""33 "elsarticle-DCWoRMS.spl" 1370 0083750 d41d8cd98f00b204e9800998ecf8427e ""34 "elsarticle-DCWoRMS.tex" 1370 007766 81944 a4f37cdda6dc074fed8954549930ec15""32 "elsarticle-DCWoRMS.aux" 1370260514 8344 4fae76ffb960809b0662b0a5380c1cf5 "" 33 "elsarticle-DCWoRMS.spl" 1370260512 0 d41d8cd98f00b204e9800998ecf8427e "" 34 "elsarticle-DCWoRMS.tex" 1370260508 86035 d15304aaf50305115c88a8a70ebe8af4 "" 35 35 "elsarticle.cls" 1352447924 26095 ad44f4892f75e6e05dca57a3581f78d1 "" 36 36 "fig/70dfsGantt.png" 1370005142 138858 edc557d8862a7825f2941d8c69c30659 "" -
papers/SMPaT-2012_DCWoRMS/elsarticle-DCWoRMS.tex
r1068 r1069 174 174 %Furthermore, available power supply is usually limited so it also may reduce data center development capabilities, especially looking at challenges related to exascale computing breakthrough foreseen within this decade. 175 175 176 For these reasons many efforts were undertaken to measure and study energy efficiency of data centers. There are projects focused on data center monitoring and management \cite{games}\cite{fit4green} whereas others on energy efficiency of networks \cite{networks} . Additionally, vendors offer a wide spectrum of energy efficient solutions for computing and cooling \cite{sgi}\cite{colt}\cite{ecocooling}. However, a variety of solutions and configuration options can be applied planning new or upgrading existing data centers.176 For these reasons many efforts were undertaken to measure and study energy efficiency of data centers. There are projects focused on data center monitoring and management \cite{games}\cite{fit4green} whereas others on energy efficiency of networks \cite{networks} or distributed computing infrastructures, like grids \cite{fit4green_carbon_scheduler}. Additionally, vendors offer a wide spectrum of energy efficient solutions for computing and cooling \cite{sgi}\cite{colt}\cite{ecocooling}. However, a variety of solutions and configuration options can be applied planning new or upgrading existing data centers. 177 177 In order to optimize a design or configuration of data center we need a thorough study using appropriate metrics and tools evaluating how much computation or data processing can be done within given power and energy budget and how it affects temperatures, heat transfers, and airflows within data center. 178 178 Therefore, there is a need for simulation tools and models that approach the problem from a perspective of end users and take into account all the factors that are critical to understanding and improving the energy efficiency of data centers, in particular, hardware characteristics, applications, management policies, and cooling. 179 179 These tools should support data center designers and operators by answering questions how specific application types, levels of load, hardware specifications, physical arrangements, cooling technology, etc. impact overall data center energy efficiency. 180 There are various tools that allow simulation of computing infrastructures . On one hand they include advanced packages for modeling heat transfer and energy consumption in data centers \cite{ff} or tools concentrating on their financial analysis \cite{DCD_Romonet}. On the other hand, there are simulators focusing on computations such as CloudSim \cite{CloudSim}. The CoolEmAll project aims to integrate these approaches and enable advanced analysis of data center efficiency taking into account all these aspects \cite{e2dc12}\cite{coolemall}.180 There are various tools that allow simulation of computing infrastructures, like SimGrid\cite{SimGrid}. On one hand they include advanced packages for modeling heat transfer and energy consumption in data centers \cite{ff} or tools concentrating on their financial analysis \cite{DCD_Romonet}. On the other hand, there are simulators focusing on computations such as CloudSim \cite{CloudSim}. The CoolEmAll project aims to integrate these approaches and enable advanced analysis of data center efficiency taking into account all these aspects \cite{e2dc12}\cite{coolemall}. 181 181 182 182 One of the results of the CoolEmAll project is the Data Center Workload and Resource Management Simulator (DCworms) which enables modeling and simulation of computing infrastructures to estimate their performance, energy consumption, and energy-efficiency metrics for diverse workloads and management policies. … … 200 200 In terms of application modeling, all tools, except DCSG Simulator, describe the application with a number of computational and communicational requirements. In addition, GreenCloud and DCworms allow introducing the QoS requirements by taking into account the time constraints during the simulation. DCSG Simulator instead of modeling of the single application, enables the definition of workload that leads to a given utilization level. However, only DCworms supports application performance modeling by not only incorporating simple requirements that are taken into account during scheduling, but also by allowing specification of task execution time. 201 201 202 GreenCloud, CloudSim and DCworms are released as Open Source under the GPL. DCSG Simulator is available under anOSL V3.0 open-source license, however, it can be only accessed by the DCSG members.202 GreenCloud, CloudSim and DCworms are released as Open Source under the GPL. DCSG Simulator is available under the OSL V3.0 open-source license, however, it can be only accessed by the DCSG members. 203 203 204 204 Summarizing, DCworms stands out from other tools due to the flexibility in terms of data center equipment and structure definition. … … 210 210 211 211 Data Center workload and resource management simulator (DCworms) is a simulation tool based on the GSSIM framework \cite{GSSIM} developed by Poznan Supercomputing and Networking Center (PSNC). 212 GSSIM has been proposed to provide an automated tool for experimental studies of various resource management and scheduling strategies in distributed computing systems. DCworms extends its basic functionality and adds some additional features related to the energy efficiency issues in data centers. In this section we will introduce the functionality of the simulator, in terms of modeling and simulation of large scale distributed systems like Grids and Clouds.212 GSSIM has been proposed to provide an automated tool for experimental studies of various resource management and scheduling strategies in distributed computing systems. DCworms extends its basic functionality and adds some additional features related to the energy efficiency issues in data centers. In this section we will introduce the functionality of the simulator, in terms of modeling and simulation of data centers. 213 213 214 214 … … 222 222 \end{figure} 223 223 224 DCworms is an event-driven simulation tool written in Java. In general, input data for the DCworms consist of workload and resources descriptions. They can be provided by the user, read from real traces or generated using the generator module. In this terms DCworms benefits from the GSSIM workload generator tool that allows creating synthetic workloads (\cite{GSSIM}). However, the key elements of the presented architecture are plugins. They allow the researchers to configure and adapt the simulation environment to the peculiarities of their studies, starting from modeling job performance, through energy estimations up to implementation of resource management and scheduling policies. Each plugin can be implemented independently and plugged into a specific experiment. Results of experiments are collected, aggregated, and visualized using the statistics module. Due to a modular and plug-able architecture DCworms can be applied to specific resource management problems and address different usersâ requirements. 224 DCworms is an event-driven simulation tool written in Java. In general, input data for the DCworms consist of workload and resource descriptions. They can be provided by the user, read from real traces or generated using the generator module. In this terms DCworms benefits from the GSSIM workload generator tool that allows creating synthetic workloads (\cite{GSSIM}). However, the key elements of the presented architecture are plugins. They allow the researchers to configure and adapt the simulation environment to the peculiarities of their studies, starting from modeling job performance, through energy estimations up to implementation of resource management and scheduling policies. Each plugin can be implemented independently and plugged into a specific experiment. Afterwards, politics introduced by particular plugins are applied to a given set of tasks and resources and are triggered by each change of simulated environment state. Results of experiments are collected, aggregated, and visualized using the statistics module. The output of each simulation consists of a number of statistics created to help in 225 comparative evaluations of resource and workload management policies. Due to a modular and plug-able architecture DCworms can be applied to specific resource management problems and address different usersâ requirements. 225 226 226 227 … … 333 334 \item network parameters 334 335 \end{itemize} 335 Using these parameters developers can for instance take into account the architectures of the underlying systems, such as multi-core processors, and their impact on the final performance of applications. 336 Using these parameters developers can for instance take into account the architectures of the underlying systems, such as multi-core processors, and their impact on the final performance of applications. Examples of application performance modeling in DCworms are shown in \cite{GSSIM}. 336 337 337 338 … … 339 340 \section{Modeling of energy consumption in DCworms} 340 341 341 DCworms is an open framework in which various models and algorithms can be investigated as presented in Section \ref{sec:apps}. In this section, we discuss possible approaches to modeling that can be applied to simulation of energy-efficiency of distributed computing systems. In general, to facilitate the simulation process, DCworms provides some basic implementation of power consumption, air throughput and thermal models. We introduce power consumption models as examples and validate part of them by experiments in real computing system (in Section \ref{sec:experiments}). 342 DCworms is an open framework in which various models and algorithms can be investigated as presented in Section \ref{sec:apps}. In this section, we discuss possible approaches to modeling that can be applied to simulation of energy-efficiency of distributed computing systems. In general, to facilitate the simulation process, DCworms provides some basic implementation of power consumption, air throughput and thermal models. We introduce power consumption models as examples and validate part of them by experiments in real computing system (in Section \ref{sec:experiments}). Description of thermal models and corresponding experiments was presented in \cite{e2dc13}. 342 343 343 344 The most common questions explored by researchers who study energy-efficiency of distributed computing systems is how much energy $E$ do these systems require to execute workloads. In order to obtain this value the simulator must calculate values of power $P_i(t)$ and load $L_i(t)$ in time for all $m$ computing nodes, $i=1..m$. Load function may depend on specific load models applied. In more complex cases it can even be defined as vectors of different resource usage in time. In a simple case load can be either idle or busy but even in this case estimation of job processing times $p_j$ is needed to calculate total energy consumption. The total energy consumption of computing nodes is given by (\ref{eq:E}): … … 513 514 514 515 515 \subsection{Models}\label{sec:models}516 517 Based on the measured values we evaluated three types of models that can be applied, among others, to the simulation environment.518 519 \textbf{Static}520 This model refers to the static approach presented in Section~\ref{sec:power}. According to the measured values we created a resource power consumption model that is based on a static definition of resource power usage.521 %With each node power state, understood as a possible operating state (p-state), we associated a power consumption value that derives from the averaged values of measurements obtained for different types of application. Therefore, the current power usage of the node, can be expressed as: $P = P_{idle} + P_{f}$ where $P$ denotes power consumed by the node, $P_{idle}$ is a power usage of node in idle state and $P_{f}$ stands for power usage of CPU operating at the given frequency level.522 523 \textbf{Dynamic}524 This model refers to the Resource load approach presented in Section~\ref{sec:power}. Based on the measured values of the total node power usage for various levels of load and frequencies of CPUs node power usage was defined as in \ref{eq:model}.525 526 %and referring to the existing models presented in literature, we assumed the following equation: $P = P_{idle} + load*P_{cpubase}*c^{(f-f_{base})/100} + P_{app}$, where $P$ denotes power consumed by the node executing the given application, $P_{idle}$ is a power usage of node in idle state, load is the current utilization level of the node, $P_{cpubase}$ stands for power usage of fully loaded CPU working in the lowest frequency, $c$ is the constant factor indicating the increase of power consumption with respect to the frequency increase $f$- is a current frequency, $f_{base}$- is the lowest available frequency within the given CPU and $P_{app}$ denotes the additional power usage derived from executing a particular application).527 528 529 Table~\ref{nodeBasePowerUsage} and Table~\ref{appPowerUsage} contain values of $P_{cpubase}$ and $P_{app}$, respectively, obtained for the particular application and resource architectures. Lack of the corresponding value means that the application did not run on the given type of node.530 \begin {table}[h!]531 \centering532 \begin{tabular}{lccc}533 \hline534 Intel I7 & AMD Fusion & Atom D510 \\535 \hline536 8 & 2 & 1 \\537 \hline538 \end{tabular}539 \caption {\label{nodeBasePowerUsage} $P_{cpubase}$ values in Watts}540 \end {table}541 542 543 \begin {table}[h!]544 \centering545 \begin{tabular}{l|ccc}546 \hline547 & \multicolumn{3}{c} {Node type}\\548 Application & Intel I7 & AMD Fusion & Atom D510 \\549 \hline550 Abinit & 3.3 & - & - \\551 Linpactiny & 2.5 & - & 0.2 \\552 Linpack3gb & 6 & - & - \\553 C-Ray & 4 & 1 & 0.05 \\554 FFT & 3.5 & 2 & 0.1 \\555 Tar & 3 & 2.5 & 0.5 \\556 557 \hline558 \end{tabular}559 \caption {\label{appPowerUsage} $P_{app}$ values in Watts}560 \end {table}561 562 563 \textbf{Mapping}564 This model refers to the Application specific approach presented in Section~\ref{sec:power}. However, in this model we applied the measured values for each application exactly to the power model. Neither dependencies with load nor application profiles are modeled. Obviously this model is contaminated only with the inaccuracy of the measurements and variability of power usage (caused by other unmeasured factors).565 566 The following table (Table~\ref{expPowerModels}) contains the relative errors of the models with respect to the measured values567 \begin {table}[h!]568 \centering569 \begin{tabular}{llr}570 \hline571 Static & Dynamic & Mapping \\572 \hline573 13.74 & 10.85 & 0 \\574 \hline575 \end{tabular}576 \caption {\label{expPowerModels} Power models error in \%}577 \end {table}578 579 Obviously, 0\% error in the case of the Mapping model is caused by the use of a tabular data, which for each application stores a specific power usage. Nevertheless, in all models we face possible deviations from the average caused by power usage fluctuations not explained by variables used in models. These deviations reached around 7\% for each case.580 581 For the experimental purposes we decided to use the latter model. Thus, we introduce into the simulation environment exact values obtained within our testbed, to build both the power profiles of applications as well as the application performance models, denoting their execution times.582 583 516 584 517 \subsection{Methodology} … … 622 555 In all cases we assumed that tasks are scheduled and served in order of their arrival (FIFO strategy) using relaxed backfilling approach, with indefinite delay for the highest priority task. Moreover, all tasks were assigned to nodes with the condition that they can be assigned only to nodes of the type on which the application was able to run (in other words - we had the corresponding value of power consumption and execution time). 623 556 624 \subsection{Computational analysis} 557 558 \subsection{Models}\label{sec:models} 559 560 Based on the measured values we evaluated three types of models that can be applied, among others, to the simulation environment. 561 562 \textbf{Static} 563 This model refers to the static approach presented in Section~\ref{sec:power}. According to the measured values we created a resource power consumption model that is based on a static definition of resource power usage. 564 %With each node power state, understood as a possible operating state (p-state), we associated a power consumption value that derives from the averaged values of measurements obtained for different types of application. Therefore, the current power usage of the node, can be expressed as: $P = P_{idle} + P_{f}$ where $P$ denotes power consumed by the node, $P_{idle}$ is a power usage of node in idle state and $P_{f}$ stands for power usage of CPU operating at the given frequency level. 565 566 \textbf{Dynamic} 567 This model refers to the Resource load approach presented in Section~\ref{sec:power}. Based on the measured values of the total node power usage for various levels of load and frequencies of CPUs node power usage was defined as in \ref{eq:model}. 568 569 %and referring to the existing models presented in literature, we assumed the following equation: $P = P_{idle} + load*P_{cpubase}*c^{(f-f_{base})/100} + P_{app}$, where $P$ denotes power consumed by the node executing the given application, $P_{idle}$ is a power usage of node in idle state, load is the current utilization level of the node, $P_{cpubase}$ stands for power usage of fully loaded CPU working in the lowest frequency, $c$ is the constant factor indicating the increase of power consumption with respect to the frequency increase $f$- is a current frequency, $f_{base}$- is the lowest available frequency within the given CPU and $P_{app}$ denotes the additional power usage derived from executing a particular application). 570 571 572 Table~\ref{nodeBasePowerUsage} and Table~\ref{appPowerUsage} contain values of $P_{cpubase}$ and $P_{app}$, respectively, obtained for the particular application and resource architectures. Lack of the corresponding value means that the application did not run on the given type of node. 573 \begin {table}[h!] 574 \centering 575 \begin{tabular}{lccc} 576 \hline 577 Intel I7 & AMD Fusion & Atom D510 \\ 578 \hline 579 8 & 2 & 1 \\ 580 \hline 581 \end{tabular} 582 \caption {\label{nodeBasePowerUsage} $P_{cpubase}$ values in Watts} 583 \end {table} 584 585 586 \begin {table}[h!] 587 \centering 588 \begin{tabular}{l|ccc} 589 \hline 590 & \multicolumn{3}{c} {Node type}\\ 591 Application & Intel I7 & AMD Fusion & Atom D510 \\ 592 \hline 593 Abinit & 3.3 & - & - \\ 594 Linpactiny & 2.5 & - & 0.2 \\ 595 Linpack3gb & 6 & - & - \\ 596 C-Ray & 4 & 1 & 0.05 \\ 597 FFT & 3.5 & 2 & 0.1 \\ 598 Tar & 3 & 2.5 & 0.5 \\ 599 600 \hline 601 \end{tabular} 602 \caption {\label{appPowerUsage} $P_{app}$ values in Watts} 603 \end {table} 604 605 606 \textbf{Mapping} 607 This model refers to the Application specific approach presented in Section~\ref{sec:power}. However, in this model we applied the measured values for each application exactly to the power model. Neither dependencies with load nor application profiles are modeled. Obviously this model is contaminated only with the inaccuracy of the measurements and variability of power usage (caused by other unmeasured factors). 608 609 The following table (Table~\ref{expPowerModels}) contains the relative errors of the models with respect to the measured values 610 \begin {table}[h!] 611 \centering 612 \begin{tabular}{llr} 613 \hline 614 Static & Dynamic & Mapping \\ 615 \hline 616 13.74 & 10.85 & 0 \\ 617 \hline 618 \end{tabular} 619 \caption {\label{expPowerModels} Power models error in \%} 620 \end {table} 621 622 Obviously, 0\% error in the case of the Mapping model is caused by the use of a tabular data, which for each application stores a specific power usage. Nevertheless, in all models we face possible deviations from the average caused by power usage fluctuations not explained by variables used in models. These deviations reached around 7\% for each case. 623 624 For the experimental purposes we decided to use the latter model. Thus, we introduce into the simulation environment exact values obtained within our testbed, to build both the power profiles of applications as well as the application performance models, denoting their execution times. 625 626 627 628 \subsection{Resource management policies evaluation} 625 629 626 630 In the following section we present the results obtained for the workload with load density equal to 70\% in the light of five resource management and scheduling strategies. The scheduling strategies were evaluated according to two criteria: total energy consumption and maximum completion time of all tasks (makespan). These evaluation criteria employed in our experiments represent interests of various groups of stakeholders present in data centers. … … 665 669 Type of processor within the node & Power usage in idle state [W] \\ 666 670 \hline 667 671 Intel i7 & 11.5 \\ 668 672 AMD Fusion & 10 \\ 669 673 Atom D510 & 19 \\ … … 706 710 707 711 708 As we were looking for the trade-off between total completion time and energy usage, we were searching for the workload load level that can benefit from the lower system performance in terms of energy-efficiency. For the frequency downgrading policy, we noticed the improvement on the energy usage criterion only for the workload resulting in 10\% system load. For this threshold we observed that slowdown in task execution does not affect the subsequent tasks in the system and thus the total completion time of the whole workload. 712 As we were looking for the trade-off between total completion time and energy usage, we were searching for the workload load level that can benefit from the lower system performance in terms of energy-efficiency. For the frequency downgrading policy, we noticed the improvement on the energy usage criterion only for the workload resulting in 10\% system load. For this threshold we observed that slowdown in task execution does not affect the subsequent tasks in the system and thus the total completion time of the whole workload. In general, for the low loaded systems, where the longer execution times do not affect the computations of subsequent tasks, downgrading the frequency can reduce the overall energy consumption. However, for the more loaded systems, which are typical for HPC centers, and taking into account that idle nodes can be turned off after all, both the simulation results as well as the measurements obtained on real hardware show that the best way to save energy is to perform computations at the maximum frequency level. Due to the shorter execution times, idle nodes can be turned off sooner which results in lower energy usage. 709 713 710 714 Figure~\ref{fig:dfsComp} shows schedules obtained for Random and Random + lowest frequency strategy. … … 717 721 \end{figure} 718 722 719 \subs ection{Discussion}723 \subsubsection{Summary} 720 724 The following tables: Table~\ref{loadEnergy} and Table~\ref{loadMakespan} contain the values of evaluation criteria (total energy usage and makespan respectively) gathered for all investigated workloads. 721 725 … … 756 760 We also demonstrated differences between power usage models. They span from rough static approach to accurate application specific models. However, the latter can be difficult or even infeasible to use, as it requires real measurements for specific applications beforehand. This issue can be partially resolved by introducing application profiles and classification, which can deteriorate the accuracy though. This issue is begin studied more deeply within CoolEmAll project. 757 761 762 \subsection{Verification of models} 763 764 This section contains more detailed comparison of two types of power consumption models that can be applied, among others, within the DCworms. The first one, called Mapping approach, was applied to the experiments in the previous section. As mentioned within this model, the values measured on the CoolEmAll testbed for each application were applied directly to the power consumption model used in DCworms. 765 766 Model evaluated in this section is a modification of the Mapping and Dynamic model by additional modeling of dependencies with the processor load. 767 Within this model, we benefited from the power profiles based on the measurements made on CoolEmAll testbed (and adopted also by the previous model). However, data applied to the simulation environment consisted only of measurements gathered for applications ran in mode resulting in lowest and highest processor load. For all load levels between the given two values we assumed the linear dependency between the load and power consumption. Thus, the power consumption for the given processor load can be expressed using the following equation (\ref{eq:modelLoad}): 768 769 \begin{equation} 770 P_L = P_{LL} + (L-LL)*(P_{HL}-P_{LL})/(HL-LL), \label{eq:modelLoad} 771 \end{equation} 772 773 where $L$ is a given processor load, $LL$ is the lowest measured processor load, $HL$ is the highest measured processor load, $P_L$ denotes power consumption for a given processor load, $P_{LL}$ is a power consumption measured for the lowest processor load and $P_{HL}$ stands for power consumption measured for the highest processor load. 774 775 \begin {table}[h!] 776 \centering 777 \begin{tabular}{l| c | c | c } 778 \hline 779 Policy / Model & Mapping & Mapping + Dynamic & Accuracy [\%]\\ 780 \hline 781 R & 46.883 & 44.476 & 94.87 \\ 782 R+NPM & 30.568 & 28.250 & 92.42 \\ 783 EO & 77.109 & 75.277 & 97.62\\ 784 EO+NPM & 46.305 & 44.050 & 95.13\\ 785 R+LF & 36.705 & 34.298 & 93.44\\ 786 \hline 787 \end{tabular} 788 \caption {\label{modelAccuracy} Comparison of energy usage estimations [kWh] obtained for two power consumption models. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency} 789 \end {table} 790 791 As it can be observed the accuracy of the Mapping + Dynamic based model is high and exceeds visibly 90\%. Satisfactory accuracy suggests that applying various power consumption models, while verifying different approaches or in case of lack of detailed measurements, does not lead to deterioration of overall results. This fact confirms also the important role of simulations in the experiments related to the distributed computing systems. 758 792 759 793 … … 830 864 \bibitem{DCSG} http://dcsg.bcs.org/welcome-dcsg-simulator 831 865 866 \bibitem{SimGrid} http://simgrid.gforge.inria.fr/index.html 867 832 868 \bibitem{DCD_Romonet} http://www.datacenterdynamics.com/blogs/ian-bitterlin/it-does-more-it-says-tin\%E2\%80\%A6 833 869 … … 852 888 \bibitem{fit4green_scheduler} O. M{\"a}mmel{\"a}, M. Majanen, R. Basmadjian, H. De Meer, A. Giesler, W. Homberg, Energy-aware job scheduler for high-performance computing, Computer Science - Research and Development, November 2012, Volume 27, Issue 4, pp 265-275. 853 889 890 \bibitem {fit4green_carbon_scheduler} M. Majanen, O. M{\"a}mmel{\"a}, A. Giesler, Energy and carbon aware scheduling in supercomputing, International Journal on Advances in Intelligent Systems. Vol. 5 (2012) Nr: 3 \& 4, pp. 451 - 469. 891 854 892 \bibitem {d2.2} U. Woessner, E. Volk, G. Gallizo, M. vor dem Berge, G. Da Costa, P. Domagalski, W. Piatek, J-M. Pierson. (2012) D2.2 Design of the CoolEmAll simulation and visualisation environment - CoolEmAll Deliverable, http://coolemall.eu 855 893
Note: See TracChangeset
for help on using the changeset viewer.