Context Navigation

← Previous Change
Next Change →

Changeset 1069 for papers

Timestamp:

06/03/13 13:55:25 (12 years ago)

Author:

wojtekp

Message:

Location:

papers/SMPaT-2012_DCWoRMS

Files:

: 5 edited

elsarticle-DCWoRMS.aux (modified) (6 diffs)
elsarticle-DCWoRMS.fdb_latexmk (modified) (2 diffs)
elsarticle-DCWoRMS.pdf (modified) (previous)
elsarticle-DCWoRMS.synctex.gz (modified) (previous)
elsarticle-DCWoRMS.tex (modified) (14 diffs)

Legend:

: Unmodified
: Added
: Removed

papers/SMPaT-2012_DCWoRMS/elsarticle-DCWoRMS.aux

r1068	r1069
8	8	\citation{fit4green}
9	9	\citation{networks}
	10	\citation{fit4green_carbon_scheduler}
10	11	\citation{sgi}
11	12	\citation{colt}
12	13	\citation{ecocooling}
	14	\citation{SimGrid}
13	15	\citation{ff}
14	16	\citation{DCD_Romonet}
…	…
24	26	\@writefile{toc}{\contentsline {section}{\numberline {3}DCworms}{5}}
25	27	\citation{GSSIM}
	28	\@writefile{lof}{\contentsline {figure}{\numberline {1}{\ignorespaces DCworms architecture}}{6}}
	29	\newlabel{fig:arch}{{1}{6}}
	30	\@writefile{toc}{\contentsline {subsection}{\numberline {3.1}Architecture}{6}}
26	31	\citation{GWF}
27	32	\citation{SLURM}
28	33	\citation{TORQUE}
29		~~\@writefile{lof}{\contentsline {figure}{\numberline {1}{\ignorespaces DCworms architecture}}{6}}~~
30		~~\newlabel{fig:arch}{{1}{6}}~~
31		~~\@writefile{toc}{\contentsline {subsection}{\numberline {3.1}Architecture}{6}}~~
32	34	\citation{Ghislain}
33	35	\@writefile{toc}{\contentsline {subsection}{\numberline {3.2}Workload modeling}{7}}
34		~~\@writefile{toc}{\contentsline {subsection}{\numberline {3.3}Resource modeling}{7}}~~
35	36	\@writefile{lof}{\contentsline {figure}{\numberline {2}{\ignorespaces Levels of information about jobs}}{8}}
36	37	\newlabel{fig:jobsStructure}{{2}{8}}
	38	\@writefile{toc}{\contentsline {subsection}{\numberline {3.3}Resource modeling}{8}}
37	39	\citation{d2.2}
	40	\@writefile{toc}{\contentsline {subsection}{\numberline {3.4}Energy management concept in DCworms}{9}}
38	41	\citation{GSSIM_Energy}
39		~~\@writefile{toc}{\contentsline {subsection}{\numberline {3.4}Energy management concept in DCworms}{9}}~~
40	42	\@writefile{toc}{\contentsline {subsubsection}{\numberline {3.4.1}Power management}{10}}
41	43	\@writefile{toc}{\contentsline {paragraph}{\textbf {Power profile}}{10}}
…	…
46	48	\@writefile{toc}{\contentsline {subsection}{\numberline {3.5}Application performance modeling}{11}}
47	49	\newlabel{sec:apps}{{3.5}{11}}
	50	\citation{GSSIM}
48	51	\citation{e2dc13}
49	52	\citation{d2.2}
…	…
54	57	\@writefile{toc}{\contentsline {subsection}{\numberline {4.1}Power consumption models}{13}}
55	58	\newlabel{sec:power}{{4.1}{13}}
56		~~\newlabel{eq:ohm-law}{{2}{13}}~~
57	59	\@writefile{lof}{\contentsline {figure}{\numberline {5}{\ignorespaces Power in time for the highest frequency}}{14}}
58	60	\newlabel{fig:fans_P}{{5}{14}}
	61	\newlabel{eq:ohm-law}{{2}{14}}
59	62	\@writefile{toc}{\contentsline {subsection}{\numberline {4.2}Static approach}{14}}
60		~~\newlabel{eq:static}{{3}{14}}~~
61	63	\citation{fit4green_scheduler}
	64	\newlabel{eq:static}{{3}{15}}
62	65	\@writefile{toc}{\contentsline {subsection}{\numberline {4.3}Resource load}{15}}
63	66	\newlabel{eq:dynamic}{{4}{15}}
64	67	\newlabel{eq:model}{{7}{16}}
65	68	\@writefile{toc}{\contentsline {subsection}{\numberline {4.4}Application specific}{16}}
66		~~\newlabel{eq:app}{{8}{16}}~~
67	69	\citation{e2dc12}
68	70	\citation{abinit}
69		\~~citation{cray~~}
	71	\newlabel{eq:app}{{8}{17}}
70	72	\@writefile{toc}{\contentsline {section}{\numberline {5}Experiments and evaluation}{17}}
71	73	\newlabel{sec:experiments}{{5}{17}}
…	…
73	75	\@writefile{lot}{\contentsline {table}{\numberline {1}{\ignorespaces RECS system configuration}}{17}}
74	76	\newlabel{testBed}{{1}{17}}
75		\~~@writefile{toc}{\contentsline {subsection}{\numberline {5.2}Evaluated applications}{17}~~}
	77	\citation{cray}
76	78	\citation{linpack}
77	79	\citation{tar}
78	80	\citation{fft}
79		\@writefile{toc}{\contentsline {subsection}{\numberline {5.3}Models}{18}}
80		\newlabel{sec:models}{{5.3}{18}}
81		\@writefile{lot}{\contentsline {table}{\numberline {2}{\ignorespaces $P_{cpubase}$ values in Watts}}{18}}
82		\newlabel{nodeBasePowerUsage}{{2}{18}}
83		\@writefile{lot}{\contentsline {table}{\numberline {3}{\ignorespaces $P_{app}$ values in Watts}}{19}}
84		\newlabel{appPowerUsage}{{3}{19}}
85		\@writefile{lot}{\contentsline {table}{\numberline {4}{\ignorespaces Power models error in \%}}{19}}
86		\newlabel{expPowerModels}{{4}{19}}
87		\@writefile{toc}{\contentsline {subsection}{\numberline {5.4}Methodology}{19}}
88		\@writefile{toc}{\contentsline {subsection}{\numberline {5.5}Computational analysis}{20}}
89		\@writefile{lot}{\contentsline {table}{\numberline {5}{\ignorespaces Workload characteristics}}{21}}
90		\newlabel{workloadCharacteristics}{{5}{21}}
	81	\@writefile{toc}{\contentsline {subsection}{\numberline {5.2}Evaluated applications}{18}}
	82	\@writefile{toc}{\contentsline {subsection}{\numberline {5.3}Methodology}{18}}
	83	\@writefile{lot}{\contentsline {table}{\numberline {2}{\ignorespaces Workload characteristics}}{19}}
	84	\newlabel{workloadCharacteristics}{{2}{19}}
	85	\@writefile{toc}{\contentsline {subsection}{\numberline {5.4}Models}{20}}
	86	\newlabel{sec:models}{{5.4}{20}}
	87	\@writefile{lot}{\contentsline {table}{\numberline {3}{\ignorespaces $P_{cpubase}$ values in Watts}}{20}}
	88	\newlabel{nodeBasePowerUsage}{{3}{20}}
	89	\@writefile{lot}{\contentsline {table}{\numberline {4}{\ignorespaces $P_{app}$ values in Watts}}{20}}
	90	\newlabel{appPowerUsage}{{4}{20}}
	91	\@writefile{lot}{\contentsline {table}{\numberline {5}{\ignorespaces Power models error in \%}}{21}}
	92	\newlabel{expPowerModels}{{5}{21}}
	93	\@writefile{toc}{\contentsline {subsection}{\numberline {5.5}Resource management policies evaluation}{21}}
91	94	\@writefile{toc}{\contentsline {subsubsection}{\numberline {5.5.1}Random approach}{21}}
92	95	\@writefile{lof}{\contentsline {figure}{\numberline {6}{\ignorespaces Comparison of energy usage for Random (left) and Random + switching off unused nodes strategy (right)}}{22}}
93	96	\newlabel{fig:70r_rnpm}{{6}{22}}
94	97	\@writefile{toc}{\contentsline {subsubsection}{\numberline {5.5.2}Energy optimization}{22}}
95		\@writefile{lot}{\contentsline {table}{\numberline {6}{\ignorespaces Measured power of testbed nodes in idle state}}{22}}
96		\newlabel{idlePower}{{6}{22}}
	98	\@writefile{lot}{\contentsline {table}{\numberline {6}{\ignorespaces Measured power of testbed nodes in idle state}}{23}}
	99	\newlabel{idlePower}{{6}{23}}
97	100	\@writefile{lof}{\contentsline {figure}{\numberline {7}{\ignorespaces Energy usage optimization strategy}}{23}}
98	101	\newlabel{fig:70eo}{{7}{23}}
…	…
102	105	\@writefile{lof}{\contentsline {figure}{\numberline {9}{\ignorespaces Frequency downgrading strategy}}{25}}
103	106	\newlabel{fig:70dfs}{{9}{25}}
104		~~\@writefile{toc}{\contentsline {subsection}{\numberline {5.6}Discussion}{25}}~~
105	107	\@writefile{lof}{\contentsline {figure}{\numberline {10}{\ignorespaces Schedules obtained for Random strategy (left) and Random + lowest frequency strategy (right) for 10\% of system load}}{26}}
106	108	\newlabel{fig:dfsComp}{{10}{26}}
	109	\@writefile{toc}{\contentsline {subsubsection}{\numberline {5.5.4}Summary}{26}}
107	110	\@writefile{lot}{\contentsline {table}{\numberline {7}{\ignorespaces Energy usage [kWh] for different level of system load. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency}}{26}}
108	111	\newlabel{loadEnergy}{{7}{26}}
109		\@writefile{lot}{\contentsline {table}{\numberline {8}{\ignorespaces Makespan [s] for different level of system load. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency}}{26}}
110		\newlabel{loadMakespan}{{8}{26}}
111		\@writefile{toc}{\contentsline {section}{\numberline {6}Conclusions and future work}{27}}
	112	\@writefile{lot}{\contentsline {table}{\numberline {8}{\ignorespaces Makespan [s] for different level of system load. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency}}{27}}
	113	\newlabel{loadMakespan}{{8}{27}}
	114	\@writefile{toc}{\contentsline {subsection}{\numberline {5.6}Verification of models}{27}}
	115	\newlabel{eq:modelLoad}{{9}{28}}
	116	\@writefile{lot}{\contentsline {table}{\numberline {9}{\ignorespaces Comparison of energy usage estimations [kWh] obtained for two power consumption models. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency}}{28}}
	117	\newlabel{modelAccuracy}{{9}{28}}
	118	\@writefile{toc}{\contentsline {section}{\numberline {6}Conclusions and future work}{28}}
112	119	\bibcite{fit4green}{{1}{}{{}}{{}}}
113	120	\bibcite{e2dc13}{{2}{}{{}}{{}}}
114	121	\bibcite{e2dc12}{{3}{}{{}}{{}}}
115	122	\bibcite{CloudSim}{{4}{}{{}}{{}}}
116		~~\newlabel{}{{6}{28}}~~
117	123	\bibcite{DCSG}{{5}{}{{}}{{}}}
118		\bibcite{DCD_Romonet}{{6}{}{{}}{{}}}
119		\bibcite{networks}{{7}{}{{}}{{}}}
120		\bibcite{Ghislain}{{8}{}{{}}{{}}}
121		\bibcite{games}{{9}{}{{}}{{}}}
122		\bibcite{GreenCloud}{{10}{}{{}}{{}}}
123		\bibcite{GSSIM}{{11}{}{{}}{{}}}
124		\bibcite{GSSIM_Energy}{{12}{}{{}}{{}}}
125		\bibcite{koomey}{{13}{}{{}}{{}}}
126		\bibcite{fit4green_scheduler}{{14}{}{{}}{{}}}
127		\bibcite{d2.2}{{15}{}{{}}{{}}}
128		\bibcite{abinit}{{16}{}{{}}{{}}}
129		\bibcite{cray}{{17}{}{{}}{{}}}
130		\bibcite{fft}{{18}{}{{}}{{}}}
131		\bibcite{linpack}{{19}{}{{}}{{}}}
132		\bibcite{tar}{{20}{}{{}}{{}}}
133		\bibcite{GWF}{{21}{}{{}}{{}}}
134		\bibcite{SLURM}{{22}{}{{}}{{}}}
135		\bibcite{SWF}{{23}{}{{}}{{}}}
136		\bibcite{TORQUE}{{24}{}{{}}{{}}}
137		\bibcite{colt}{{25}{}{{}}{{}}}
138		\bibcite{coolemall}{{26}{}{{}}{{}}}
139		\bibcite{ecocooling}{{27}{}{{}}{{}}}
140		\bibcite{ff}{{28}{}{{}}{{}}}
141		\bibcite{pue}{{29}{}{{}}{{}}}
142		\bibcite{sgi}{{30}{}{{}}{{}}}
	124	\bibcite{SimGrid}{{6}{}{{}}{{}}}
	125	\bibcite{DCD_Romonet}{{7}{}{{}}{{}}}
	126	\bibcite{networks}{{8}{}{{}}{{}}}
	127	\bibcite{Ghislain}{{9}{}{{}}{{}}}
	128	\newlabel{}{{6}{30}}
	129	\bibcite{games}{{10}{}{{}}{{}}}
	130	\bibcite{GreenCloud}{{11}{}{{}}{{}}}
	131	\bibcite{GSSIM}{{12}{}{{}}{{}}}
	132	\bibcite{GSSIM_Energy}{{13}{}{{}}{{}}}
	133	\bibcite{koomey}{{14}{}{{}}{{}}}
	134	\bibcite{fit4green_scheduler}{{15}{}{{}}{{}}}
	135	\bibcite{fit4green_carbon_scheduler}{{16}{}{{}}{{}}}
	136	\bibcite{d2.2}{{17}{}{{}}{{}}}
	137	\bibcite{abinit}{{18}{}{{}}{{}}}
	138	\bibcite{cray}{{19}{}{{}}{{}}}
	139	\bibcite{fft}{{20}{}{{}}{{}}}
	140	\bibcite{linpack}{{21}{}{{}}{{}}}
	141	\bibcite{tar}{{22}{}{{}}{{}}}
	142	\bibcite{GWF}{{23}{}{{}}{{}}}
	143	\bibcite{SLURM}{{24}{}{{}}{{}}}
	144	\bibcite{SWF}{{25}{}{{}}{{}}}
	145	\bibcite{TORQUE}{{26}{}{{}}{{}}}
	146	\bibcite{colt}{{27}{}{{}}{{}}}
	147	\bibcite{coolemall}{{28}{}{{}}{{}}}
	148	\bibcite{ecocooling}{{29}{}{{}}{{}}}
	149	\bibcite{ff}{{30}{}{{}}{{}}}
	150	\bibcite{pue}{{31}{}{{}}{{}}}
	151	\bibcite{sgi}{{32}{}{{}}{{}}}
143	152	\providecommand\NAT@force@numbers{}\NAT@force@numbers

papers/SMPaT-2012_DCWoRMS/elsarticle-DCWoRMS.fdb_latexmk

r1068	r1069
1	1	# Fdb version 2
2		["pdflatex"] 1370~~008374~~ "elsarticle-DCWoRMS.tex" "elsarticle-DCWoRMS.pdf" "elsarticle-DCWoRMS"
	2	["pdflatex"] 1370260512 "elsarticle-DCWoRMS.tex" "elsarticle-DCWoRMS.pdf" "elsarticle-DCWoRMS"
3	3	"/usr/local/texlive/2010/texmf-dist/tex/context/base/supp-pdf.mkii" 1251025892 71625 fad1c4b52151c234b6873a255b0ad6b3 ""
4	4	"/usr/local/texlive/2010/texmf-dist/tex/generic/oberdiek/etexcmds.sty" 1267408169 5670 cacb018555825cfe95cd1e1317d82c1d ""
…	…
30	30	"/usr/local/texlive/2010/texmf-dist/tex/latex/psnfss/upsy.fd" 1137110629 148 2da0acd77cba348f34823f44cabf0058 ""
31	31	"/usr/local/texlive/2010/texmf-dist/tex/latex/psnfss/upzd.fd" 1137110629 148 b2a94082cb802f90d3daf6dd0c7188a0 ""
32		"elsarticle-DCWoRMS.aux" 1370~~008377 7677 f71d3a69833156dd0cd55997caa941bf~~ ""
33		"elsarticle-DCWoRMS.spl" 1370~~008375~~ 0 d41d8cd98f00b204e9800998ecf8427e ""
34		"elsarticle-DCWoRMS.tex" 1370~~007766 81944 a4f37cdda6dc074fed8954549930ec15~~ ""
	32	"elsarticle-DCWoRMS.aux" 1370260514 8344 4fae76ffb960809b0662b0a5380c1cf5 ""
	33	"elsarticle-DCWoRMS.spl" 1370260512 0 d41d8cd98f00b204e9800998ecf8427e ""
	34	"elsarticle-DCWoRMS.tex" 1370260508 86035 d15304aaf50305115c88a8a70ebe8af4 ""
35	35	"elsarticle.cls" 1352447924 26095 ad44f4892f75e6e05dca57a3581f78d1 ""
36	36	"fig/70dfsGantt.png" 1370005142 138858 edc557d8862a7825f2941d8c69c30659 ""

papers/SMPaT-2012_DCWoRMS/elsarticle-DCWoRMS.tex

r1068	r1069
174	174	%Furthermore, available power supply is usually limited so it also may reduce data center development capabilities, especially looking at challenges related to exascale computing breakthrough foreseen within this decade.
175	175
176		For these reasons many efforts were undertaken to measure and study energy efficiency of data centers. There are projects focused on data center monitoring and management \cite{games}\cite{fit4green} whereas others on energy efficiency of networks \cite{networks}. Additionally, vendors offer a wide spectrum of energy efficient solutions for computing and cooling \cite{sgi}\cite{colt}\cite{ecocooling}. However, a variety of solutions and configuration options can be applied planning new or upgrading existing data centers.
	176	For these reasons many efforts were undertaken to measure and study energy efficiency of data centers. There are projects focused on data center monitoring and management \cite{games}\cite{fit4green} whereas others on energy efficiency of networks \cite{networks} or distributed computing infrastructures, like grids \cite{fit4green_carbon_scheduler}. Additionally, vendors offer a wide spectrum of energy efficient solutions for computing and cooling \cite{sgi}\cite{colt}\cite{ecocooling}. However, a variety of solutions and configuration options can be applied planning new or upgrading existing data centers.
177	177	In order to optimize a design or configuration of data center we need a thorough study using appropriate metrics and tools evaluating how much computation or data processing can be done within given power and energy budget and how it affects temperatures, heat transfers, and airflows within data center.
178	178	Therefore, there is a need for simulation tools and models that approach the problem from a perspective of end users and take into account all the factors that are critical to understanding and improving the energy efficiency of data centers, in particular, hardware characteristics, applications, management policies, and cooling.
179	179	These tools should support data center designers and operators by answering questions how specific application types, levels of load, hardware specifications, physical arrangements, cooling technology, etc. impact overall data center energy efficiency.
180		There are various tools that allow simulation of computing infrastructures. On one hand they include advanced packages for modeling heat transfer and energy consumption in data centers \cite{ff} or tools concentrating on their financial analysis \cite{DCD_Romonet}. On the other hand, there are simulators focusing on computations such as CloudSim \cite{CloudSim}. The CoolEmAll project aims to integrate these approaches and enable advanced analysis of data center efficiency taking into account all these aspects \cite{e2dc12}\cite{coolemall}.
	180	There are various tools that allow simulation of computing infrastructures, like SimGrid\cite{SimGrid}. On one hand they include advanced packages for modeling heat transfer and energy consumption in data centers \cite{ff} or tools concentrating on their financial analysis \cite{DCD_Romonet}. On the other hand, there are simulators focusing on computations such as CloudSim \cite{CloudSim}. The CoolEmAll project aims to integrate these approaches and enable advanced analysis of data center efficiency taking into account all these aspects \cite{e2dc12}\cite{coolemall}.
181	181
182	182	One of the results of the CoolEmAll project is the Data Center Workload and Resource Management Simulator (DCworms) which enables modeling and simulation of computing infrastructures to estimate their performance, energy consumption, and energy-efficiency metrics for diverse workloads and management policies.
…	…
200	200	In terms of application modeling, all tools, except DCSG Simulator, describe the application with a number of computational and communicational requirements. In addition, GreenCloud and DCworms allow introducing the QoS requirements by taking into account the time constraints during the simulation. DCSG Simulator instead of modeling of the single application, enables the definition of workload that leads to a given utilization level. However, only DCworms supports application performance modeling by not only incorporating simple requirements that are taken into account during scheduling, but also by allowing specification of task execution time.
201	201
202		GreenCloud, CloudSim and DCworms are released as Open Source under the GPL. DCSG Simulator is available under an OSL V3.0 open-source license, however, it can be only accessed by the DCSG members.
	202	GreenCloud, CloudSim and DCworms are released as Open Source under the GPL. DCSG Simulator is available under the OSL V3.0 open-source license, however, it can be only accessed by the DCSG members.
203	203
204	204	Summarizing, DCworms stands out from other tools due to the flexibility in terms of data center equipment and structure definition.
…	…
210	210
211	211	Data Center workload and resource management simulator (DCworms) is a simulation tool based on the GSSIM framework \cite{GSSIM} developed by Poznan Supercomputing and Networking Center (PSNC).
212		GSSIM has been proposed to provide an automated tool for experimental studies of various resource management and scheduling strategies in distributed computing systems. DCworms extends its basic functionality and adds some additional features related to the energy efficiency issues in data centers. In this section we will introduce the functionality of the simulator, in terms of modeling and simulation of ~~large scale distributed systems like Grids and Cloud~~s.
	212	GSSIM has been proposed to provide an automated tool for experimental studies of various resource management and scheduling strategies in distributed computing systems. DCworms extends its basic functionality and adds some additional features related to the energy efficiency issues in data centers. In this section we will introduce the functionality of the simulator, in terms of modeling and simulation of data centers.
213	213
214	214
…	…
222	222	\end{figure}
223	223
224		DCworms is an event-driven simulation tool written in Java. In general, input data for the DCworms consist of workload and resources descriptions. They can be provided by the user, read from real traces or generated using the generator module. In this terms DCworms benefits from the GSSIM workload generator tool that allows creating synthetic workloads (\cite{GSSIM}). However, the key elements of the presented architecture are plugins. They allow the researchers to configure and adapt the simulation environment to the peculiarities of their studies, starting from modeling job performance, through energy estimations up to implementation of resource management and scheduling policies. Each plugin can be implemented independently and plugged into a specific experiment. Results of experiments are collected, aggregated, and visualized using the statistics module. Due to a modular and plug-able architecture DCworms can be applied to specific resource management problems and address different usersâ requirements.
	224	DCworms is an event-driven simulation tool written in Java. In general, input data for the DCworms consist of workload and resource descriptions. They can be provided by the user, read from real traces or generated using the generator module. In this terms DCworms benefits from the GSSIM workload generator tool that allows creating synthetic workloads (\cite{GSSIM}). However, the key elements of the presented architecture are plugins. They allow the researchers to configure and adapt the simulation environment to the peculiarities of their studies, starting from modeling job performance, through energy estimations up to implementation of resource management and scheduling policies. Each plugin can be implemented independently and plugged into a specific experiment. Afterwards, politics introduced by particular plugins are applied to a given set of tasks and resources and are triggered by each change of simulated environment state. Results of experiments are collected, aggregated, and visualized using the statistics module. The output of each simulation consists of a number of statistics created to help in
	225	comparative evaluations of resource and workload management policies. Due to a modular and plug-able architecture DCworms can be applied to specific resource management problems and address different usersâ requirements.
225	226
226	227
…	…
333	334	\item network parameters
334	335	\end{itemize}
335		Using these parameters developers can for instance take into account the architectures of the underlying systems, such as multi-core processors, and their impact on the final performance of applications.
	336	Using these parameters developers can for instance take into account the architectures of the underlying systems, such as multi-core processors, and their impact on the final performance of applications. Examples of application performance modeling in DCworms are shown in \cite{GSSIM}.
336	337
337	338
…	…
339	340	\section{Modeling of energy consumption in DCworms}
340	341
341		DCworms is an open framework in which various models and algorithms can be investigated as presented in Section \ref{sec:apps}. In this section, we discuss possible approaches to modeling that can be applied to simulation of energy-efficiency of distributed computing systems. In general, to facilitate the simulation process, DCworms provides some basic implementation of power consumption, air throughput and thermal models. We introduce power consumption models as examples and validate part of them by experiments in real computing system (in Section \ref{sec:experiments}). Description of thermal models and corresponding experiments was presented in \cite{e2dc13}.
	342	DCworms is an open framework in which various models and algorithms can be investigated as presented in Section \ref{sec:apps}. In this section, we discuss possible approaches to modeling that can be applied to simulation of energy-efficiency of distributed computing systems. In general, to facilitate the simulation process, DCworms provides some basic implementation of power consumption, air throughput and thermal models. We introduce power consumption models as examples and validate part of them by experiments in real computing system (in Section \ref{sec:experiments}). Description of thermal models and corresponding experiments was presented in \cite{e2dc13}.
342	343
343	344	The most common questions explored by researchers who study energy-efficiency of distributed computing systems is how much energy $E$ do these systems require to execute workloads. In order to obtain this value the simulator must calculate values of power $P_i(t)$ and load $L_i(t)$ in time for all $m$ computing nodes, $i=1..m$. Load function may depend on specific load models applied. In more complex cases it can even be defined as vectors of different resource usage in time. In a simple case load can be either idle or busy but even in this case estimation of job processing times $p_j$ is needed to calculate total energy consumption. The total energy consumption of computing nodes is given by (\ref{eq:E}):
…	…
513	514
514	515
515		~~\subsection{Models}\label{sec:models}~~
516
517		~~Based on the measured values we evaluated three types of models that can be applied, among others, to the simulation environment.~~
518
519		~~\textbf{Static}~~
520		This model refers to the static approach presented in Section~\ref{sec:power}. According to the measured values we created a resource power consumption model that is based on a static definition of resource power usage.
521		%With each node power state, understood as a possible operating state (p-state), we associated a power consumption value that derives from the averaged values of measurements obtained for different types of application. Therefore, the current power usage of the node, can be expressed as: $P = P_{idle} + P_{f}$ where $P$ denotes power consumed by the node, $P_{idle}$ is a power usage of node in idle state and $P_{f}$ stands for power usage of CPU operating at the given frequency level.
522
523		~~\textbf{Dynamic}~~
524		This model refers to the Resource load approach presented in Section~\ref{sec:power}. Based on the measured values of the total node power usage for various levels of load and frequencies of CPUs node power usage was defined as in \ref{eq:model}.
525
526		%and referring to the existing models presented in literature, we assumed the following equation: $P = P_{idle} + loadP_{cpubase}c^{(f-f_{base})/100} + P_{app}$, where $P$ denotes power consumed by the node executing the given application, $P_{idle}$ is a power usage of node in idle state, load is the current utilization level of the node, $P_{cpubase}$ stands for power usage of fully loaded CPU working in the lowest frequency, $c$ is the constant factor indicating the increase of power consumption with respect to the frequency increase $f$- is a current frequency, $f_{base}$- is the lowest available frequency within the given CPU and $P_{app}$ denotes the additional power usage derived from executing a particular application).
527
528
529		Table~\ref{nodeBasePowerUsage} and Table~\ref{appPowerUsage} contain values of $P_{cpubase}$ and $P_{app}$, respectively, obtained for the particular application and resource architectures. Lack of the corresponding value means that the application did not run on the given type of node.
530		~~\begin {table}[h!]~~
531		~~\centering~~
532		~~\begin{tabular}{lccc}~~
533		~~\hline~~
534		~~Intel I7 & AMD Fusion & Atom D510 \\~~
535		~~\hline~~
536		~~8 & 2 & 1 \\~~
537		~~\hline~~
538		~~\end{tabular}~~
539		~~\caption {\label{nodeBasePowerUsage} $P_{cpubase}$ values in Watts}~~
540		~~\end {table}~~
541
542
543		~~\begin {table}[h!]~~
544		~~\centering~~
545		~~\begin{tabular}{l\|ccc}~~
546		~~\hline~~
547		~~& \multicolumn{3}{c} {Node type}\\~~
548		~~Application & Intel I7 & AMD Fusion & Atom D510 \\~~
549		~~\hline~~
550		~~Abinit & 3.3 & - & - \\~~
551		~~Linpactiny & 2.5 & - & 0.2 \\~~
552		~~Linpack3gb & 6 & - & - \\~~
553		~~C-Ray & 4 & 1 & 0.05 \\~~
554		~~FFT & 3.5 & 2 & 0.1 \\~~
555		~~Tar & 3 & 2.5 & 0.5 \\~~
556
557		~~\hline~~
558		~~\end{tabular}~~
559		~~\caption {\label{appPowerUsage} $P_{app}$ values in Watts}~~
560		~~\end {table}~~
561
562
563		~~\textbf{Mapping}~~
564		This model refers to the Application specific approach presented in Section~\ref{sec:power}. However, in this model we applied the measured values for each application exactly to the power model. Neither dependencies with load nor application profiles are modeled. Obviously this model is contaminated only with the inaccuracy of the measurements and variability of power usage (caused by other unmeasured factors).
565
566		~~The following table (Table~\ref{expPowerModels}) contains the relative errors of the models with respect to the measured values~~
567		~~\begin {table}[h!]~~
568		~~\centering~~
569		~~\begin{tabular}{llr}~~
570		~~\hline~~
571		~~Static & Dynamic & Mapping \\~~
572		~~\hline~~
573		~~13.74 & 10.85 & 0 \\~~
574		~~\hline~~
575		~~\end{tabular}~~
576		~~\caption {\label{expPowerModels} Power models error in \%}~~
577		~~\end {table}~~
578
579		Obviously, 0\% error in the case of the Mapping model is caused by the use of a tabular data, which for each application stores a specific power usage. Nevertheless, in all models we face possible deviations from the average caused by power usage fluctuations not explained by variables used in models. These deviations reached around 7\% for each case.
580
581		For the experimental purposes we decided to use the latter model. Thus, we introduce into the simulation environment exact values obtained within our testbed, to build both the power profiles of applications as well as the application performance models, denoting their execution times.
582
583	516
584	517	\subsection{Methodology}
…	…
622	555	In all cases we assumed that tasks are scheduled and served in order of their arrival (FIFO strategy) using relaxed backfilling approach, with indefinite delay for the highest priority task. Moreover, all tasks were assigned to nodes with the condition that they can be assigned only to nodes of the type on which the application was able to run (in other words - we had the corresponding value of power consumption and execution time).
623	556
624		\subsection{Computational analysis}
	557
	558	\subsection{Models}\label{sec:models}
	559
	560	Based on the measured values we evaluated three types of models that can be applied, among others, to the simulation environment.
	561
	562	\textbf{Static}
	563	This model refers to the static approach presented in Section~\ref{sec:power}. According to the measured values we created a resource power consumption model that is based on a static definition of resource power usage.
	564	%With each node power state, understood as a possible operating state (p-state), we associated a power consumption value that derives from the averaged values of measurements obtained for different types of application. Therefore, the current power usage of the node, can be expressed as: $P = P_{idle} + P_{f}$ where $P$ denotes power consumed by the node, $P_{idle}$ is a power usage of node in idle state and $P_{f}$ stands for power usage of CPU operating at the given frequency level.
	565
	566	\textbf{Dynamic}
	567	This model refers to the Resource load approach presented in Section~\ref{sec:power}. Based on the measured values of the total node power usage for various levels of load and frequencies of CPUs node power usage was defined as in \ref{eq:model}.
	568
	569	%and referring to the existing models presented in literature, we assumed the following equation: $P = P_{idle} + loadP_{cpubase}c^{(f-f_{base})/100} + P_{app}$, where $P$ denotes power consumed by the node executing the given application, $P_{idle}$ is a power usage of node in idle state, load is the current utilization level of the node, $P_{cpubase}$ stands for power usage of fully loaded CPU working in the lowest frequency, $c$ is the constant factor indicating the increase of power consumption with respect to the frequency increase $f$- is a current frequency, $f_{base}$- is the lowest available frequency within the given CPU and $P_{app}$ denotes the additional power usage derived from executing a particular application).
	570
	571
	572	Table~\ref{nodeBasePowerUsage} and Table~\ref{appPowerUsage} contain values of $P_{cpubase}$ and $P_{app}$, respectively, obtained for the particular application and resource architectures. Lack of the corresponding value means that the application did not run on the given type of node.
	573	\begin {table}[h!]
	574	\centering
	575	\begin{tabular}{lccc}
	576	\hline
	577	Intel I7 & AMD Fusion & Atom D510 \\
	578	\hline
	579	8 & 2 & 1 \\
	580	\hline
	581	\end{tabular}
	582	\caption {\label{nodeBasePowerUsage} $P_{cpubase}$ values in Watts}
	583	\end {table}
	584
	585
	586	\begin {table}[h!]
	587	\centering
	588	\begin{tabular}{l\|ccc}
	589	\hline
	590	& \multicolumn{3}{c} {Node type}\\
	591	Application & Intel I7 & AMD Fusion & Atom D510 \\
	592	\hline
	593	Abinit & 3.3 & - & - \\
	594	Linpactiny & 2.5 & - & 0.2 \\
	595	Linpack3gb & 6 & - & - \\
	596	C-Ray & 4 & 1 & 0.05 \\
	597	FFT & 3.5 & 2 & 0.1 \\
	598	Tar & 3 & 2.5 & 0.5 \\
	599
	600	\hline
	601	\end{tabular}
	602	\caption {\label{appPowerUsage} $P_{app}$ values in Watts}
	603	\end {table}
	604
	605
	606	\textbf{Mapping}
	607	This model refers to the Application specific approach presented in Section~\ref{sec:power}. However, in this model we applied the measured values for each application exactly to the power model. Neither dependencies with load nor application profiles are modeled. Obviously this model is contaminated only with the inaccuracy of the measurements and variability of power usage (caused by other unmeasured factors).
	608
	609	The following table (Table~\ref{expPowerModels}) contains the relative errors of the models with respect to the measured values
	610	\begin {table}[h!]
	611	\centering
	612	\begin{tabular}{llr}
	613	\hline
	614	Static & Dynamic & Mapping \\
	615	\hline
	616	13.74 & 10.85 & 0 \\
	617	\hline
	618	\end{tabular}
	619	\caption {\label{expPowerModels} Power models error in \%}
	620	\end {table}
	621
	622	Obviously, 0\% error in the case of the Mapping model is caused by the use of a tabular data, which for each application stores a specific power usage. Nevertheless, in all models we face possible deviations from the average caused by power usage fluctuations not explained by variables used in models. These deviations reached around 7\% for each case.
	623
	624	For the experimental purposes we decided to use the latter model. Thus, we introduce into the simulation environment exact values obtained within our testbed, to build both the power profiles of applications as well as the application performance models, denoting their execution times.
	625
	626
	627
	628	\subsection{Resource management policies evaluation}
625	629
626	630	In the following section we present the results obtained for the workload with load density equal to 70\% in the light of five resource management and scheduling strategies. The scheduling strategies were evaluated according to two criteria: total energy consumption and maximum completion time of all tasks (makespan). These evaluation criteria employed in our experiments represent interests of various groups of stakeholders present in data centers.
…	…
665	669	Type of processor within the node & Power usage in idle state [W] \\
666	670	\hline
667		Intel i7 & 11.5 \\
	671	Intel i7 & 11.5 \\
668	672	AMD Fusion & 10 \\
669	673	Atom D510 & 19 \\
…	…
706	710
707	711
708		As we were looking for the trade-off between total completion time and energy usage, we were searching for the workload load level that can benefit from the lower system performance in terms of energy-efficiency. For the frequency downgrading policy, we noticed the improvement on the energy usage criterion only for the workload resulting in 10\% system load. For this threshold we observed that slowdown in task execution does not affect the subsequent tasks in the system and thus the total completion time of the whole workload.
	712	As we were looking for the trade-off between total completion time and energy usage, we were searching for the workload load level that can benefit from the lower system performance in terms of energy-efficiency. For the frequency downgrading policy, we noticed the improvement on the energy usage criterion only for the workload resulting in 10\% system load. For this threshold we observed that slowdown in task execution does not affect the subsequent tasks in the system and thus the total completion time of the whole workload. In general, for the low loaded systems, where the longer execution times do not affect the computations of subsequent tasks, downgrading the frequency can reduce the overall energy consumption. However, for the more loaded systems, which are typical for HPC centers, and taking into account that idle nodes can be turned off after all, both the simulation results as well as the measurements obtained on real hardware show that the best way to save energy is to perform computations at the maximum frequency level. Due to the shorter execution times, idle nodes can be turned off sooner which results in lower energy usage.
709	713
710	714	Figure~\ref{fig:dfsComp} shows schedules obtained for Random and Random + lowest frequency strategy.
…	…
717	721	\end{figure}
718	722
719		\subs~~ection{Discussion~~}
	723	\subsubsection{Summary}
720	724	The following tables: Table~\ref{loadEnergy} and Table~\ref{loadMakespan} contain the values of evaluation criteria (total energy usage and makespan respectively) gathered for all investigated workloads.
721	725
…	…
756	760	We also demonstrated differences between power usage models. They span from rough static approach to accurate application specific models. However, the latter can be difficult or even infeasible to use, as it requires real measurements for specific applications beforehand. This issue can be partially resolved by introducing application profiles and classification, which can deteriorate the accuracy though. This issue is begin studied more deeply within CoolEmAll project.
757	761
	762	\subsection{Verification of models}
	763
	764	This section contains more detailed comparison of two types of power consumption models that can be applied, among others, within the DCworms. The first one, called Mapping approach, was applied to the experiments in the previous section. As mentioned within this model, the values measured on the CoolEmAll testbed for each application were applied directly to the power consumption model used in DCworms.
	765
	766	Model evaluated in this section is a modification of the Mapping and Dynamic model by additional modeling of dependencies with the processor load.
	767	Within this model, we benefited from the power profiles based on the measurements made on CoolEmAll testbed (and adopted also by the previous model). However, data applied to the simulation environment consisted only of measurements gathered for applications ran in mode resulting in lowest and highest processor load. For all load levels between the given two values we assumed the linear dependency between the load and power consumption. Thus, the power consumption for the given processor load can be expressed using the following equation (\ref{eq:modelLoad}):
	768
	769	\begin{equation}
	770	P_L = P_{LL} + (L-LL)*(P_{HL}-P_{LL})/(HL-LL), \label{eq:modelLoad}
	771	\end{equation}
	772
	773	where $L$ is a given processor load, $LL$ is the lowest measured processor load, $HL$ is the highest measured processor load, $P_L$ denotes power consumption for a given processor load, $P_{LL}$ is a power consumption measured for the lowest processor load and $P_{HL}$ stands for power consumption measured for the highest processor load.
	774
	775	\begin {table}[h!]
	776	\centering
	777	\begin{tabular}{l\| c \| c \| c }
	778	\hline
	779	Policy / Model & Mapping & Mapping + Dynamic & Accuracy [\%]\\
	780	\hline
	781	R & 46.883 & 44.476 & 94.87 \\
	782	R+NPM & 30.568 & 28.250 & 92.42 \\
	783	EO & 77.109 & 75.277 & 97.62\\
	784	EO+NPM & 46.305 & 44.050 & 95.13\\
	785	R+LF & 36.705 & 34.298 & 93.44\\
	786	\hline
	787	\end{tabular}
	788	\caption {\label{modelAccuracy} Comparison of energy usage estimations [kWh] obtained for two power consumption models. R - Random, R+NPM - Random + node power management, EO - Energy optimization, EO+NPM - Energy optimization + node power management, R+LF - Random + lowest frequency}
	789	\end {table}
	790
	791	As it can be observed the accuracy of the Mapping + Dynamic based model is high and exceeds visibly 90\%. Satisfactory accuracy suggests that applying various power consumption models, while verifying different approaches or in case of lack of detailed measurements, does not lead to deterioration of overall results. This fact confirms also the important role of simulations in the experiments related to the distributed computing systems.
758	792
759	793
…	…
830	864	\bibitem{DCSG} http://dcsg.bcs.org/welcome-dcsg-simulator
831	865
	866	\bibitem{SimGrid} http://simgrid.gforge.inria.fr/index.html
	867
832	868	\bibitem{DCD_Romonet} http://www.datacenterdynamics.com/blogs/ian-bitterlin/it-does-more-it-says-tin\%E2\%80\%A6
833	869
…	…
852	888	\bibitem{fit4green_scheduler} O. M{\"a}mmel{\"a}, M. Majanen, R. Basmadjian, H. De Meer, A. Giesler, W. Homberg, Energy-aware job scheduler for high-performance computing, Computer Science - Research and Development, November 2012, Volume 27, Issue 4, pp 265-275.
853	889
	890	\bibitem {fit4green_carbon_scheduler} M. Majanen, O. M{\"a}mmel{\"a}, A. Giesler, Energy and carbon aware scheduling in supercomputing, International Journal on Advances in Intelligent Systems. Vol. 5 (2012) Nr: 3 \& 4, pp. 451 - 469.
	891
854	892	\bibitem {d2.2} U. Woessner, E. Volk, G. Gallizo, M. vor dem Berge, G. Da Costa, P. Domagalski, W. Piatek, J-M. Pierson. (2012) D2.2 Design of the CoolEmAll simulation and visualisation environment - CoolEmAll Deliverable, http://coolemall.eu
855	893

Note: See TracChangeset for help on using the changeset viewer.

Context Navigation

Changeset 1069 for papers

Legend:

papers/SMPaT-2012_DCWoRMS/elsarticle-DCWoRMS.aux

papers/SMPaT-2012_DCWoRMS/elsarticle-DCWoRMS.fdb_latexmk

papers/SMPaT-2012_DCWoRMS/elsarticle-DCWoRMS.tex

Download in other formats: