Describing Experimental/Simulation Setup

Sometimes the results of a performance analysis may depend on the computers used and/or specific features of software/libraries. In such cases it is extremely important to describe the experimental/simulation set up in details. It enables others to repeat those experiments as well as check whether the results are rigorous, statistically sound, and unbiased. Unfortunately, "Simulation Setup" is the shortest section in many research papers where authors try to save space by cutting down as much as details.

Here are some tips on what to include (in addition to describing the experimental/simulation setup) based on my experiences:
Type of Simulation

Are the results based on Experimentation, Emulation, or Simulation? If simulations also mention further details like whether it is Discrete Event, Montacarlo, Stochastic, or Deterministic simulation.

Are the results for Steady, Dynamic, Starting/ramp-up, or Terminating state(s)?

Number of Experiments/Simulations

No of samples

Sampling methods

Confidence intervals, accuracy levels

Random number generation, whether a different seed is used for every run, what seeds were used (particularly if you use a widely available simulator/emulator)

Computer(s) Used

CPU - Clock speed, no of cores, cache sizes, CPU model (specific model no in addition to saying, e.g., Intel I7, Xeon, Itanium or AMD Opteron), and special instructions used (e.g., MMX, SSE, AVX)

Memory - capacity, speed

Accelerators used, e.g., GPUs or Intel Phi (only when they have an impact)

Sensors and actuators attached (only when they have an impact)

Operating system - type, version, special features, other services that were running, load on the node, performance enhancements/optimizations

Network connections - link speed, type of connection (e.g., Ethernet, WiFi, Bluetooth, ADSL, Dial-up, T1, 3G, HSPA+, 4G, etc.), latency, intermediate nodes (firewalls, proxies), etc. (only when they have an impact)

Cluster setup - interconnection network (e.g., ring, torus, mesh), type of connections (e.g., Ethernet or Infiniband) (only when they have an impact)

Applications/Simulators/Emulators/Libraries

A detailed list of tools used, e.g., math libraries, topology generators, workload generators, etc.

Their version numbers, what specific features/options were enabled/disabled

Specific parameters, e.g., packet/workload generation model, packet loss rate, packet size, inter-packet gap, job arrival time distribution, probability that one node connects to another, node churn rate, etc.

Compiler and compiler options, e.g., gcc, g++, Intel compilers, with/without optimizations, special flags

Cluster middleware (only when they have an impact)

Dilum Bandara

Search This Blog

Describing Experimental/Simulation Setup

Comments

Popular posts from this blog

1. Building P2P Simulators with OverSim - Where to Begin

Distributed Systems Mind Map