The Cost of a CAE Job in the Cloud
Ten years ago, general belief was that high performance cloud computing was about five times more expensive than high performance computing (HPC) on premises, by comparing total cost in the cloud versus the cost of the on-prem compute server, often ignoring additional expenses for operations, management software, IT staff, energy, space, support, and maintenance, for the on-prem HPC server. In addition, cloud providers were quite vague about additional costs for storage, data movement and other cloud services. Over time, more detailed analyses have been published recommending how to save cost in the cloud and increase return on investment.
But one question has barely been answered: How much does one complex engineering simulation job really cost you in the cloud? Because the engineers (and their simulations) are one of the company’s core assets for product innovation, the question could also be asked: How much does innovation cost for my company? Or, how much do I have to spend to increase my next-generation product’s quality, and thus its competitiveness in the market?
The Cloud HPC Architecture
The answer can be found by zooming into an existing high-performance cloud computing architecture that is built to solve complex engineering simulation problems, in very detail. Such an architecture has been implemented, for example, for the cement and mining machinery leader FLSmidth with headquarters in Denmark, see e.g. our FLSmidth eBook. For their simulations, over the last four years, they have been using simulation software from Ansys, Siemens, and Dassault, running in the engineering cloud environment depicted in the following figure.
For executing an engineer’s simulation job, the following steps (which you can skip reading for now if you are just interested in the results) are performed automatically by the different cloud components, invisible for the engineer, on Microsoft’s Azure Cloud:
Bill of Material and Total Cost for an Automotive STAR-CCM+ Simulation
Using an automotive application example, we are running a computational fluid dynamics (CFD) simulation with Siemens STAR-CCM+, for a 100-million-cell geometry, with hexa-mesh. A first benchmark run of this job on one 120-core HBv3 compute node (based on AMD EPYC Milan CPUs) in Azure resulted in 30 hours run time. Next, running the same job on 16 120-core HBv3 compute nodes (total 1920 cores) in Azure resulted in in a run time of just 1.5 hours, demonstrating an excellent scalability of this STAR-CCM+ job for this specific application, with a speed-up factor of 20, achieved by using 16 nodes and 16 times larger local memory, causing fewer memory conflicts.
Now, with all this information, it is possible to set up a Bill of Materials for all the components described above, as a basis for calculating the total cost of this job, as follows (D4SV3, D4SV4, HBv3, and NC12s_v3 are Azure compute instances).
We then collected pricing information for the different components on Microsoft’s websites and with Google Search, for e.g., 100 simulation jobs per months, resulting in:
Finally, dividing $12,245.79 by the 100 jobs results in $122.45 for one STAR-CCM+ simulation job on 16 HBv3 AMD EPYC Milan-powered Azure compute instances, including all infrastructure, simulation, and data costs.
Just imagine, to verify one engineer’s innovative idea, which might dramatically improve your company’s next generation product, help avoid a product failure early in the design phase, spare expensive and time-consuming physical prototyping in the lab, discover a new revolutionary material, reduce the cost of manufacturing, or which could shorten time to market and increase your company’s competitiveness, or all of this together . . . for just $122.45! That's something that you can really take a few moments to enjoy and savor.