Cloud native EDA tools & pre-optimized hardware platforms
Unlimited access to EDA software licenses on-demand
As more designers deploy production EDA workloads on Synopsys Cloud, enabling cloud infrastructure cost optimization has consistently been one of the most common asks. Within the Synopsys Cloud product team, we shortlisted a few areas of innovation which would help our customers achieve this goal. Among these, leveraging spot virtual machines clearly stood out as the most effective means to drive costs down. However, as most cloud infrastructure experts would agree, running high performance, high memory tools on spot is not trivial. In this article, we will attempt to define the problem and discuss in detail how we arrived at this unique solution to address the constraints public cloud infrastructure subjects us to.
Cloud providers plan for capacity of each type of compute resource several quarters in advance to ensure availability of elastic cloud scale. In practical terms, these capacity projections don’t play out perfectly and there’s always a struggle between demand for specific compute virtual machines versus actual supply. When there’s excess capacity of certain compute VMs, cloud providers put these on what is called the “spot market” and make them available at heavily discounted prices with the caveat that these VMs may be removed on short notice. Users have leveraged discounts of up to 80% off on-demand prices in the spot market.
High performance workloads like EDA that can scale on cloud infrastructure need the ability to recover from a spot VM termination signal in order to ensure that there’s no processing time lost when a job has been running for a while. The most common solution to this problem is to build checkpoint-restore functionality in the tools. Several Synopsys tools offer this capability and users have learned to use it well for their needs over the years.
However, just having checkpoint-restore available at your disposal does not enable spot. Spot is a unique beast that adds a more stringent constraint on the deployment architecture by providing a very limited window to take a snapshot of the runtime memory state of the tool. AWS offers a 2 minute warning, which in reality may be much shorter, and Microsoft Azure offers only a 30 second notice currently.
As we all know, each EDA tool is not created equal and tools that have smaller memory footprint can successfully checkpoint their state within this warning window. Users who run verification jobs on Synopsys VCS® functional verification solution have successfully leveraged the tool’s inherent checkpoint-restore capabilities to run on spot and reduced costs significantly. Similarly, for library characterization on Synopsys PrimeLib library characterization solution, since typical distributed jobs run for only a few minutes and runtime state has a very small footprint, customers have successfully enabled spot instances by just ignoring the failures and restarting those jobs.
The challenge is more pronounced when we start exploring high memory workloads such as timing analysis, physical verification, physical design, or RTL-to-Gates implementation. The size of the runtime state for these workloads may run into several hundred gigabytes and the time needed to checkpoint is much longer than the Spot warning window provided by cloud providers. So, jobs which get terminated while running on spot cannot be restored, as no state is saved. This means several hours of runtime and compute usage costs can go to waste. For these workloads, just having checkpoint-restore capability is not enough to effectively use spot.
As each EDA tool behaves differently, we started analyzing runtime memory usage patterns for each tool independently to assess the amount of time needed to successfully save the state. We also explored multiple technical solutions that can either provide alternatives to the standard checkpoint-restore functionality or complement that capability. One unique solution was presented by Exostellar, which leverages machine learning to predict the onset of a spot termination signal in advance. Earlier this year, Synopsys entered into a technical partnership with Exostellar to jointly develop and market an intelligent solution to this problem.
The Synopsys ChipSpot solution is built on the concept of creating a “virtual machine array” optimized for each EDA tool which contains a mix of Spot and on-demand VMs. Based on termination signal predictions from its AI-driven algorithm, the solution migrates the running EDA workload, live, to an on-demand VM in the VM array, thus reducing the chances of the workload being terminated. Once spot availability eases, the running state is migrated back to a spot VM in the array. We tested this architecture extensively on some of our most compute-intensive high memory workloads and announced the spot-optimized Synopsys ChipSpot solution for EDA in August of 2023. Powered by Exostellar X-Spot technology, Synopsys ChipSpot enables customers to save up to 75% off on-demand compute prices. Synopsys ChipSpot is now available for Synopsys Fusion Compiler™, Synopsys PrimeTime®, Synopsys StarRC™, Synopsys VCS® and Synopsys IC Validator™ tools for AWS EC2 spot instances.
To sign-up, please reach out to your Synopsys sales representative or email cloud@synopsys.com.
When we first launched Synopsys Cloud one year ago, one of our primary goals was to enable industry transforming technology for chip designers who want to leverage cloud. With Synopsys Cloud FlexEDA, the industry’s first true pay-per-use business model, and a completely browser-based high performance computing experience, Synopsys Cloud is driving cutting edge innovation to enable our customers to focus on what they do best – design chips, faster.
Synopsys is the industry’s largest provider of electronic design automation (EDA) technology used in the design and verification of semiconductor devices, or chips. With Synopsys Cloud, we’re taking EDA to new heights, combining the availability of advanced compute and storage infrastructure with unlimited access to EDA software licenses on-demand so you can focus on what you do best – designing chips, faster. Delivering cloud-native EDA tools and pre-optimized hardware platforms, an extremely flexible business model, and a modern customer experience, Synopsys has reimagined the future of chip design on the cloud, without disrupting proven workflows.
Take a Test Drive!
Synopsys technology drives innovations that change how people work and play using high-performance silicon chips. Let Synopsys power your innovation journey with cloud-based EDA tools. Sign up to try Synopsys Cloud for free!
Vikram Bhatia is head of cloud product management and GTM strategy at Synopsys. He's responsible for building the industry's first completely browser-based EDA-as-a-Service platform, Synopsys Cloud. He has over 25 years of experience in product strategy, and prior to joining Synopsys, he served in a variety of leadership roles at companies including NetApp, Oracle, HP and Microsoft. Over the last decade, Vikram has exclusively focused on transforming traditional on-premises business models to cloud based SaaS offerings though product management, go-to-market strategy, partnerships, and sales transformation initiatives. Vikram has a Bachelor of Technology degree from the Indian Institute of Technology Kanpur, and graduate degrees from the Colorado School of Mines and the Indian School of Business.