From Digital Age to Nano Age. WorldWide.

Tag: Clouds

Robotic Automations

Alkira connects with $100M for a solution that connects your clouds | TechCrunch


As cloud adoption continues to surge towards the $1 trillion mark in annual spend, we’re seeing a wave of enterprise startups gaining traction with customers and investors for tools to help manage that usage. In the latest development, a startup called Alkira has raised $100 million for “network infrastructure as a service”, which lets users […]

© 2024 TechCrunch. All rights reserved. For personal use only.


Software Development in Sri Lanka

Alternative clouds are booming as companies seek cheaper access to GPUs | TechCrunch


The appetite for alternative clouds has never been bigger.

Case in point: CoreWeave, the GPU infrastructure provider that began life as a cryptocurrency mining operation, this week raised $1.1 billion in new funding from investors including Coatue, Fidelity and Altimeter Capital. The round brings its valuation to $19 billion post-money, and its total raised to $5 billion in debt and equity — a remarkable figure for a company that’s less than ten years old.

It’s not just CoreWeave.

Lambda Labs, which also offers an array of cloud-hosted GPU instances, in early April secured a “special purpose financing vehicle” of up to $500 million, months after closing a $320 million Series C round. The nonprofit Voltage Park, backed by crypto billionaire Jed McCaleb, last October announced that it’s investing $500 million in GPU-backed data centers. And Together AI, a cloud GPU host that also conducts generative AI research, in March landed $106 million in a Salesforce-led round.

So why all the enthusiasm for — and cash pouring into — the alternative cloud space?

The answer, as you might expect, is generative AI.

As the generative AI boom times continue, so does the demand for the hardware to run and train generative AI models at scale. GPUs, architecturally, are the logical choice for training, fine-tuning and running models because they contain thousands of cores that can work in parallel to perform the linear algebra equations that make up generative models.
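To see why that parallelism matters, consider matrix multiplication, the workhorse operation inside these models. Every cell of the output is an independent dot product, so the cells can be computed in any order, or all at once. The sketch below is illustrative only: the function names are hypothetical, and Python threads stand in for GPU cores purely to show the independence of the work, not to deliver a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def dot(row, col):
    """Dot product of one row and one column."""
    return sum(x * y for x, y in zip(row, col))


def matmul_parallel(a, b, workers=4):
    """Multiply matrices a and b, computing each output cell as its own task."""
    cols = list(zip(*b))  # columns of b
    cells = [(i, j) for i in range(len(a)) for j in range(len(cols))]

    def cell_value(cell):
        i, j = cell
        # No cell depends on any other cell, which is what lets
        # thousands of GPU cores share the work.
        return dot(a[i], cols[j])

    with ThreadPoolExecutor(max_workers=workers) as pool:
        flat = list(pool.map(cell_value, cells))  # map() keeps input order

    width = len(cols)
    return [flat[r * width:(r + 1) * width] for r in range(len(a))]


print(matmul_parallel([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
```

A real model performs the same kind of operation on matrices with thousands of rows and columns, which is where dedicated parallel hardware pays off.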

But installing GPUs is expensive. So most devs and organizations turn to the cloud instead.

Incumbents in the cloud computing space — Amazon Web Services (AWS), Google Cloud and Microsoft Azure — offer no shortage of GPU and specialty hardware instances optimized for generative AI workloads. But for at least some models and projects, alternative clouds can end up being cheaper — and delivering better availability.

On CoreWeave, renting an Nvidia A100 40GB — one popular choice for model training and inferencing — costs $2.39 per hour, which works out to $1,200 per month. On Azure, the same GPU costs $3.40 per hour, or $2,482 per month; on Google Cloud, it’s $3.67 per hour, or $2,682 per month.

Given that generative AI workloads usually run on clusters of GPUs, the cost deltas grow quickly.
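A rough sketch of how those deltas scale, using the hourly rates quoted above. The 730-hour month and the cluster sizes are assumptions for illustration, not figures from the providers. At 730 hours, the Azure rate reproduces the quoted $2,482 and the Google Cloud rate lands near the quoted $2,682, while the CoreWeave rate comes to about $1,745 rather than $1,200, so the quoted CoreWeave monthly figure presumably rests on a different basis.

```python
HOURS_PER_MONTH = 730  # assumption: 365 days * 24 hours / 12 months

# Hourly A100 40GB rates as quoted in the article.
HOURLY_RATES = {"CoreWeave": 2.39, "Azure": 3.40, "Google Cloud": 3.67}


def monthly_cost(provider, gpus=1, hours=HOURS_PER_MONTH):
    """Monthly cost in dollars of running `gpus` GPUs around the clock."""
    return round(HOURLY_RATES[provider] * gpus * hours, 2)


def monthly_delta(provider, baseline="CoreWeave", gpus=1):
    """How much more `provider` costs per month than `baseline`."""
    return round(monthly_cost(provider, gpus) - monthly_cost(baseline, gpus), 2)


for gpus in (1, 8, 64):  # hypothetical cluster sizes
    for provider in ("Azure", "Google Cloud"):
        delta = monthly_delta(provider, gpus=gpus)
        print(f"{gpus:>3} GPUs: {provider} costs ${delta:,.2f} more per month")
```

Because the gap is linear in the number of GPUs, a difference of about a dollar per hour on one card becomes thousands of dollars a month on even a small cluster.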

“Companies like CoreWeave participate in a market we call specialty ‘GPU as a service’ cloud providers,” Sid Nag, VP of cloud services and technologies at Gartner, told TechCrunch. “Given the high demand for GPUs, they offer an alternate to the hyperscalers, where they’ve taken Nvidia GPUs and provided another route to market and access to those GPUs.”

Nag points out that even some big tech firms have begun to lean on alternative cloud providers as they run up against compute capacity challenges.

Last June, CNBC reported that Microsoft had signed a multi-billion-dollar deal with CoreWeave to ensure that OpenAI, the maker of ChatGPT and a close Microsoft partner, would have adequate compute power to train its generative AI models. Nvidia, the furnisher of the bulk of CoreWeave’s chips, sees this as a desirable trend, perhaps for leverage reasons; it’s said to have given some alternative cloud providers preferential access to its GPUs.

Lee Sustar, principal analyst at Forrester, sees cloud vendors like CoreWeave succeeding in part because they don’t have the infrastructure “baggage” that incumbent providers have to deal with.

“Given hyperscaler dominance of the overall public cloud market, which demands vast investments in infrastructure and range of services that make little or no revenue, challengers like CoreWeave have an opportunity to succeed with a focus on premium AI services without the burden of hyperscaler-level investments overall,” he said.

But is this growth sustainable?

Sustar has his doubts. He believes that alternative cloud providers’ expansion will be conditioned by whether they can continue to bring GPUs online in high volume, and offer them at competitively low prices.

Competing on pricing might become challenging down the line as incumbents like Google, Microsoft and AWS ramp up investments in custom hardware to run and train models. Google offers its TPUs; Microsoft recently unveiled two custom chips, Azure Maia and Azure Cobalt; and AWS has Trainium, Inferentia and Graviton.

“Hyperscalers will leverage their custom silicon to mitigate their dependencies on Nvidia, while Nvidia will look to CoreWeave and other GPU-centric AI clouds,” Sustar said.

Then there’s the fact that, while many generative AI workloads run best on GPUs, not all workloads need them, particularly if they aren’t time-sensitive. CPUs can run the necessary calculations, though typically more slowly than GPUs and custom hardware.

More existentially, there’s a threat that the generative AI bubble will burst, which would leave providers with mounds of GPUs and not nearly enough customers demanding them. But the future looks rosy in the short term, say Sustar and Nag, both of whom are expecting a steady stream of upstart clouds.

“GPU-oriented cloud startups will give [incumbents] plenty of competition, especially among customers who are already multi-cloud and can handle the complexity of management, security, risk and compliance across multiple clouds,” Sustar said. “Those sorts of cloud customers are comfortable trying out a new AI cloud if it has credible leadership, solid financial backing and GPUs with no wait times.”


Google bets on partners to run their own sovereign Google Clouds | TechCrunch


Data sovereignty and residency laws have become commonplace in recent years. The major clouds, however, were built to let data move freely between their various locations. So over the last few years, all of the hyperscalers have started looking into how they could offer sovereign clouds that can guarantee that government data, for example, never leaves a given country. AWS announced its European Sovereign Cloud last October. The Microsoft Azure Cloud for Sovereignty became generally available in December.

Google Cloud’s approach has been a bit different. Back in 2021, Google Cloud partnered with T-Systems to offer a sovereign cloud for Germany. A few weeks ago, it also announced a new partnership with World Wide Technology (WWT) to offer sovereign cloud solutions for government customers in the U.S.

Now Google is renewing its focus on data sovereignty. For the time being, though, it looks like its emphasis is on partnerships, not building its own sovereign clouds.

Google Cloud’s hybrid and on-premises story has changed quite a bit over the last few years. From the Cloud Services Platform to Anthos, GKE On-Prem and likely a few others that time has long forgotten, Google Cloud has aimed to offer a solution for companies that want to use its services and tooling but, because of regulations, security, cost or paranoia, don’t want their workloads and data to sit in Google Cloud. Google’s latest effort in this space is branded Google Distributed Cloud (GDC), a fully managed software and hardware solution that can either be connected to Google Cloud or be completely air-gapped from the internet.

Of course, this wouldn’t be 2024 if Google didn’t put an emphasis on AI in all of these efforts, too.

“Today, customers are looking for entirely new ways to process and analyze data, discover hidden insights, increase productivity and build entirely new applications — all with AI at the core,” said Vithal Shirodkar, VP/GM, Google Distributed Cloud and Geo Expansion, Google Cloud, in Tuesday’s announcement. “However, data sovereignty, regulatory compliance, and low-latency requirements can present a dilemma for organizations eager to adopt AI in the cloud. The need to keep sensitive data in certain locations, adhere to strict regulations, and ensure swift responsiveness can make it difficult to capitalize on the cloud’s inherent advantages of innovation, scalability, and cost-efficiency.”

At Cloud Next, Google Cloud’s annual developer conference, GDC is getting a slew of updates, including new security features (in partnership with Palo Alto Networks), support for the Apigee API management service and more. Developers can also now use a GDC Sandbox in Google Cloud to build and test applications without the need to work with the physical hardware. What’s maybe just as important as these new features is that GDC is now ISO 27001 and SOC 2 compliant.

On the hardware side, Google Cloud is introducing new AI servers for GDC. These are powered by Nvidia’s L4 Tensor Core GPUs and are now available in addition to the existing GDC AI-optimized servers with the high-powered Nvidia H100 GPUs.

Another interesting aspect to the GDC digital sovereignty story is that Google Cloud is emphasizing its partners, T-Systems, WWT and Clarence, which can deliver sovereign GDC-powered clouds on behalf of their clients.

