Cloud · Top stories
Google’s electricity use rose 37% in 2025 amid AI infrastructure expansion
Google's electricity consumption surged by 37% in 2025, driven by AI data center growth. Despite this increase, the company achieved a 2% reduction in operational emissions, emphasizing its reliance on clean energy investments.
NVIDIA and AWS Enhance AI Production with New EC2 G7 Instances
NVIDIA and AWS have launched EC2 G7 instances powered by NVIDIA RTX PRO 4500 GPUs, enhancing AI production capabilities. These instances offer substantial performance improvements, enabling enterprises to deploy AI and data analytics workloads at scale with lower operational complexity.
Microsoft announces Azure Cobalt 200 VMs with 50% performance improvement for AI
Microsoft's Azure Cobalt 200 Arm-based VMs provide a 50% performance boost over the Cobalt 100, tailored for agentic AI workloads. This launch signals a shift in cloud architecture due to increased customer demand for compute in AI applications.
Meta Develops Cloud Service to Monetize AI Compute Capacity
Meta is reportedly developing a cloud infrastructure service called 'Meta Compute' to monetize its AI compute capacity, putting it in direct competition with cloud giants like AWS and Google Cloud. This strategic move aims to utilize Meta's significant investments in AI infrastructure and offset costs while potentially disrupting the cloud computing industry. It matters because it reflects a broader trend of tech companies leveraging their data center assets to generate additional revenue.
Cloudflare to Block Mixed-Use AI Web Crawlers by 2026
Cloudflare will block mixed-use web crawlers from accessing ad-supported sites starting September 15, 2026. This policy aims to give website owners more control over how AI companies use their content, potentially affecting AI models' access to web data.
Claude Fable 5 by Anthropic Now Available on Azure with NVIDIA GB300 GPUs
Anthropic's AI model, Claude Fable 5, is now accessible on Microsoft Azure Foundry, utilizing NVIDIA GB300 GPUs. It allows enterprises to leverage AI for complex, autonomous tasks with enhanced performance while integrating NVIDIA's computing power. This development provides businesses with capabilities to manage and deploy advanced AI solutions efficiently, highlighting the collaboration among Microsoft, Anthropic, and NVIDIA.
AWS expands AI and agent capabilities at NY Summit 2026
AWS announced several new AI and agent capabilities during its New York Summit 2026. Key updates include the Amazon Bedrock AgentCore enhancements, AWS Continuum for proactive AI-native security, and the Agentic CX designer for Amazon Connect. These innovations aim to streamline AI development, security, and customer experience processes.
GeForce NOW Adds 12 New Games Including Monopoly: Star Wars Heroes
GeForce NOW adds 12 new games in July, starting with Monopoly: Star Wars Heroes vs. Villains. This expansion includes popular titles and enhances accessibility for gamers on various devices.
AWS Network Firewall introduces container attribute-based rules for EKS and ECS
AWS Network Firewall now supports container attribute-based rules for Amazon EKS and ECS, enhancing security for traffic in Kubernetes environments. This feature allows users to define firewall rules based on container attributes instead of transient IP addresses, addressing challenges in dynamic container workloads.
SOCRadar migrates to AlloyDB for enhanced threat detection capabilities
SOCRadar has migrated from PostgreSQL to AlloyDB, achieving a 20x performance boost and reduced operational overhead. This transition allows SOCRadar to deliver faster threat intelligence, critical for defending against cyberattacks.
Microsoft Introduces Controls to Block Unauthorized AI Bots in Teams Meetings
Microsoft has launched a new Teams admin policy to control external bots joining meetings. By requiring organizer confirmation for bots, the company aims to enhance security and privacy during sensitive discussions.
BoltzGen Deploys on Amazon SageMaker AI for Protein Design
BoltzGen is now available on Amazon SageMaker AI, optimizing protein binder design processes by managing GPU infrastructure automatically. This integration allows researchers and developers to focus on design rather than infrastructure, significantly reducing operational overhead and costs in protein design projects.
Google Cloud receives Dutch DPIA approval for EU public sector use
Google Cloud has gained approval from the Dutch government following a rigorous Data Protection Impact Assessment (DPIA). This milestone assures EU public sector organizations of Google Cloud's compliance with data protection standards, encouraging broader adoption in the region.
Google Cloud Updates Include New AI Models and TPU Enhancements
Google Cloud announced new features including Claude Sonnet 5 and enhancements to TPU model loading. These developments aim to improve AI task completion and model efficiency for enterprises.
Elon Musk Proposes 1 Million Satellite Orbital Data Center Constellation
SpaceX plans to deploy up to 1 million satellites for orbital data centers, as proposed by Elon Musk. However, challenges in launch capacity and satellite manufacturing raise doubts about the feasibility of this plan in the projected timeline.
Google Cloud Introduces Multi-Node KV Cache Offloading for LLMs
Google Cloud has implemented a decentralized attention cache tier using Managed Lustre, enhancing LLM inference efficiency. This system allows for over 50% TCO savings and nearly 60% reduction in GPU-hour requirements for Llama-3.3-70B inference, addressing limitations of local storage pooling.
Google Cloud Monitoring introduces dynamic thresholding for anomaly detection
Google Cloud Monitoring has launched dynamic thresholding for PromQL alert policies, allowing users to configure alerts based on two years of historical metric data. This feature enables more flexible and accurate anomaly detection as it adjusts thresholds based on historical performance rather than static values.
Google Cloud announces new AI features and partnerships for June 2023
Google Cloud introduced several updates in June 2023 including the Open Knowledge Format for AI metadata. Collaborations with Apple and the launch of Anthropic's Claude Fable 5 model on Google Cloud were also highlighted. These developments enhance the capabilities and security of AI applications in the cloud.
Gemini Enterprise Agent Platform launches remote MCP server for developers
The Gemini Enterprise Agent Platform has introduced a remote Managed Control Plane (MCP) server to facilitate secure connection for external AI agents with Google Cloud resources. This development allows developers to create agents using their preferred IDEs while ensuring compliance and data protection.
Google Cloud Introduces AlloyDB Omni for Financial Services Modernization
Google Cloud has launched AlloyDB Omni aimed at modernizing financial services by addressing compliance, speed, and data sovereignty challenges. This new hybrid database solution allows organizations to escape vendor lock-in and leverage cloud-native technologies while maintaining control over their data, crucial for meeting regulatory demands.
Google Spanner Enhances Multi-Model Architecture for AI Integration
Google's Spanner database is evolving to support a unified multi-model architecture for generative AI and autonomous workflows. This shift addresses the need for databases to serve as critical context engines, enabling AI models to leverage diverse data formats in real-time.
Google Cloud Enhances SDLC Security with Autonomous AI Agents
Google Cloud has integrated autonomous AI agents into its software development lifecycle for enhanced security. This transition aims to improve defenses against AI-driven threats by automating security processes and improving code protection.
AWS adds resource-based policies for console access control from specific networks
AWS introduced resource-based policies and resource control policies to restrict AWS Management Console access to specific networks. This change allows organizations to enforce network-based restrictions for compliance and security purposes, significantly enhancing AWS account security.
Agentic Cloud Operations Transition to Insight-Driven Action with AI Integration
Agentic cloud operations enable real-time insight-driven decision-making in hybrid environments using AI agents. This approach is gaining traction, with 79% of organizations deploying such technologies, reflecting a shift in cloud operations management.
AWS emphasizes egress controls to prevent data exfiltration in cloud workloads
Amazon Web Services (AWS) highlights the importance of egress controls to prevent data exfiltration in cloud environments. With traditional threats and emerging AI architectures posing risks, proper egress monitoring is necessary to detect unauthorized data flows and secure workloads.
Azure Storage migration tools aimed at enhancing data transition strategies
Microsoft Azure introduced tools like Azure Migrate and Azure Copilot Migration Agent to streamline enterprise storage migrations. These tools help organizations manage dependencies, assess readiness, and execute tailored migration strategies, which is vital for business continuity.
Amazon Cognito enhances services with high-throughput, encryption, and replication features
Amazon Cognito has introduced high-throughput performance, customer-managed keys, and multi-Region replication capabilities. These enhancements support modern applications and improve data security and business continuity.
AWS KMS launches GetKeyLastUsage API for key management
AWS has introduced the GetKeyLastUsage API, allowing users to check when KMS keys were last utilized. This tool simplifies auditing and reduces reliance on AWS CloudTrail logs, enhancing key management efficiency and compliance tracking.
AWS Network Firewall Supports Transit Gateway Attachment for Cost Optimization
AWS Network Firewall now allows attachment to Transit Gateway, streamlining traffic routing without needing a central inspection VPC. This simplifies network architecture and enables flexible cost allocation for traffic inspection, making it more efficient for AWS users.
AWS Network Firewall introduces URL and Domain Category filtering for easier policy management
AWS Network Firewall now enables URL and domain category filtering, allowing security teams to manage access via predefined categories rather than individual domains. This update simplifies policy management and ensures domain lists stay current automatically, particularly benefiting organizations overseeing rapidly changing areas like AI services.
Amazon EKS now supports Kubernetes version rollbacks for safer upgrades
Amazon Elastic Kubernetes Service (EKS) introduces a version rollback feature for Kubernetes upgrades, allowing administrators to revert to a previous version within seven days if issues arise. This new feature enhances upgrade confidence, addressing concerns over reliability in upgrade procedures, particularly in regulated environments.
AWS launches CloudFormation Express mode to accelerate infrastructure deployments
AWS introduced CloudFormation Express mode, reducing deployment times by up to 4x by bypassing stabilization checks. This enables faster iterative workflows and supports AI-assisted development by allowing quicker feedback loops.
Amazon Launches New EC2 Instances Powered by Graviton5 Processors
Amazon has released new EC2 instances, the M9g, M9gd, C9g, and C9gd, powered by Graviton5 processors. These instances offer substantial performance improvements, with the M9g enhancing MySQL database query performance by 60% and the C9g offering 25% higher performance per vCPU for compute-intensive workloads. This development is significant for businesses utilizing cloud computing, promising improved efficiency and cost management for various applications.
AWS Certificate Manager Adds ACME Support for Automating TLS Certificate Issuance
AWS Certificate Manager now supports the Automatic Certificate Management Environment (ACME) protocol for public TLS certificates, allowing automated issuance and management without manual intervention. This update enables centralized control for PKI administrators and helps organizations streamline certificate management as validity periods shorten.
EU to Discuss Controversial Chat Control Legislation This Weekend
Dr. Patrick Breyer warns of significant legislative moves regarding Chat Control in the EU. Proposed measures could undermine secure messaging and anonymous communication in Europe.
Google restricts Meta's access to Gemini AI due to capacity limitations
Google has limited Meta's use of its Gemini AI models after Meta requested more computing power than Google could provide. This has disrupted and delayed Meta's internal AI projects and highlighted the ongoing struggle for companies to secure sufficient computing resources amidst rising AI demands.
SoftBank CEO Questions Viability of Elon Musk's Orbital Data Centers
SoftBank CEO Masayoshi Son expressed skepticism about Elon Musk's orbital data center vision, stating it may not reduce costs and will take too long to implement. This skepticism highlights the uncertainty in the tech landscape regarding investments in space computing amid pressing AI demands.
AWS Lambda introduces MicroVMs for isolated code execution environments
AWS Lambda launched MicroVMs, offering VM-level isolation for user-generated code in a serverless environment. This addresses the need for dedicated execution environments in applications like AI assistants and data analytics, enabling rapid session launches while managing state effectively.
Amazon EC2 G7 Instances Launch with NVIDIA RTX PRO 4500 GPUs
Amazon has launched EC2 G7 instances featuring NVIDIA RTX PRO 4500 GPUs, enhancing AI inference and graphics performance. This introduces significant improvements over previous generations, with up to 4.6x AI inference performance and 700 Gbps networking throughput, impacting workloads across AI, rendering, and analytics sectors.
Amazon ECS launches high-resolution metrics for faster service auto scaling
Amazon ECS introduced high-resolution metrics for service auto scaling, enhancing responsiveness to workload demands. The new mechanism reduces the time to scale-out from 363 seconds to 86 seconds, improving efficiency and cost-effectiveness for users.