News
- News
- Last Updated: February 19, 2026
- Anush DSouza, Josh Lewis
Heroku is introducing significant updates to Managed Inference and Agents. These changes focus on reducing developer friction, expanding the model catalog, and streamlining deployment workflows.
- News
- Last Updated: February 17, 2026
- Anush DSouza
Large language models are good at writing code. Data from Anthropic shows that allowing Claude to execute scripts, rather than relying on sequential tool calls, reduces token consumption by an average of 37%, with some use cases seeing reductions as high as 98%.
Untrusted code needs a secure and isolated place to execute. We solved this with code execution sandboxes (powered by one-off dynos), launched alongside Heroku Managed Inference and Agents in May 2025.
- Ecosystem, News
- Last Updated: February 11, 2026
- Alberto Sigismondi
Today, we are thrilled to announce the General Availability (GA) of the Heroku GitHub Enterprise Server Integration.
For our Enterprise customers, the bridge between code and production must be more than just convenient. It must be resilient, secure, and governed at scale. While our legacy OAuth integration served us well, the modern security landscape demands a shift away from personal credentials toward managed service identities.
- News
- Last Updated: February 06, 2026
- Nitin T Bhat
Today, Heroku is transitioning to a sustaining engineering model focused on stability, security, reliability, and support. Heroku remains an actively supported, production-ready platform, with an emphasis on maintaining quality and operational excellence rather than introducing new features. We know changes like this can raise questions, and we want to be clear about what this means for customers.
There is no change for customers using Heroku today. Customers who pay via credit card in the Heroku …
- Engineering, News
- Last Updated: January 29, 2026
- Anush DSouza
If you’ve built a RAG (Retrieval Augmented Generation) system, you’ve probably hit this wall: your vector search returns 20 documents that are semantically similar to the query, but half of them don’t actually answer it.
A user asks “how do I handle authentication errors?” and gets back documents about authentication, errors, and error handling that are close in embedding space, but only one or two of them are actually useful.
This is the gap between demo and production. Most tutorials stop at vector search; this reference architecture shows what comes next. The AI Search reference app walks through building production-grade enterprise AI search with Heroku Managed Inference and Agents.
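The retrieve-then-rerank pattern behind the reference app can be sketched in a few lines. This is a toy illustration, not the reference app’s code: the embedding search and the reranker below are stand-in scoring functions, and in a real pipeline you would call a vector store and a hosted rerank model instead.

```python
# Toy retrieve-then-rerank pipeline. Stand-in scorers replace a real
# vector store (stage 1) and a hosted rerank model (stage 2).

def vector_search(query, docs, k=20):
    """Stage 1: cheap recall. Score by shared words, a stand-in for
    cosine similarity over embeddings."""
    q = set(query.lower().split())
    scored = [(len(q & set(d.lower().split())), d) for d in docs]
    scored.sort(key=lambda t: t[0], reverse=True)
    return [d for _, d in scored[:k]]

def rerank(query, candidates, top_n=3):
    """Stage 2: precision. A real reranker reads query and document
    together; this stand-in simply prefers documents containing the
    full query phrase, and drops everything that scores zero."""
    phrase = query.lower()
    scored = [(1.0 if phrase in d.lower() else 0.0, d) for d in candidates]
    scored.sort(key=lambda t: t[0], reverse=True)
    return [d for score, d in scored[:top_n] if score > 0]

docs = [
    "How to handle authentication errors: retry with a refreshed token.",
    "Authentication overview and supported identity providers.",
    "General error handling guidelines for the API.",
]
candidates = vector_search("handle authentication errors", docs)
answers = rerank("handle authentication errors", candidates, top_n=1)
```

All three documents survive stage 1 because they share words with the query; stage 2 keeps only the one that actually answers it, which is the filtering role reranking plays in the production pipeline.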
- News
- Last Updated: January 15, 2026
- Anush DSouza, Mandeep Bal
Today, we are announcing the general availability of reranking models on Heroku Managed Inference and Agents, featuring support for Cohere Rerank 3.5 and Amazon Rerank 1.0.
Semantic reranking models score documents based on their relevance to a specific query. Unlike keyword search or vector similarity, rerank models understand nuanced semantic relationships to identify the most relevant documents for a given question. Reranking acts as your RAG pipeline’s high-fidelity filter, decreasing noise and token costs by identifying which documents best answer the specific query.
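As a sketch of what a rerank call looks like, the snippet below builds a Cohere-style rerank request body and reads per-document relevance scores out of the response. The model name, field names, and response shape are assumptions for illustration; consult the Heroku Managed Inference and Agents documentation for the actual endpoint, auth, and schema. The response is mocked so the example runs offline.

```python
import json

# Hypothetical request body in the Cohere Rerank style: the model scores
# each document's relevance to the query.
payload = {
    "model": "rerank-3.5",  # assumed model id
    "query": "how do I handle authentication errors?",
    "documents": [
        "Authentication overview and identity providers.",
        "Handling authentication errors: retry with a fresh token.",
    ],
    "top_n": 1,
}

# Mocked response: most relevant documents first, referenced by index
# into the submitted list, each with a relevance score.
mock_response = json.dumps({
    "results": [{"index": 1, "relevance_score": 0.97}]
})

results = json.loads(mock_response)["results"]
best = payload["documents"][results[0]["index"]]
```

Because the reranker returns indices and scores rather than document text, the caller stays in control of which documents (and how many tokens) flow on to the generation step.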
- News
- Last Updated: December 18, 2025
- Anush DSouza
This month marks a significant expansion for Heroku Managed Inference and Agents, directly accelerating our AI PaaS framework. We’re announcing a substantial addition to our model catalog, providing access to leading proprietary AI models such as Claude Opus 4.5 and Nova 2, and open-weight models such as Kimi K2 Thinking, MiniMax M2, and Qwen3. These resources are fully managed, secure, and accessible via a single CLI command. We have also refreshed aistudio.heroku.com; please navigate to …
- News
- Last Updated: December 11, 2025
- Vivek Viswanathan
Modern Continuous Integration/Continuous Deployment (CI/CD) pipelines demand machine-to-machine authorization, but traditional web-based flows require manual steps and often rely on static credentials, a major security risk. Heroku AppLink now uses JWT Authorization to solve both problems: enabling automated setup and eliminating long-lived secrets.
- News
- Last Updated: December 10, 2025
- Alberto Sigismondi
We’re excited to announce a significant enhancement to how Heroku Enterprise customers connect their deployment pipelines to GitHub Enterprise Server (GHES) and GitHub Enterprise Cloud (GHEC). The new Heroku GitHub Enterprise Integration is now available in a closed pilot, offering a more secure, robust, and permanent connection between your code repositories and your Heroku apps.
- News
- Last Updated: December 04, 2025
- Anush DSouza, Mandeep Bal
Heroku is launching automatic prompt caching on December 18, 2025. Prompt caching delivers a notable, zero-effort performance increase for Heroku Managed Inference and Agents. Enabled by default, it is designed to deliver significantly faster responses for common workloads. We have taken a pragmatic approach: caching currently applies only to system prompts and tool definitions, not to user messages or conversation history. You can disable caching for any request by setting the X-Heroku-Prompt-Caching: false header.
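Opting a single request out of caching just means sending that header. In the sketch below, only the X-Heroku-Prompt-Caching header comes from the announcement; the endpoint path, model id, and token are placeholders for illustration, and the network call itself is left commented out.

```python
import json
import urllib.request

# Placeholder endpoint and credential; the opt-out header is the
# documented part of this sketch.
url = "https://us.inference.heroku.com/v1/chat/completions"  # assumed path
headers = {
    "Authorization": "Bearer <INFERENCE_KEY>",  # placeholder credential
    "Content-Type": "application/json",
    "X-Heroku-Prompt-Caching": "false",  # disable caching for this request
}
body = json.dumps({
    "model": "claude-opus-4-5",  # assumed model id
    "messages": [{"role": "user", "content": "hello"}],
}).encode()

req = urllib.request.Request(url, data=body, headers=headers, method="POST")
# urllib.request.urlopen(req)  # network call omitted in this offline sketch
```

Requests sent without the header keep the default behavior, so no change is needed to benefit from caching.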
Subscribe to the full-text RSS feed for News.