Product Changelog

What’s New in Serverless

Scaling, observability, cold starts, multi-GPU & more

March 11, 2026

Dashboard, Analytics, Serverless

Interval Selection and Chart Zoom in Analytics

The analytics pages now include an Interval picker that lets you control how metrics are bucketed — choose from 30 seconds, 1 minute, 5 minutes, 10 minutes, 1 hour, 1 day, 1 week, or 1 month depending on your selected date range
Use finer intervals (30 sec, 1 min) to pinpoint exact moments when latency spiked or error rates changed, and wider intervals (1 day, 1 week) for high-level trends over longer periods
Click and drag on any chart in App Analytics to select a time range and instantly zoom into that window — the date range updates automatically so you can isolate an incident, compare before-and-after a deployment, or drill into a traffic spike without manually adjusting date pickers
The Interval picker is available on both the Requests and Runners tabs in App Analytics, as well as in Error Analytics
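
As a rough illustration of what the interval setting changes, the sketch below buckets raw latency samples by a chosen interval and averages each bucket. The sample data and the `bucket_latencies` helper are hypothetical and not part of any fal API; the dashboard's actual aggregation may differ.

```python
from collections import defaultdict
from statistics import mean

def bucket_latencies(samples, interval_seconds):
    """Group (unix_timestamp, latency_ms) samples into fixed-width buckets.

    Returns {bucket_start_timestamp: mean_latency_ms}.
    """
    buckets = defaultdict(list)
    for timestamp, latency_ms in samples:
        # Floor each timestamp to the start of its bucket.
        bucket_start = timestamp - (timestamp % interval_seconds)
        buckets[bucket_start].append(latency_ms)
    return {start: mean(values) for start, values in sorted(buckets.items())}

# Hypothetical samples with a latency spike at t=65s.
samples = [(0, 100), (20, 110), (40, 90), (65, 900), (90, 120), (110, 100)]

fine = bucket_latencies(samples, 30)     # 30-second buckets isolate the spike
coarse = bucket_latencies(samples, 300)  # one 5-minute bucket smooths it out
```

With the 30-second interval the spike sits alone in its own bucket, while the 5-minute interval folds it into a single average, which is the trade-off between fine and wide intervals described above.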

March 11, 2026

Dashboard, Serverless

App List View

Toggle between card view and a new compact list view on the app listing page using the view switcher in the toolbar
List view displays your apps as rows with columns for type, runners, requests/min, latency, errors, machine type, and last updated — making it easy to scan and compare performance across many apps at once
Your view preference is saved automatically, so you’ll see the same layout next time you visit
Especially useful for teams managing dozens of serverless apps who need to quickly spot which apps need attention

March 11, 2026

Serverless, Model APIs

Error Types in API Responses

Error responses now include a machine-readable error_type field identifying the specific failure category — such as request_timeout, runner_disconnected, runner_scheduling_failure, or runner_server_error
Use error_type to build smarter retry logic: runner and timeout errors (e.g. runner_connection_timeout, startup_timeout) are typically transient and worth retrying, while client errors like bad_request should not be
Track error_type values in your monitoring to spot trends — for example, frequent runner_scheduling_failure errors may indicate you need to increase max_concurrency

See Error Reference — Request Error Types for the full list of error types, status codes, and handling guidance.
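
A minimal sketch of retry logic keyed on error_type follows. It assumes that a failed response is available as a dict with an "error_type" key, and that exactly the runner and timeout types named above are retryable; `send_request`, `should_retry`, and `call_with_retries` are hypothetical names, not part of a fal client library.

```python
import time

# Error types named in this changelog entry. Treating exactly these as
# retryable is an assumption; check the Error Reference for the full list.
TRANSIENT_ERROR_TYPES = {
    "request_timeout",
    "runner_connection_timeout",
    "startup_timeout",
    "runner_disconnected",
    "runner_scheduling_failure",
    "runner_server_error",
}

def should_retry(error_type):
    """Retry only failures whose error_type is in the transient set."""
    return error_type in TRANSIENT_ERROR_TYPES

def call_with_retries(send_request, max_attempts=3, base_delay=0.5, sleep=time.sleep):
    """Call send_request() until it succeeds or retrying stops making sense.

    send_request is a hypothetical callable returning a dict; a failed
    response is assumed to carry an "error_type" key.
    """
    for attempt in range(1, max_attempts + 1):
        response = send_request()
        error_type = response.get("error_type")
        if error_type is None:
            return response  # success
        if not should_retry(error_type) or attempt == max_attempts:
            return response  # give up and surface the error to the caller
        sleep(base_delay * 2 ** (attempt - 1))  # exponential backoff
```

Client errors such as bad_request fall through immediately, so a malformed request is never sent more than once.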

March 11, 2026

Serverless, Runners

Termination Grace Period

New termination_grace_period_seconds parameter lets you control how long a runner has to finish in-flight requests and run teardown() before being forcefully killed
Defaults to 5 seconds, configurable up to a maximum of 30 seconds

import fal

class MyApp(fal.App):
    termination_grace_period_seconds = 20

See Scale Your Application — Termination Grace Period for the full shutdown lifecycle and best practices.

February 25, 2026

Dashboard, Serverless

App Tagging

You can now create and assign custom tags to your serverless apps directly from the dashboard
Organize your app listing by tagging apps by team, project, model type, or any label that makes sense for your workflow
Filter by tag in the app list to quickly find the apps you need
Create, assign, and remove tags right from the app listing page — no CLI or API needed

Tags are visual labels for organizing your dashboard — they don’t affect app behavior, routing, or secrets. For isolated deployment stages (dev, staging, production), use Environments instead.

February 23, 2026

CLI, Serverless

Machine Type in Runner Output

fal runners list and fal app runners <app> now display the machine type for each runner (e.g. GPU-A100, GPU-H100)
JSON output (--output json) also includes the machine_type field for each runner
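
One way to use the new field is to count runners per machine type. The sketch below is only illustrative: it assumes the JSON output is a top-level array of runner objects, and the `runner_id` key in the sample is made up. Only the `machine_type` field is confirmed by this entry.

```python
import json
from collections import Counter

# Hypothetical sample of `fal runners list --output json`; only the
# machine_type field is confirmed by the changelog, the rest is assumed.
sample_output = """
[
  {"runner_id": "r-1", "machine_type": "GPU-H100"},
  {"runner_id": "r-2", "machine_type": "GPU-A100"},
  {"runner_id": "r-3", "machine_type": "GPU-H100"}
]
"""

def count_by_machine_type(raw_json):
    """Count runners per machine type, tolerating a missing field."""
    runners = json.loads(raw_json)
    return Counter(r.get("machine_type", "unknown") for r in runners)

counts = count_by_machine_type(sample_output)
```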

February 17, 2026

Serverless

Runner `FAILURE_DELAY` Status

Runners that fail during setup() now show a FAILURE_DELAY status, making it easier to identify runners that are in a cooldown period before retrying initialization
You can filter runners by this state using fal runners list --state failure_delay
See Understanding Runners for details

February 16, 2026

Dashboard, Logs, Serverless

Interactive Log Histogram

The logs page now features an interactive histogram that visualizes log volume over time, broken down by severity level
Click and drag to select a time range on the histogram to zoom into that window and filter your logs instantly
Zoom in and out to explore log patterns at different time granularities
Color-coded bars show the distribution of stderr, error, warning, info, and trace logs at a glance

Jump to Context

When viewing a specific log entry, use the new Jump to Context button to instantly scroll to surrounding log lines
Quickly see what happened before and after any log entry without manually searching through timestamps
Especially useful when navigating to a log from a shared link or alert

Switch Log Timezones Between UTC and Local

You can now toggle between UTC and your local timezone directly from the date picker on the logs page
All log timestamps, filters, and the histogram update instantly when switching timezones

February 12, 2026

Dashboard, Platform, Serverless

Serverless App Cards with Error Rates and Graphs

The app listing page now displays rich cards for each serverless app with at-a-glance performance metrics
Error rate indicators show the health of each app directly on the card
Inline sparkline graphs visualize request volume and error trends over time
Quickly identify apps that need attention without clicking into each one individually

February 11, 2026

Dashboard, Platform, Serverless, Logs

Share Logs with Your Team

You can now click on any log entry and share the link directly with the rest of your team
Shared log links preserve the full context of the log, making it easy to collaborate on debugging and troubleshooting

Runner Side Sheet

Click into any runner on the Runners page to open a detailed side sheet with telemetry, logs, and the ability to connect to the runner
Makes it much easier to debug and observe your runners without leaving the page

February 11, 2026

Serverless, CLI

`--auth` flag for `fal run`

You can now specify the authentication mode when running your app with fal run using the --auth flag. Supported values are public and private, giving you control over who can access your app during development and testing.

fal run path/to/myapp.py::MyApp --auth private

public — no authentication required, app owner pays
private — only you or your team can access

February 4, 2026

Dashboard, Logs

Full-Screen Logs

The logs page now opens in a full-screen view, giving you significantly more vertical and horizontal space to work with
See more log lines at once and reduce scrolling when debugging complex issues

February 3, 2026

Serverless

FalBaseModel for better input/output definitions

Define your API inputs and outputs with FalBaseModel, a Pydantic base class with built-in support for hidden fields, field ordering, and media type hints.

Hidden fields - Use Hidden(Field(...)) to mark parameters as API-only, hiding them from the playground UI while keeping them accessible via API
Field ordering - Control the order of fields in your API schema with FIELD_ORDERS
Media field helpers - Use ImageField, AudioField, VideoField, and FileField for better playground rendering

Example:

from fal.toolkit import FalBaseModel, Field, Hidden, ImageField

class TextToImageInput(FalBaseModel):
    FIELD_ORDERS = ["prompt", "negative_prompt", "image_size"]

    prompt: str = Field(description="Text description of the image")
    negative_prompt: str = Field(default="", description="What to avoid")
    image_size: str = Field(default="1024x1024")

    # Hidden from playground but accessible via API
    debug_mode: bool = Hidden(Field(default=False))
    internal_seed: int = Hidden(Field(default=-1))

class ImageToImageInput(FalBaseModel):
    image_url: str = ImageField(description="Input image")
    strength: float = Field(default=0.8)

See Handle Inputs and Outputs for details.

February 2, 2026

Dashboard, Serverless

Switch environments from app pages

You can now quickly switch between environments directly from any app page using the new environment dropdown in the dashboard.

Environment dropdown - Click the environment badge on any app page to see all environments where the app is deployed
One-click switching - Select an environment to navigate to the same app in that environment
Quick access - View environment secrets or create new environments directly from the dropdown

January 30, 2026

Serverless, Runners, Breaking Change

Track when runners are idle and waiting for work

Potential Breaking Change - The runner state model has been updated. If you’re monitoring or tracking runner states programmatically, you may need to update your integration to handle the new IDLE state.

Understand runner utilization with the new IDLE state that shows when runners are ready but not actively processing requests.

IDLE state visibility - see when runners finish processing and are waiting for new work
Better resource monitoring - distinguish between actively processing requests and waiting states
Improved observability - track idle time to optimize scaling and resource allocation

See the new Understanding Runners guide and Optimizing Cold Starts guide for complete details on runner lifecycle, states, and performance optimization.
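
If you track runner states programmatically, a defensive mapping like the sketch below keeps an integration working when a new state such as IDLE appears. The grouping into busy, idle, and starting buckets and the `utilization` helper are assumptions for illustration; only the state names come from this changelog.

```python
from collections import Counter

# State names that appear in this changelog; the real API may expose more.
BUSY_STATES = {"RUNNING"}
READY_STATES = {"IDLE"}
STARTING_STATES = {"PENDING", "DOCKER_PULL", "SETUP"}

def classify(state):
    """Map a runner state to a coarse bucket, never failing on new states."""
    state = state.upper()
    if state in BUSY_STATES:
        return "busy"
    if state in READY_STATES:
        return "idle"
    if state in STARTING_STATES:
        return "starting"
    return "other"  # e.g. FAILURE_DELAY, TERMINATED, or a future state

def utilization(states):
    """Share of ready runners that are actively processing requests."""
    buckets = Counter(classify(s) for s in states)
    ready = buckets["busy"] + buckets["idle"]
    return buckets["busy"] / ready if ready else 0.0
```

Routing unknown states to a catch-all bucket, rather than raising, is the design choice that makes future state additions non-breaking.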

January 25, 2026

Model APIs

Control queue wait time with `start_timeout`

You can now set a start_timeout on requests, which ensures that a request is aborted without starting if it waits in the queue for too long. See Client Libraries → Start Timeout for details.
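
The behavior can be pictured with a small conceptual model. This is not fal's scheduler: `QueuedRequest`, `should_start`, and `dispatch` are hypothetical names, and treating a wait exactly equal to the timeout as still allowed is an assumption.

```python
from dataclasses import dataclass

@dataclass
class QueuedRequest:
    request_id: str
    enqueued_at: float    # seconds, on any monotonic clock
    start_timeout: float  # maximum time the request may wait in the queue

def should_start(request, now):
    """Return True if the request may start, False if it must be aborted."""
    waited = now - request.enqueued_at
    return waited <= request.start_timeout

def dispatch(queue, now):
    """Split queued requests into those to start and those to abort."""
    started = [r.request_id for r in queue if should_start(r, now)]
    aborted = [r.request_id for r in queue if not should_start(r, now)]
    return started, aborted
```

A request with a 10-second start_timeout that has already waited 30 seconds is aborted without consuming any runner time, while a request with a 60-second start_timeout in the same queue still starts.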

January 21, 2026

Serverless

Environments for isolated deployments

Organize your applications, secrets, and configurations across different stages of your workflow with the new environments feature.

Create isolated environments for development, staging, and production
Environment-scoped secrets - use different API keys and credentials per environment
Deploy to specific environments using the --env flag

# Create a staging environment
fal environments create staging

# Deploy to staging
fal deploy my_app.py --env staging

# Set environment-specific secrets
fal secrets set API_KEY=staging-key --env staging

See environments documentation for details.

January 19, 2026

Serverless

Handle graceful shutdown with `handle_exit()` and `teardown()`

You can now define handle_exit() and teardown() methods in your app to handle graceful shutdown.

handle_exit() - Called when the runner is asked to terminate, so that in-flight handlers can be signaled to stop early.
teardown() - Called when the runner is shutting down to clean up resources.

See lifespan docs for details.

January 13, 2026

Serverless

Include local files in container builds with COPY and ADD

Experimental Feature - This feature is currently experimental.

You can now use standard Docker COPY and ADD commands to include local files in your container builds.

Automatic file parsing - fal parses your Dockerfile to find COPY/ADD commands and collects referenced files
Hash-based deduplication - Files are uploaded to fal’s storage with content-addressable deduplication (files from app_files are reused automatically)
.dockerignore support - Create a .dockerignore file or use add_dockerignore() to exclude files
Multi-stage build support - COPY --from=... commands are correctly handled (only local files are collected)
Smart rebuilds - Changes to your fal.App file don’t trigger rebuilds (it’s pickled separately); only changes to COPY/ADD referenced files trigger rebuilds

Example:

import fal
from fal.container import ContainerImage

dockerfile_str = """
FROM python:3.11
WORKDIR /app
COPY requirements.txt .
RUN pip install -r requirements.txt
COPY src/ ./src/
"""

class MyApp(fal.App):
    image = ContainerImage.from_dockerfile_str(dockerfile_str)

See custom container image docs.

January 9, 2026

Serverless

Skip retries for specific conditions

You can now skip retries for specific conditions using the skip_retry_conditions option.

import fal

class MyApp(fal.App):
    skip_retry_conditions = ["timeout"]  # This app won't retry on timeout
    ...

Available conditions: "server_error", "timeout". See retry policy docs for details.

January 7, 2026

Serverless

Graceful shutdown of fal apps

Starting with fal>=1.61.0, runners receive a SIGTERM signal when terminated and are given a 5-second grace period to complete ongoing requests before being forcefully terminated with SIGKILL. This applies to all termination scenarios: expiration, manual stop/kill, and scaling down. Use the teardown() method to handle cleanup during this grace period. See lifespan docs for details.
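
For readers unfamiliar with the SIGTERM-then-SIGKILL pattern, here is a generic, plain-Python sketch of a worker that stops taking new work once shutdown is requested and then runs its cleanup. It does not show fal internals: the `Worker` class and its methods are hypothetical, and in a fal app the cleanup step corresponds to teardown().

```python
import signal
import threading

class Worker:
    def __init__(self):
        self.stop = threading.Event()
        self.cleaned_up = False

    def request_shutdown(self, signum=None, frame=None):
        # Signal handlers should do as little as possible: just set a flag.
        self.stop.set()

    def run(self, jobs):
        done = []
        for job in jobs:
            if self.stop.is_set():
                break  # stop accepting new work once SIGTERM has arrived
            done.append(job())  # the in-flight job is allowed to finish
        self.cleanup()
        return done

    def cleanup(self):
        # Stand-in for teardown(): must finish inside the grace period.
        self.cleaned_up = True

worker = Worker()
# signal.signal() may only be called from the main thread.
if threading.current_thread() is threading.main_thread():
    signal.signal(signal.SIGTERM, worker.request_shutdown)
```

Because SIGKILL cannot be caught, any cleanup that has not finished when the grace period ends is simply lost, so the cleanup step should be short.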

December 22, 2025

Serverless

Add a health check endpoint to your application

Add a health check endpoint to your application to automatically replace unhealthy runners.

Health check endpoint - Pass the health_check parameter to the @fal.endpoint() decorator to configure an endpoint as your health check
Periodic checks and recovery - fal calls this endpoint periodically (every 15 seconds) and replaces the runner if the check fails for several consecutive calls

Example:

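import fal
from pydantic import BaseModel

class HealthResponse(BaseModel):  # minimal response model, assumed for this example
    status: str

class MyApp(fal.App):
    @fal.endpoint("/health", health_check=fal.HealthCheck(failure_threshold=3))
    def health(self) -> HealthResponse:
        if not self.connection.is_alive():  # self.connection is assumed to be created in setup()
            raise RuntimeError("Lost connection to the external service")
        return HealthResponse(status="ok")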

See health check endpoint docs.
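
The effect of failure_threshold can be modeled as a counter of consecutive failures. The sketch below is a simplified illustration, not fal's implementation; `HealthTracker` is a hypothetical name, and the assumption is that any successful check resets the streak.

```python
class HealthTracker:
    """Flag a runner as unhealthy after N consecutive failed checks."""

    def __init__(self, failure_threshold=3):
        self.failure_threshold = failure_threshold
        self.consecutive_failures = 0

    def record(self, check_passed):
        """Record one check result; return True if the runner should be replaced."""
        if check_passed:
            self.consecutive_failures = 0  # any success resets the streak
        else:
            self.consecutive_failures += 1
        return self.consecutive_failures >= self.failure_threshold

tracker = HealthTracker(failure_threshold=3)
results = [tracker.record(ok) for ok in [False, False, True, False, False, False]]
```

With checks every 15 seconds and a threshold of 3, a runner in this model would be replaced roughly 45 seconds after it starts failing, and a single transient failure never triggers a replacement.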

December 22, 2025

Serverless

Disable environment build cache

You can now disable the environment build cache by passing the --no-cache flag to the fal deploy or fal run command. See custom container image docs.

November 25, 2025

Serverless

Scale your application with the new scaling delay feature

Control how quickly your application scales up with the new scaling_delay setting.

Scaling delay - the number of seconds the system waits for a request to be picked up by a runner before it scales up another runner

Example:

import fal

class MyApp(fal.App):
    scaling_delay = 30  # seconds to wait before scaling up
    # ...

See scaling docs.
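
Conceptually, the scale-up decision looks something like the sketch below. This is a simplified model under the assumption that the decision depends only on how long queued requests have waited; `should_scale_up` is a hypothetical helper, not a fal function.

```python
def should_scale_up(queued_since, now, scaling_delay):
    """Return True if any queued request has waited longer than scaling_delay.

    queued_since: enqueue times (in seconds) of requests not yet picked up.
    """
    return any(now - enqueued > scaling_delay for enqueued in queued_since)
```

A larger scaling_delay gives existing runners more time to absorb a short burst before a new runner is started, at the cost of longer queue times during sustained load.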

November 17, 2025

Serverless

Reduce cold start times with shared compiled PyTorch caches

Dramatically reduce cold start times for torch.compile() models with the new inductor cache utilities.

Load pre-compiled CUDA kernels in ~2 seconds instead of recompiling for 20-30 seconds on each worker
GPU-specific caching automatically organized by GPU type (H100, H200, A100)
Two usage patterns: Manual control with load_inductor_cache() / sync_inductor_cache() or automatic with synchronized_inductor_cache() context manager
Persistent shared storage at /data/inductor-caches/<GPU_TYPE>/<cache_key>.zip
First worker compiles and shares, subsequent workers load instantly

Example:

import torch
from fal.toolkit import synchronized_inductor_cache

# Inside a method of your app (for example setup()), after self.model has been loaded:
with synchronized_inductor_cache("mymodel/v1"):
    self.model = torch.compile(self.model)
    self.warmup()  # Compilation happens once, synced automatically

See compilation cache docs.

November 14, 2025

Serverless, Dashboard

Get Slack notifications for serverless app failures

Never miss critical issues with instant Slack alerts for your serverless applications.

Connect your workspace with one-click OAuth installation
Choose notification channel from a dropdown of your Slack channels
Instant alerts for:
- App startup failures and timeouts
- Critical platform issues
- Real-time error notifications
Team visibility - everyone in the channel sees important updates
Configure at https://fal.ai/dashboard/notifications/settings

November 4, 2025

Serverless, Dashboard, Runners, Logs

Stop and kill runners directly from the dashboard

No more switching to the CLI to manage your runners. You now have full lifecycle control right from the dashboard.

Graceful shutdown or force kill runners with a single click
Access at https://fal.ai/dashboard/apps/{username}/{appname}/runners

Stream platform logs to your own endpoint with drains

Integrate fal’s logging with your existing observability stack using the new Serverless Drains feature.

Automatic log forwarding from apps, runners, and file operations in NDJSON format
Works with Datadog, Splunk, Elasticsearch, or any HTTP endpoint
Configure at https://fal.ai/dashboard/drains
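
If you point a drain at your own HTTP endpoint, the body arrives as NDJSON, meaning one JSON object per line. The sketch below shows a minimal parser; the `level` and `message` field names in the sample payload are illustrative assumptions, not the documented drain schema.

```python
import json

def parse_ndjson(body):
    """Parse an NDJSON payload: one JSON object per non-empty line."""
    records = []
    for line_number, line in enumerate(body.splitlines(), start=1):
        if not line.strip():
            continue  # tolerate blank lines between records
        try:
            records.append(json.loads(line))
        except json.JSONDecodeError as exc:
            raise ValueError(f"invalid JSON on line {line_number}") from exc
    return records

# Hypothetical payload; the field names are illustrative only.
body = (
    '{"level": "info", "message": "runner started"}\n'
    '{"level": "error", "message": "boom"}\n'
)
records = parse_ndjson(body)
errors = [r for r in records if r.get("level") == "error"]
```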

November 2, 2025

CLI, Files, Serverless

Upload larger files with improved timeout handling

We’ve significantly improved the reliability of file uploads from URLs, especially for large datasets and model files.

Extended timeout to 10 minutes for fal files upload and fal files upload-url
Upload multi-GB files without timeout errors
See fal files docs

November 1, 2025

CLI, Runners, Serverless

Restart all runners without redeploying

Apply environment changes or recover from bad states instantly with the new fal apps rollout command.

Restart all runners for an app without creating a new deployment
Graceful by default (runners finish current requests) or use --force for immediate restart
Pick up new secrets, environment variables, or clear memory issues
See fal apps rollout docs

Stop specific runners without affecting others

Target individual runners for maintenance with graceful shutdown via fal runners stop.

Stop specific runners without affecting others, useful for targeted maintenance
See fal runners docs

Debug production runners with interactive shell access

Jump directly into any running container to troubleshoot issues in real-time with fal runners shell.

SSH-like access to inspect files, environment variables, and dependencies
Debug production issues without redeploying
See fal runners shell docs

October 31, 2025

Dashboard, Serverless

See everything happening in your app with the events timeline

Complete activity history for runners, deployments, and config changes in one place.

Unified timeline of runner events, deployments, and config changes
Access at https://fal.ai/dashboard/apps/{username}/{appname}/events

October 25, 2025

Dashboard, Serverless, Onboarding

Get from zero to deployed in minutes with in-app onboarding

New interactive guide walks you through your first serverless deployment step-by-step.

Step-by-step walkthrough from installation to deployment with copy-paste examples
Access at https://fal.ai/dashboard/serverless-get-started

October 22, 2025

CLI, Files

Delete files from fal storage

Remove files and directories with the new fal files rm command.

Recursive deletion: fal files rm path/to/file-or-directory
See fal files docs

October 21, 2025

Platform API, Models API, Analytics, Serverless

Platform APIs v1 officially released

Programmatically manage your model deployments with the new Platform APIs.

Model discovery - search and metadata retrieval for 600+ models
Pricing and cost estimation - real-time pricing information
Usage tracking - detailed line items with quantities and prices
Analytics - request counts, error rates, and latency percentiles
Available at https://api.fal.ai/v1 - see docs

Get notified when you hit concurrent requests limits

Never wonder why requests are queuing—we now send notifications when you reach your concurrency limit.

Email and dashboard notifications with smart throttling (immediate, 1h, 1d, weekly)
Limit value included in 429 responses for programmatic handling

Debug errors faster with the new errors page

Comprehensive error analytics to identify and resolve issues quickly.

Server vs client error rates with 4xx/5xx breakdown and sparklines
Error timeline with status code distribution and endpoint-level breakdown
Access at https://fal.ai/dashboard/apps/{username}/{appname}/errors

October 20, 2025

CLI, Runners, Serverless

Stop or kill individual runners from the command line

Precise control over each runner’s lifecycle without touching the dashboard.

fal runners stop - gracefully stop a runner, allowing in-flight requests to complete
fal runners kill - immediately terminate a runner without waiting
See fal runners docs

October 16, 2025

Dashboard, Serverless, CLI

See exactly how long runners spend starting up

Identify GPU availability bottlenecks and optimize cold start times.

Pending uptime metrics show how long runners wait before becoming active
Track PENDING, DOCKER_PULL, and SETUP state durations separately
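
If you record state transitions yourself, per-state durations follow from the gaps between consecutive timestamps. The timeline below is made up, and `state_durations` is a hypothetical helper; it is only meant to show how the three startup phases add up to the total cold start time.

```python
def state_durations(transitions):
    """Compute seconds spent in each state.

    transitions: list of (state, entered_at_seconds), ordered by time.
    The final state has no end time yet, so it is not counted.
    """
    durations = {}
    for (state, start), (_, end) in zip(transitions, transitions[1:]):
        durations[state] = durations.get(state, 0.0) + (end - start)
    return durations

# Hypothetical timeline for one runner (timestamps in seconds).
timeline = [("PENDING", 0.0), ("DOCKER_PULL", 12.0), ("SETUP", 40.0), ("RUNNING", 55.0)]
durations = state_durations(timeline)
cold_start = sum(durations.values())
```

A long PENDING phase points at GPU availability, a long DOCKER_PULL at image size, and a long SETUP at model loading, so splitting the total this way shows where to optimize first.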

October 15, 2025

Serverless, MCP, Models API

Connect fal docs to Cursor with MCP

Access the complete fal documentation directly in Cursor using Model Context Protocol.

Complete documentation in your IDE with AI-powered suggestions
Simple setup: add fal MCP server to your mcp.json - see guide

Personalized dashboard with creator and developer views

The dashboard now adapts to your workflow with two distinct experiences.

Creator view - gallery-focused with favorite models and visual generation history
Developer view - metrics-driven with usage stats, error tracking, and API analytics
Quick stats showing credits, requests, and errors with sparklines

October 13, 2025

Serverless, Models API, Multi-GPU

Add custom headers to your API requests

Integrate seamlessly with analytics, auth, and middleware by passing custom HTTP headers.

Add custom headers for analytics, authentication, or middleware integration
Works with all client libraries

Multi-GPU inference and training with fal.distributed

Scale AI workloads across multiple GPUs with the new fal.distributed module.

Data parallelism - generate multiple outputs simultaneously (e.g., 4 images on 4 GPUs)
Model parallelism - split large models across GPUs for faster generation
Distributed training - synchronized gradient updates with DDP
Supports 2, 4, or 8 GPU configurations on H100s and A100s
See distributed docs

October 10, 2025

Dashboard, Serverless

Dedicated pages for Analytics, Runners, Logs, and Versions

Complete app details redesign gives each deployment aspect its own focused view.

New Analytics page - runner-focused metrics with date range filtering
New Runners page - app-scoped runner view with enhanced filters
New Logs page - dedicated log viewer for debugging
New Versions page - manage and view app revisions
Enhanced Overview - endpoint stats and performance metrics at a glance

October 9, 2025

Product

Compare models side-by-side in the new Sandbox

Find the perfect model by testing multiple options in parallel with the same prompt.

Run multiple models simultaneously with the same prompt
Available at https://fal.ai/sandbox

October 8, 2025

Serverless

Manage deployments from Python without async/await

New synchronous client makes serverless management feel just like the CLI.

Manage apps, runners, and deployments programmatically without async/await
Same API as CLI: client.apps.*, client.runners.*, client.deploy()
See Python client docs

October 6, 2025

Serverless

Bring your own container to any deployment

Full control over your runtime environment with custom Docker images.

Use ContainerImage.from_dockerfile_str() or ContainerImage.from_dockerfile()
Install any dependencies, tools, or system packages you need
See custom containers guide

October 3, 2025

Serverless, CLI, Dashboard, Files

Dynamic auto-scaling with percentage-based buffers

Scale more intelligently by setting concurrency buffers as percentages instead of fixed numbers.

Configure buffer as a percentage of current concurrency for dynamic scaling
See scaling docs
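
The arithmetic behind a percentage-based buffer can be sketched as follows. Rounding up and the `min_buffer` floor are assumptions of this example, and `target_runners` is a hypothetical helper; see the scaling docs for the actual configuration options.

```python
import math

def target_runners(current_concurrency, buffer_percent, min_buffer=0):
    """Runners to keep ready: current load plus a percentage-based buffer.

    Rounding up and the min_buffer floor are assumptions of this sketch.
    """
    buffer = math.ceil(current_concurrency * buffer_percent / 100)
    return current_concurrency + max(buffer, min_buffer)
```

With a 20% buffer, 10 concurrent requests yield a target of 12 runners and 100 concurrent requests yield 120, whereas a fixed buffer of 2 would give 12 and 102. The percentage form keeps headroom proportional to load.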

Runner logs with streaming and filtering

Real-time log streaming and powerful filtering for faster debugging.

Stream logs in real-time with fal runners logs --follow
Filter by time range with --since and --until
Search logs with --search parameter
Scrollable and searchable in the dashboard with SSE-powered updates
See fal runners logs docs

Include local files in your deployments automatically

Bring configs, utilities, and code from your local machine into serverless apps.

Specify files with relative or absolute paths to include at runtime
Works with fal run and fal deploy
See app files docs

Find what you need faster with reorganized navigation

Clearer dashboard structure groups features by workflow: Generate, Serverless, and Manage.

Generate group: Sandbox, Model Gallery
Serverless group: Apps, Logs, Files, Runners
Manage group: Usage, Billing, API Keys, Webhooks, Team Members

October 2, 2025

Dashboard, Runners, CLI

Know exactly which version each runner is running

Track deployments better with revision IDs shown on every runner.

Revision ID displayed on runners to track which version is running
State renamed: “DEAD” → “TERMINATED” for clarity

October 1, 2025

Logs, Dashboard, CLI, Serverless

Filter logs with custom labels and powerful queries

Find what you need instantly with EXACT/CONTAINS matching and multi-condition filters.

EXACT or CONTAINS matching for label values
Multiple conditions with OR logic (e.g., status IN ["error", "warning"])
Available in dashboard and API
Examples: error_type = "ValidationError", endpoint CONTAINS "/api/v2/"
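
The matching semantics can be modeled in a few lines. This sketch only illustrates EXACT versus CONTAINS matching combined with OR logic; the `matches` helper, the tuple-based condition format, and the sample logs are hypothetical and do not reflect the dashboard's actual query syntax or engine.

```python
def matches(labels, conditions):
    """Return True if a log's labels satisfy any condition (OR logic).

    Each condition is (label, mode, value) with mode "EXACT" or "CONTAINS".
    """
    for label, mode, value in conditions:
        actual = labels.get(label)
        if actual is None:
            continue  # a missing label can never match
        if mode == "EXACT" and actual == value:
            return True
        if mode == "CONTAINS" and value in actual:
            return True
    return False

# Hypothetical log labels.
logs = [
    {"error_type": "ValidationError", "endpoint": "/api/v1/run"},
    {"error_type": "Timeout", "endpoint": "/api/v2/generate"},
    {"error_type": "Timeout", "endpoint": "/health"},
]
conditions = [
    ("error_type", "EXACT", "ValidationError"),
    ("endpoint", "CONTAINS", "/api/v2/"),
]
selected = [log for log in logs if matches(log, conditions)]
```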

See what runners are doing during startup

Track exactly where runners are in the startup process—pending, pulling images, or setting up.

fal runners list now shows PENDING, DOCKER_PULL, and SETUP states
Understand deployment progress in real-time

View all app endpoints and config at a glance

Redesigned app details page surfaces the information you need most.

Endpoints, configuration, and status all in one place

September 27, 2025

CLI, Serverless

Monitor and clear your request queue from the CLI

Check how many requests are queued and flush them when needed.

fal queue size app_name - check queue size for an app
fal queue flush app_name - flush all pending requests
See fal queue docs

September 10, 2025

CLI, Runners, Serverless

View runner history with time-based filtering

See terminated runners and filter by state to debug failures.

fal runners list --since "1h" - view runners from the last hour (max 24h)
fal runners list --state dead - filter by state (running, pending, setup, dead)
Helpful for debugging failed deployments and understanding runner lifecycle
See fal runners list docs

August 29, 2025

CLI, Files, Serverless

Reorganize files in fal storage without re-uploading

Move and rename files instantly with the new fal files mv command.

Rename or move files in fal storage: fal files mv source destination
See fal files docs

August 26, 2025

CLI, Serverless

See all your endpoint URLs immediately when testing

No more guessing which URL to use—CLI shows playground, sync, and async routes for every run.

CLI prints playground, synchronous, and asynchronous routes for fal run