Documentation Index
Fetch the complete documentation index at: https://fal.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
How to Set Parameters
You can set scaling parameters in three ways.In Your Code
Via CLI
Adjust parameters for a deployed app without redeploying:Via Dashboard
Navigate to Dashboard > Apps > [your-app] and adjust scaling parameters from the app settings.How Parameters Behave Across Deploys
Not all parameters behave the same when you deploy. Understanding this prevents surprises.Runtime-tunable parameters
keep_alive, min_concurrency, max_concurrency, concurrency_buffer, concurrency_buffer_perc, scaling_delay, request_timeout, regions
These affect cost and performance and are safe to change without understanding the code. When you adjust them via CLI or dashboard:
- Changes take effect immediately
- Changes persist across deploys — your next
fal deployinherits the CLI/dashboard values, not what’s in your code
Code-specific parameters
max_multiplexing, startup_timeout, machine_type
These affect correctness — changing them without updating code can break your app:
max_multiplexing: Your code must handle concurrent requests. Setting this to 4 via CLI when your handlers are synchronous would break things.startup_timeout: Depends on what yoursetup()does. Only the code author knows the right value.machine_type: Your model is sized for specific GPU memory. Switching GPUs without code changes risks OOM crashes.
--reset-scale
By default, fal deploy inherits runtime-tunable params from the previous revision. If you want to discard all CLI/dashboard overrides and go back to what’s in your code:
- You want to go back to the values in your code after tuning via CLI
- You’re making a major change and want a clean slate
- You’ve updated scaling values in code and want them to take effect
Example
- Deploy:
min_concurrency=1,keep_alive=60 - CLI:
fal apps scale myapp --min-concurrency 5 --keep-alive 300- Now:
min_concurrency=5,keep_alive=300
- Now:
- Deploy again (
fal deploy):min_concurrency=5,keep_alive=300(inherited from step 2)
- Deploy with reset (
fal deploy --reset-scale):min_concurrency=1,keep_alive=60(reset to code)
Parameter Reference
See what each scaling parameter does