Adding schedule
Overview
This page describes how to configure hybrid scaling in Kyvos by combining time-based and load-based scaling within a single day.
From release 2026.5 onwards, Kyvos supports hybrid scaling which enables users to maintain fixed Query Engine capacity during predictable workload periods (time-based), such as business hours, while automatically scaling resources based on system load (load-based) during periods of variable or unpredictable activity.
Refer to the Load‑based Scaling and Schedule‑based Scaling pages for more information on these scheduling options.
On this page:
Note
Hybrid scaling is recommended for environments where workload patterns change throughout the day and require both operational stability and resource elasticity.
This section details how to create schedule-based scaling in Kyvos to automatically scale cluster resources up or down at specified times based on expected workloads, helping optimize system performance and reduce infrastructure costs.
What is fixed scheduling?
When the usage pattern of a cluster is specific, fixed-based scaling is recommended. To configure cluster scaling for a specific duration based on expected usage, use the schedule-based scaling option. This allows you to specify the duration at which the cluster runs and at what capacity.
You can view and set schedules for the Kyvos cluster and query engine capacity.
To efficiently utilize the cluster, you can create a schedule to:
Increase or decrease query engine capacity for any time interval.
Configure the days for weekdays or weekends.
Change the time format to 12 or 24 hours.
Set the cluster status as online and define the query engine capacity, or you can shut down the cluster by selecting the offline mode.
Change the Query Engine cluster state
Note
Force start is not supported on GCP.
You can view on-screen notifications that provide you with timely information about the state of the cluster.
If you use Query Engines as a compute server, Query Engines will be automatically started when the semantic model is processed, but if you have enabled scheduled-based scaling, the Query Engines will not auto start. In this case, Kyvos recommends switching to Load-based scaling.
When you specify a schedule, ensure the following:
If you modify the start time or the cluster capacity of the running schedule, the changes apply from the next schedule cycle.
If you modify the start time, end time, or capacity of the running schedule, the changes will be applicable from the next schedule.
If you modify the start time or the end time or change the cluster capacity of the past schedule, the changes apply from the next schedule cycle.
What is load-based scheduling?
When the usage pattern of a cluster is not fixed, load-based scaling is recommended. This type of scaling allows you to automatically scale up or scale down the Query Engine instances based on the resource utilization of the Query Engine instances. By configuring cluster scaling in this way, you can optimize the utilization of your cloud cluster and reduce compute costs. For more details, see the Configuring load-based scaling rules section.
Adding a schedule
To configure a schedule scaling, perform these steps:
Go to the Cluster Scaling page and click the Add Schedule button on top. The Add Schedule dialog box is displayed. Every cluster capacity option is associated with a title, and the associated title is displayed in the Title box. You can use the default title or change the title for that schedule as needed.
Select which days the event occurs and specify the time.
Select whether the service should remain online or offline in Cluster Status. For example, you may want the service to be offline on the weekends at certain times.
Select the scaling mode as Fixed Capacity or Load Based. If selecting the option Fixed Capacity, then select the capacity level for Query Engine in Run at as:
Zero capacity: Select this if you want to run the Query Engine at zero capacity.
Full capacity (100 percent): Select this if you want to run the Query Engine at full capacity.
Note
To select the Load Based scaling mode, click the load based scaling rules link to open a new window where you can view and modify the scaling rules.
Click Add. The schedule entry is displayed in the schedule.
Click Save. The added schedules are now saved.
Repeat the above steps to configure Fixed Capacity and Load‑based options for different schedules. Once all schedules are configured, you can view them on the Cluster Scaling page.
Editing a schedule
To edit an existing schedule, perform the following steps.
From the Toolbox, click Setup, and then click Cluster Scaling.
Right-click a schedule entry in the schedule, and then click Edit.
Make changes as needed and click Save.
A message is displayed when users try to sign in before a cluster goes down according to its schedule. If they are currently using the system, they will see a message that the server is scheduled to be down in 10 minutes, and after that, no requests will be served. Users are prompted to wait and retry the request later when the cluster is down.
Deleting a schedule
Open the Cluster Scaling page.
Locate the schedule that you want to delete from the calendar view.
Click the scheduled event block to display the available actions.
Select Delete from the menu.
Repeat the process if you want to delete additional schedules.
Click Save to apply and persist the changes.
Deleting a schedule removes the configured scaling event from the cluster scaling calendar. The change takes effect only after you save the configuration.