Task Group Settings

The task group is mainly used to control the concurrency of task instances, and is designed to control the pressure of other resources (it can also control the pressure of the Hadoop cluster, the cluster will have queue control it). When creating a new task definition, you can configure the corresponding task group and configure the priority of the task running in the task group.

Task Group Configuration

Create Task Group

create-taskGroup

The user clicks Resources -> Task Group Management -> Task Group option -> Create Task Group

create-taskGroup

You need to enter the information inside the picture:

  • Task group name: The name displayed when the task group is used
  • Project name: The project that the task group functions, this item is optional, if not selected, all the projects in the whole system can use this task group.
  • Resource pool size: The maximum number of concurrent task instances allowed.

View Task Group Queue

view-queue

Click the button to view task group usage information:

view-queue

Use of Task Groups

Note: The use of task groups is applicable to tasks executed by workers, such as switch nodes, condition nodes, sub_process and other node types executed by the master are not controlled by the task group.

Let’s take the shell node as an example:

use-queue

Regarding the configuration of the task group, all you need to do is to configure these parts in the red box:

  • Task group name: The task group name is displayed on the task group configuration page. Here you can only see the task group that the project has permission to access (the project is selected when creating a task group) or the task group that scope globally (no project is selected when creating a task group).

  • Priority: When there is a waiting resource, the task with high priority will be distributed to the worker by the master first. The larger the value of this part, the higher the priority.

Implementation Logic of Task Group

Get Task Group Resources

The master judges whether the task is configured with a task group when distributing the task. If the task is not configured, it is normally thrown to the worker to run; if a task group is configured, it checks whether the remaining size of the task group resource pool meets the current task operation before throwing it to the worker for execution. , if the resource pool -1 is satisfied, continue to run; if not, exit the task distribution and wait for other tasks to wake up.

Release and Wake Up

When the task that has occupied the task group resource is finished, the task group resource will be released. After the release, it will check whether there is a task waiting in the current task group. If there is, mark the task with the best priority to run, and create a new executable event. The event stores the task ID that is marked to acquire the resource, and then the task obtains the task group resource and run.

Task Group Flowchart

task_group