GPU blocks
Previously, `dstack` could process only one workload per instance at a time, even if the instance had enough resources to handle multiple workloads. With a new `blocks` [fleet](https://dstack.ai/docs/reference/dstack.yml/fleet/) property you can split the instance into blocks (virtual subinstances), allowing users to run workloads simultaneously, utilizing a fraction of GPU, CPU and memory resources.
Cloud fleet
yaml
type: fleet
name: my-fleet
nodes: 1
resources:
gpu: 8:24GB
blocks: 4 split into 4 blocks, 2 GPU per block
SSH fleet
yaml
type: fleet
name: my-fleet
ssh_config:
user: ubuntu
identity_file: ~/.ssh/id_rsa
hosts:
- hostname: 3.255.177.51
blocks: auto as many as possible, e.g., 8 GPUS -> 8 blocks
You can see how many instance blocks are currently busy in the `dstack fleet` output:
$ dstack fleet
FLEET INSTANCE BACKEND RESOURCES PRICE STATUS CREATED
fleet-gaudi2 0 ssh (remote) 152xCPU, 1007GB, 8xGaudi2 (96GB), 387.0GB (disk) $0.0 3/8 busy 56 sec ago
The remaining blocks can be used for new runs.
SSH fleets with head node
With a new [`proxy_jump`](https://dstack.ai/docs/reference/dstack.yml/fleet/#proxy_jump) fleet property, `dstack` now supports network configurations where worker nodes are located behind a head node and are not reachable directly:
yaml
type: fleet
name: my-fleet
ssh_config:
user: ubuntu
identity_file: ~/.ssh/worker_node_key
hosts:
worker nodes
- 3.255.177.51
- 3.255.177.52
head node proxy; can also be configured per worker node
proxy_jump:
hostname: 3.255.177.50
user: ubuntu
identity_file: ~/.ssh/head_node_key
Check [the documentation](https://dstack.ai/docs/concepts/fleets/#head-node) for details.
Inactivity duration
You can now configure dev environments to automatically stop after a period of inactivity by specifying `inactivity_duration`:
yaml
type: dev-environment
ide: vscode
Stop if inactive for 2 hours
inactivity_duration: 2h
A dev environment is considered inactive if there are no SSH connections to it, including VS Code connections, `ssh <run name>` shells, and attached `dstack apply` or `dstack attach` commands. For more details on using `inactivity_duration`, see the [docs](https://dstack.ai/docs/concepts/dev-environments/#inactivity-duration).
Multiple EFA interfaces
`dstack` now attaches the maximum possible number of EFA interfaces when provisioning AWS instances with EFA support. For example, when provisioning p5.48xlarge instance, `dstack` configures an [optimal set up with 32 interfaces](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/efa-acc-inst-types.html#efa-for-p5) providing total network bandwidth capacity of 3,200 Gbps, of which up to 800 Gbps can be utilized for IP network traffic.
Note: Multiple EFA interface are enabled if the `aws` backend config has `public_ips: false` set. If instances have public IPs, only one EFA interface is enabled per instance due to AWS limitations.
Volumes for distributed tasks
You can now use single-attach volumes such as AWS EBS with distributed tasks by attaching different volumes to different nodes. This is done using `dstack` variable interpolation:
yaml
type: task
nodes: 8
commands:
- ...
volumes:
- name: data-volume-${{ dstack.node_rank }}
path: /volume_data
Tip: To create volumes for all nodes using one volume configuration, specify volume name with `-n`:
shell
$ for i in {0..7}; do dstack apply -f vol.dstack.yml -n data-volume-$i -y; done
Availability zones
It's now possible to specify `availability_zone` in volume configurations:
yaml
type: volume
name: my-volume
backend: aws
region: eu-west-1
availability_zone: eu-west-1c
size: 100GB
and `availability_zones` in fleet and run configurations
yaml
type: fleet
name: my-fleet
nodes: 2
availability_zones: [eu-west-1c]
This has multiple use cases:
* Specify the same availability zone when provisioning volumes and fleets to ensure they can be used together.
* Specify a volume availability zone that has instance types that you work with.
* Create volumes for all availability zones to be able to use any zone and improve GPU availability.
The ` dstack fleet -v` and `dstack volumes -v` commands now display availability zones along with regions.
Deployment considerations
* If you deploy the `dstack` server using rolling deployments (old and new replicas co-exist), it's advised to stop runs and fleets before deploying 0.18.41. Otherwise, you may see error logs from the old replica. It should not have major implications.
What's Changed
* Implement GPU `blocks` property by un-def in https://github.com/dstackai/dstack/pull/2253
* Show deleted runs in the UI by olgenn in https://github.com/dstackai/dstack/pull/2272
* [Bug]: The UI issues many API requests when stopping multiple runs by olgenn in https://github.com/dstackai/dstack/pull/2273
* Ensure frontend displays errors when getting 400 from the server by olgenn in https://github.com/dstackai/dstack/pull/2275
* Support --name for all configurations by r4victor in https://github.com/dstackai/dstack/pull/2269
* Support per-job volumes by r4victor in https://github.com/dstackai/dstack/pull/2276
* Full EFA attachment for non-public IPs by solovyevt in https://github.com/dstackai/dstack/pull/2271
* Return deleted runs in /api/runs/list by r4victor in https://github.com/dstackai/dstack/pull/2158
* Fix process_submitted_jobs instance lock by un-def in https://github.com/dstackai/dstack/pull/2279
* Change `dstack fleet` `STATUS` for block instances by un-def in https://github.com/dstackai/dstack/pull/2280
* [Docs] Restructure concept pages to ensure dstack apply is not lost at the end of the page by peterschmidt85 in https://github.com/dstackai/dstack/pull/2283
* Allow specifying Azure resource_group by r4victor in https://github.com/dstackai/dstack/pull/2288
* Allow configuring availability zones by r4victor in https://github.com/dstackai/dstack/pull/2266
* Track SSH connections in dstack-runner by jvstme in https://github.com/dstackai/dstack/pull/2287
* Add the `inactivity_timeout` configuration option by jvstme in https://github.com/dstackai/dstack/pull/2289
* Show dev environment inactivity in `dstack ps -v` by jvstme in https://github.com/dstackai/dstack/pull/2290
* Support non-root Docker images in RunPod by jvstme in https://github.com/dstackai/dstack/pull/2286
* Fix terminating runs when job is terminated by jvstme in https://github.com/dstackai/dstack/pull/2295
* [Docs]: Dev environment inactivity duration by jvstme in https://github.com/dstackai/dstack/pull/2296
* [Docs]: Add `availability_zones` to offer filters by jvstme in https://github.com/dstackai/dstack/pull/2297
* Add head node support for SSH fleets by un-def in https://github.com/dstackai/dstack/pull/2292
* Support services with head node setup by un-def in https://github.com/dstackai/dstack/pull/2299
**Full Changelog**: https://github.com/dstackai/dstack/compare/0.18.40...0.18.41