Fallback to pinned host memory when managed memory is not supported by zcbenz · Pull Request #3075 · ml-explore/mlx

zcbenz · 2026-01-28T01:58:55Z

On the 4090, managed memory does not work on Windows (it crashes when accessed from the CPU) even though the API reports it as supported, so I'm disabling managed memory on Windows unless there is hardware unified memory support.
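The policy described above can be sketched as a small predicate. This is only an illustration, not the code in the PR: `DeviceFacts` and `use_managed_memory` are hypothetical names, and the assumption that non-Windows platforms simply trust the API report is mine.

```cpp
#include <cassert>

// Hypothetical snapshot of the inputs the policy depends on. In real code
// these would come from the CUDA runtime and from the build target.
struct DeviceFacts {
  bool api_reports_managed;  // what the driver claims about managed memory
  bool hardware_unified;     // CPU and GPU share hardware unified memory
  bool is_windows;           // platform the process runs on
};

// On Windows, use managed memory only with hardware unified memory.
// Elsewhere (assumption), trust what the API reports.
inline bool use_managed_memory(const DeviceFacts& f) {
  if (!f.api_reports_managed) {
    return false;
  }
  if (f.is_windows) {
    return f.hardware_unified;
  }
  return true;
}
```

With this shape, a discrete GPU on Windows falls back to pinned host memory even when the API claims managed memory support.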

The memory allocation code looks a little messy with all the conditions but I think it is fine for now.

awni · 2026-01-29T15:28:38Z

+    auto& d = device(i);
+    free_streams_.emplace_back(d);


I think I intentionally did this without accessing the MLX device. Maybe there was an initialization-order issue that was causing problems. I wish I had left a comment :/

But then again, maybe it's fixed now. If the tests pass, they pass.

I think you ran into #3062 (comment).

awni · 2026-01-29T15:32:27Z

+        if (supports_managed_memory()) {
+          CHECK_CUDA_ERROR(cudaMallocManaged(&data, size));
+        } else {
+          CHECK_CUDA_ERROR(cudaMallocHost(&data, size));
+        }


There are a few places that follow the pattern "if (supports_managed) do x, else do y".

It might make sense to refactor these into unifiedMalloc and unifiedFree helpers to keep the code a little more readable.
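A rough sketch of what such a pair of helpers could look like is below. To keep it buildable without the CUDA toolkit, the allocators in the `fake` namespace are stand-ins: real code would call cudaMallocManaged or cudaMallocHost and release with cudaFree or cudaFreeHost respectively. The names `unified_malloc`, `unified_free`, `UnifiedBlock`, and `UnifiedKind` are hypothetical and not taken from the PR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>

// Stand-ins for the CUDA runtime so the sketch builds without the toolkit.
namespace fake {
int managed_live = 0;  // outstanding "managed" allocations
int pinned_live = 0;   // outstanding "pinned host" allocations
inline void* malloc_managed(std::size_t n) { ++managed_live; return std::malloc(n); }
inline void* malloc_host(std::size_t n) { ++pinned_live; return std::malloc(n); }
inline void free_managed(void* p) { --managed_live; std::free(p); }
inline void free_host(void* p) { --pinned_live; std::free(p); }
}  // namespace fake

enum class UnifiedKind { Managed, PinnedHost };

struct UnifiedBlock {
  void* ptr;
  UnifiedKind kind;  // remembered so the matching free function is used
};

// Single place that decides between managed and pinned host memory.
inline UnifiedBlock unified_malloc(std::size_t size, bool managed_ok) {
  if (managed_ok) {
    return {fake::malloc_managed(size), UnifiedKind::Managed};
  }
  return {fake::malloc_host(size), UnifiedKind::PinnedHost};
}

// Releases the block with the deallocator that matches its allocator.
inline void unified_free(UnifiedBlock& b) {
  assert(b.ptr != nullptr && "empty block or double free");
  if (b.kind == UnifiedKind::Managed) {
    fake::free_managed(b.ptr);
  } else {
    fake::free_host(b.ptr);
  }
  b.ptr = nullptr;
}
```

Recording the allocation kind in the block is one possible design. It keeps the free path correct even if the support check could return a different answer later.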

awni · 2026-01-29T15:33:34Z

+    if (d.memory_pools()) {
+      CHECK_CUDA_ERROR(cudaDeviceGetDefaultMemPool(&mem_pools_[i], i));
+    }


What's the purpose of that check here? Some devices do not support memory pools?

Yeah, according to https://github.com/ml-explore/mlx/pull/2972/changes#diff-3e8aaaff4c1529bbcf6ea804df3793a6c354f2812ff63377dffec82b8ca4321d some devices do not have memory pools. Also, I just realized that cudaMallocAsync should not be used when memory pools are not available; I will make that change.
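The follow-up mentioned here amounts to gating the stream-ordered allocation path on memory pool support. A minimal sketch of that decision is below, with hypothetical names (`PoolDevice`, `AllocPath`, `choose_alloc_path`). In real code the flag could be read with cudaDeviceGetAttribute and cudaDevAttrMemoryPoolsSupported. Falling back to a synchronous cudaMalloc is my assumption, since the thread does not say which call replaces cudaMallocAsync.

```cpp
#include <cassert>

// Which allocation call a device should use.
enum class AllocPath {
  AsyncFromPool,  // cudaMallocAsync, backed by the device's default pool
  Synchronous     // assumed fallback, e.g. plain cudaMalloc
};

// Hypothetical per-device capability flag, mirroring d.memory_pools()
// from the diff above.
struct PoolDevice {
  bool memory_pools;
};

// Use the stream-ordered allocator only when the device has memory pools.
inline AllocPath choose_alloc_path(const PoolDevice& d) {
  return d.memory_pools ? AllocPath::AsyncFromPool : AllocPath::Synchronous;
}
```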

awni

Looks great. Left some minor comments. Feel free to merge when ready!

…ported Extend the Windows managed memory check from ml-explore#3075 to also apply to WSL, as the underlying behavior is the same.

awni reviewed Jan 29, 2026

Comment thread mlx/backend/cuda/allocator.cpp Outdated

awni reviewed Jan 29, 2026

awni approved these changes Jan 29, 2026

Fallback to pinned host memory when managed memory is not supported

bf8502c

zcbenz force-pushed the move-to-unified branch from c63799f to bf8502c on January 30, 2026 02:24

zcbenz merged commit 212077f into ml-explore:main Jan 30, 2026
16 checks passed

zcbenz deleted the move-to-unified branch January 30, 2026 04:18

jessegross mentioned this pull request Feb 3, 2026

Disable managed memory on WSL when concurrentManagedAccess is not supported #3095

Merged
