RT by @awnihannun:
In the new release of MLX we are bringing thread safety:

    def worker():
        print(mx.arange(10))

    threading.Thread(target=worker)

which makes parallel inference easier to implement, a feature that had been driven by projects like omlx/vmlx/vllm-mlx.

github.com/ml-explore/mlx/re…
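The snippet in the post is abbreviated: it omits the imports and never starts the thread. A slightly fuller sketch of the same idea might look like the following. It assumes an MLX release that includes the thread-safety work mentioned above; `run_in_threads` and `mlx_task` are hypothetical names introduced here for illustration and are not part of MLX.

```python
import threading


def run_in_threads(task, n_threads):
    """Run task(i) on n_threads threads and return the results in index order.

    Hypothetical helper for illustration; not part of MLX.
    """
    results = [None] * n_threads

    def worker(i):
        # Each thread writes only to its own slot, so no lock is needed here.
        results[i] = task(i)

    threads = [threading.Thread(target=worker, args=(i,)) for i in range(n_threads)]
    for t in threads:
        t.start()  # a Thread does nothing until start() is called
    for t in threads:
        t.join()  # wait for every worker before reading the results
    return results


def mlx_task(i):
    # Imported lazily so the helper above can be used without MLX installed.
    # Assumption: the installed MLX release allows array ops from worker threads.
    import mlx.core as mx

    # .item() forces evaluation of the lazy array inside the worker thread.
    return mx.arange(10).sum().item()
```

On a machine with a thread-safe MLX build, `run_in_threads(mlx_task, 4)` would be expected to return one sum per thread. The helper itself is library-agnostic, so any callable taking the thread index can be passed in place of `mlx_task`.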