metro-file-map: Spawn fewer workers for smaller workloads #1439

Open — wants to merge 2 commits into base: `main`
Conversation

robhogan (Contributor)
Summary:
`metro-file-map` currently spins up `maxWorkers` workers (processes, or threads if `enableWorkerThreads`) for any batch workload. `maxWorkers` defaults to `os.availableParallelism`.

When we have a warm cache or Saved State, we may only have a handful of changed files to process, and so frequently create more workers than we have files.

This implements a simple heuristic factor, `maxFilesPerWorker`, such that we use `Math.ceil(numFiles / maxFilesPerWorker)` parallelism (where a count of 1 "worker" means we process in-band).

The default value of 100 is somewhat arbitrary, since several factors determine the optimal value, including whether threads are used, the host platform, and the computational cost of the Haste implementation and dependency extraction. We could expose this as Metro configuration in the future.
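The heuristic above can be sketched as follows. This is an illustrative sketch, not `metro-file-map`'s actual code; the function and constant names here are assumptions.

```typescript
// Hypothetical default, matching the value described in this PR.
const DEFAULT_MAX_FILES_PER_WORKER = 100;

// Illustrative sketch of the worker-count heuristic: scale parallelism
// with the size of the workload, capped at maxWorkers.
function computeWorkerCount(
  numFiles: number,
  maxWorkers: number,
  maxFilesPerWorker: number = DEFAULT_MAX_FILES_PER_WORKER,
): number {
  // One worker per maxFilesPerWorker files, rounded up.
  const ideal = Math.ceil(numFiles / maxFilesPerWorker);
  // A result of 1 means "process in-band" (no worker processes/threads
  // are spawned at all).
  return Math.max(1, Math.min(maxWorkers, ideal));
}

// With a warm cache and 30 changed files, this processes in-band:
// computeWorkerCount(30, 8)    → 1
// A cold start over many files still uses full parallelism:
// computeWorkerCount(10000, 8) → 8
```

Under this scheme, small incremental builds avoid worker startup cost entirely, while large cold builds behave as before.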

Changelog:

  • [Performance] Don't start an excessive number of workers for hashing files during startup.

Differential Revision: D69704168

Summary:
This error message isn't accurate since Metro added symlink support, and doesn't suggest the most likely cause, which is that the requested file isn't watched.

Metro's resolver should never resolve an unwatched file, but custom resolvers (like `rnx-kit`'s) might.

Changelog: Internal

Reviewed By: vzaidman

Differential Revision: D69397742
@facebook-github-bot added the CLA Signed label (Feb 16, 2025)
@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D69704168
