MPI Tools

Table of Contents

MPI Tools
- Core MPI Utilities
- MPI + Tensorflow Utilities

Core MPI Utilities

spinup.utils.mpi_tools.mpi_avg(x)

Average a scalar or vector over MPI processes.

spinup.utils.mpi_tools.mpi_fork(n, bind_to_core=False)

Re-launches the current script with workers linked by MPI.

Also, terminates the original process that launched it.

Taken almost without modification from the Baselines function of the same name.

Parameters:
- n (int) – Number of processes to split into.
- bind_to_core (bool) – Bind each MPI process to a core.
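The relaunch pattern described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the Spinning Up source: the helper name `build_mpirun_command` is hypothetical, and it assumes an Open MPI style `mpirun` launcher that accepts `-np` and `--bind-to core`.

```python
import sys


def build_mpirun_command(n, bind_to_core=False, script_argv=None):
    """Build the command line that would relaunch a script under MPI.

    Hypothetical helper for illustration only; assumes an Open MPI style
    `mpirun` that accepts `-np` and `--bind-to core`.
    """
    if script_argv is None:
        script_argv = sys.argv
    cmd = ["mpirun", "-np", str(n)]
    if bind_to_core:
        cmd += ["--bind-to", "core"]
    # Re-run the same interpreter on the same script and arguments.
    return cmd + [sys.executable] + list(script_argv)


cmd = build_mpirun_command(4, bind_to_core=True,
                           script_argv=["train.py", "--seed", "0"])
print(" ".join(cmd))
```

A launcher built on this would typically run the command with `subprocess.check_call` and then exit the original process, matching the behavior described above. It would also need some marker, for example an environment variable, so that the relaunched workers do not try to fork again.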

spinup.utils.mpi_tools.mpi_statistics_scalar(x, with_min_and_max=False)

Get mean/std and optional min/max of scalar x across MPI processes.

Parameters:
- x – An array containing samples of the scalar to produce statistics for.
- with_min_and_max (bool) – If true, return min and max of x in addition to mean and std.
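To make the idea of cross-process statistics concrete, here is a minimal single-process sketch. It assumes the statistics are built from sum reductions over ranks; the function name `combined_statistics`, the list-of-lists input standing in for per-rank arrays, and the return order are all assumptions of this sketch rather than the library's API.

```python
import math


def combined_statistics(per_process_samples, with_min_and_max=False):
    """Mean/std (and optionally min/max) over samples held by several ranks.

    Each inner list stands in for the local array on one MPI rank, and
    each outer `sum(...)` stands in for an MPI all-reduce. Illustrative only.
    """
    # First reduction: global sum and global count give the mean.
    global_sum = sum(sum(xs) for xs in per_process_samples)
    global_n = sum(len(xs) for xs in per_process_samples)
    mean = global_sum / global_n

    # Second reduction: squared deviations from the global mean.
    global_sq = sum(sum((x - mean) ** 2 for x in xs)
                    for xs in per_process_samples)
    std = math.sqrt(global_sq / global_n)  # population std (divide by n)

    if with_min_and_max:
        # Reduce local minima and maxima; skip ranks with no samples.
        lo = min(min(xs) for xs in per_process_samples if xs)
        hi = max(max(xs) for xs in per_process_samples if xs)
        return mean, std, lo, hi
    return mean, std


ranks = [[1.0, 2.0], [3.0], [4.0, 5.0, 6.0]]
mean, std, lo, hi = combined_statistics(ranks, with_min_and_max=True)
print(mean, std, lo, hi)
```

Note that averaging the per-rank means would give the wrong answer when ranks hold different numbers of samples, which is why the sketch reduces sums and counts instead.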

spinup.utils.mpi_tools.num_procs()

Count active MPI processes.

spinup.utils.mpi_tools.proc_id()

Get rank of calling process.

MPI + Tensorflow Utilities

The spinup.utils.mpi_tf module contains a few tools that make it easy to use the AdamOptimizer across many MPI processes. This is a bit hacky; if you're looking for something more sophisticated and general-purpose, consider horovod.

class spinup.utils.mpi_tf.MpiAdamOptimizer(**kwargs)

Adam optimizer that averages gradients across MPI processes.

The compute_gradients method is taken from Baselines MpiAdamOptimizer. For documentation on method arguments, see the Tensorflow docs page for the base AdamOptimizer.

apply_gradients(grads_and_vars, global_step=None, name=None)

Same as normal apply_gradients, except sync params after update.

compute_gradients(loss, var_list, **kwargs)

Same as normal compute_gradients, except average grads over processes.

spinup.utils.mpi_tf.sync_all_params()

Sync all tf variables across MPI processes.
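The effect of averaging gradients across processes can be illustrated without MPI or Tensorflow. The sketch below simulates three ranks in one process and swaps in a plain gradient-descent step for the Adam update; the function names `average_gradients` and `sgd_step` and all numeric values are made up for illustration.

```python
def average_gradients(per_process_grads):
    """Element-wise mean of gradient vectors, one vector per simulated rank."""
    n = len(per_process_grads)
    return [sum(col) / n for col in zip(*per_process_grads)]


def sgd_step(params, grads, lr=0.1):
    """Plain gradient-descent update (a stand-in for the Adam update)."""
    return [p - lr * g for p, g in zip(params, grads)]


# Every simulated rank starts from the same parameters...
params = [[1.0, -2.0] for _ in range(3)]
# ...but computes a different local gradient from its own data.
local_grads = [[0.3, 0.0], [0.6, -0.3], [0.9, 0.3]]

avg = average_gradients(local_grads)
params = [sgd_step(p, avg) for p in params]

# All ranks applied the same averaged gradient, so they still agree.
print(avg, params[0])
```

Because every rank applies the same averaged gradient to the same starting parameters, the copies stay identical in exact arithmetic. An explicit parameter sync after each update, as described for apply_gradients above, guards against any drift between processes.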
