DeepSpeed
Index
A | B | C | D | E | F | G | I | J | L | M | N | O | P | Q | R | S | T | U | W | Z
A
activation (deepspeed.inference.config.QuantizationConfig attribute)
allgather_bucket_size (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
allgather_partitions (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
allreduce_tied_weight_gradients() (deepspeed.pipe.PipelineModule method)
api_key (deepspeed.monitor.config.CometConfig attribute)
autotuner (in module deepspeed.autotuning)
B
BackwardPass (class in deepspeed.runtime.pipe.schedule)
base_dir (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
(deepspeed.inference.config.InferenceCheckpointConfig attribute)
buffer_count (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadOptimizerConfig attribute)
(deepspeed.runtime.zero.config.DeepSpeedZeroOffloadParamConfig attribute)
buffer_size (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadParamConfig attribute)
BufferOpInstruction (class in deepspeed.runtime.pipe.schedule)
build() (deepspeed.pipe.LayerSpec method)
C
checkpoint (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
checkpoint_config (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
checkpoint_dir (deepspeed.inference.config.InferenceCheckpointConfig attribute)
ckpt_layer_path() (deepspeed.pipe.PipelineModule method)
ckpt_layer_path_list() (deepspeed.pipe.PipelineModule method)
ckpt_prefix() (deepspeed.pipe.PipelineModule method)
clone_tensors_for_torch_save() (in module deepspeed.checkpoint.utils)
config (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
contiguous_gradients (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
convert_zero_checkpoint_to_fp32_state_dict() (in module deepspeed.utils.zero_to_fp32)
cpu_offload (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
cpu_offload_param (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
cpu_offload_use_pin_memory (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
D
DataParallelSchedule (class in deepspeed.runtime.pipe.schedule)
deepspeed.profiling.flops_profiler.profiler (module)
deepspeed.runtime.pipe.schedule (module)
DeepSpeedCPUAdam (class in deepspeed.ops.adam)
device (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadOptimizerConfig attribute)
(deepspeed.runtime.zero.config.DeepSpeedZeroOffloadParamConfig attribute)
dtype (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
E
elastic_checkpoint (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
enable_cuda_graph (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
enabled (deepspeed.inference.config.DeepSpeedMoEConfig attribute)
(deepspeed.inference.config.DeepSpeedTPConfig attribute)
(deepspeed.inference.config.QuantizationConfig attribute)
(deepspeed.monitor.config.CometConfig attribute)
(deepspeed.monitor.config.CSVConfig attribute)
(deepspeed.monitor.config.TensorBoardConfig attribute)
(deepspeed.monitor.config.WandbConfig attribute)
end_profile() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
ep_group (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
(deepspeed.inference.config.DeepSpeedMoEConfig attribute)
ep_mp_group (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
(deepspeed.inference.config.DeepSpeedMoEConfig attribute)
ep_size (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
(deepspeed.inference.config.DeepSpeedMoEConfig attribute)
estimate_zero2_model_states_mem_needs_all_cold() (in module deepspeed.runtime.zero.stage_1_and_2)
estimate_zero2_model_states_mem_needs_all_live() (in module deepspeed.runtime.zero.stage_1_and_2)
estimate_zero3_model_states_mem_needs_all_cold() (in module deepspeed.runtime.zero.stage3)
estimate_zero3_model_states_mem_needs_all_live() (in module deepspeed.runtime.zero.stage3)
experiment_key (deepspeed.monitor.config.CometConfig attribute)
experiment_name (deepspeed.monitor.config.CometConfig attribute)
F
fast_init (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadOptimizerConfig attribute)
filter_match() (deepspeed.runtime.pipe.ProcessTopology method)
FlopsProfiler (class in deepspeed.profiling.flops_profiler.profiler)
forward() (deepspeed.moe.layer.MoE method)
(deepspeed.pipe.PipelineModule method)
ForwardPass (class in deepspeed.runtime.pipe.schedule)
FusedAdam (class in deepspeed.ops.adam)
FusedLamb (class in deepspeed.ops.lamb)
G
gather_16bit_weights_on_model_save (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
get_additional_losses() (deepspeed.pipe.PipelineModule method)
get_axis_comm_lists() (deepspeed.runtime.pipe.ProcessTopology method)
get_axis_list() (deepspeed.runtime.pipe.ProcessTopology method)
get_axis_names() (deepspeed.runtime.pipe.ProcessTopology method)
get_coord() (deepspeed.runtime.pipe.ProcessTopology method)
get_dim() (deepspeed.runtime.pipe.ProcessTopology method)
get_fp32_state_dict_from_zero_checkpoint() (in module deepspeed.utils.zero_to_fp32)
get_model_profile() (in module deepspeed.profiling.flops_profiler.profiler)
get_rank() (deepspeed.runtime.pipe.ProcessTopology method)
get_rank_repr() (deepspeed.runtime.pipe.ProcessTopology method)
get_total_duration() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
get_total_flops() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
get_total_macs() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
get_total_params() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
group (deepspeed.monitor.config.WandbConfig attribute)
I
ignore_unused_parameters (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
InferenceSchedule (class in deepspeed.runtime.pipe.schedule)
injection_policy (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
injection_policy_tuple (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
is_first_stage (deepspeed.runtime.pipe.schedule.PipeSchedule attribute)
is_last_stage (deepspeed.runtime.pipe.schedule.PipeSchedule attribute)
J
job_name (deepspeed.monitor.config.CSVConfig attribute)
(deepspeed.monitor.config.TensorBoardConfig attribute)
L
LayerSpec (class in deepspeed.pipe)
legacy_stage1 (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
load_from_fp32_weights (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
load_state_dict_from_zero_checkpoint() (in module deepspeed.utils.zero_to_fp32)
LoadMicroBatch (class in deepspeed.runtime.pipe.schedule)
LRRangeTest (class in deepspeed.runtime.lr_schedules)
M
max_in_cpu (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadParamConfig attribute)
max_live_parameters (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
max_out_tokens (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
max_reuse_distance (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
memory_efficient_linear (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
mics_hierarchical_params_gather (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
mics_shard_size (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
min_out_tokens (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
mode (deepspeed.monitor.config.CometConfig attribute)
model_persistence_threshold (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
module
deepspeed.profiling.flops_profiler.profiler
deepspeed.runtime.pipe.schedule
module_granularity_threshold (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
MoE (class in deepspeed.moe.layer)
moe (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
moe_experts (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
(deepspeed.inference.config.DeepSpeedMoEConfig attribute)
moe_type (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
mp_size (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
mpu (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
(deepspeed.inference.config.DeepSpeedTPConfig attribute)
N
num_micro_batches (deepspeed.runtime.pipe.schedule.PipeSchedule attribute)
num_pipe_buffers() (deepspeed.runtime.pipe.schedule.DataParallelSchedule method)
(deepspeed.runtime.pipe.schedule.InferenceSchedule method)
(deepspeed.runtime.pipe.schedule.PipeSchedule method)
(deepspeed.runtime.pipe.schedule.TrainSchedule method)
num_stages (deepspeed.runtime.pipe.schedule.PipeSchedule attribute)
nvme_path (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadOptimizerConfig attribute)
(deepspeed.runtime.zero.config.DeepSpeedZeroOffloadParamConfig attribute)
O
offload_optimizer (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
offload_param (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
OneCycle (class in deepspeed.runtime.lr_schedules)
online (deepspeed.monitor.config.CometConfig attribute)
OptimizerStep (class in deepspeed.runtime.pipe.schedule)
output_path (deepspeed.monitor.config.CSVConfig attribute)
(deepspeed.monitor.config.TensorBoardConfig attribute)
overlap_comm (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
override_module_apply (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
P
param_persistence_threshold (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
pin_memory (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadOptimizerConfig attribute)
(deepspeed.runtime.zero.config.DeepSpeedZeroOffloadParamConfig attribute)
PipeInstruction (class in deepspeed.runtime.pipe.schedule)
pipeline_loading_checkpoint (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
pipeline_read (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadOptimizerConfig attribute)
pipeline_write (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadOptimizerConfig attribute)
PipelineModule (class in deepspeed.pipe)
PipeSchedule (class in deepspeed.runtime.pipe.schedule)
prefetch_bucket_size (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
print_model_aggregated_profile() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
print_model_profile() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
ProcessTopology (class in deepspeed.runtime.pipe)
project (deepspeed.monitor.config.CometConfig attribute)
(deepspeed.monitor.config.WandbConfig attribute)
Q
qkv (deepspeed.inference.config.QuantizationConfig attribute)
quant (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
R
ratio (deepspeed.runtime.zero.config.DeepSpeedZeroOffloadOptimizerConfig attribute)
RecvActivation (class in deepspeed.runtime.pipe.schedule)
RecvGrad (class in deepspeed.runtime.pipe.schedule)
reduce_bucket_size (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
reduce_scatter (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
ReduceGrads (class in deepspeed.runtime.pipe.schedule)
ReduceTiedGrads (class in deepspeed.runtime.pipe.schedule)
replace_method (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
replace_with_kernel_inject (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
reset_profile() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
return_tuple (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
round_robin_gradients (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
S
safe_get_full_fp32_param() (in module deepspeed.utils)
safe_get_full_grad() (in module deepspeed.utils)
safe_get_full_optimizer_state() (in module deepspeed.utils)
safe_get_local_fp32_param() (in module deepspeed.utils)
safe_get_local_grad() (in module deepspeed.utils)
safe_get_local_optimizer_state() (in module deepspeed.utils)
safe_set_full_fp32_param() (in module deepspeed.utils)
safe_set_full_grad() (in module deepspeed.utils)
safe_set_full_optimizer_state() (in module deepspeed.utils)
safe_set_local_fp32_param() (in module deepspeed.utils)
safe_set_local_grad() (in module deepspeed.utils)
safe_set_local_optimizer_state() (in module deepspeed.utils)
samples_log_interval (deepspeed.monitor.config.CometConfig attribute)
save_mp_checkpoint_path (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
(deepspeed.inference.config.InferenceCheckpointConfig attribute)
SendActivation (class in deepspeed.runtime.pipe.schedule)
SendGrad (class in deepspeed.runtime.pipe.schedule)
set_empty_params (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
stage (deepspeed.runtime.pipe.schedule.PipeSchedule attribute)
(deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
stage3_gather_fp16_weights_on_model_save (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
start_profile() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
steps() (deepspeed.runtime.pipe.schedule.PipeSchedule method)
stop_profile() (deepspeed.profiling.flops_profiler.profiler.FlopsProfiler method)
sub_group_size (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
T
team (deepspeed.monitor.config.WandbConfig attribute)
tensor_parallel (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
TiedLayerSpec (class in deepspeed.pipe)
topology() (deepspeed.pipe.PipelineModule method)
tp_group (deepspeed.inference.config.DeepSpeedTPConfig attribute)
tp_size (deepspeed.inference.config.DeepSpeedTPConfig attribute)
training_mp_size (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
TrainSchedule (class in deepspeed.runtime.pipe.schedule)
transposed_mode (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
triangular_masking (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
triton_autotune (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
type (deepspeed.inference.config.DeepSpeedMoEConfig attribute)
U
use_all_reduce_for_fetch_params (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
use_multi_rank_bucket_allreduce (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
use_triton (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
W
WarmupCosineLR (class in deepspeed.runtime.lr_schedules)
WarmupDecayLR (class in deepspeed.runtime.lr_schedules)
WarmupLR (class in deepspeed.runtime.lr_schedules)
weight (deepspeed.inference.config.QuantizationConfig attribute)
workspace (deepspeed.monitor.config.CometConfig attribute)
Z
zero (deepspeed.inference.config.DeepSpeedInferenceConfig attribute)
zero_hpz_partition_size (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
zero_quantized_gradients (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
zero_quantized_nontrainable_weights (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)
zero_quantized_weights (deepspeed.runtime.zero.config.DeepSpeedZeroConfig attribute)