deepspeed

1.3

159

Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention

distributed training

1.3

Rating

Installs

Machine Learning

GitHub Signals

Last commit 0 days ago

Publisher

majiayu000

Skill Author

Related Skills

ml-pipeline, by Jeffallan, 6.4

pyvene-interventions, by zechenzhangAGI, 7.6

nnsight-remote-interpretability, by zechenzhangAGI, 7.0

mlflow, by zechenzhangAGI, 7.6
