VerifiedGlobalGit

v1.0.0

moe-comm-overlap

Name: moe-comm-overlap
Availability: InStock
Rating: 5 (497 reviews)
Author: NVIDIA-NeMo

by @NVIDIA-NeMo0 pulls

URLopenbooklet.com/s/moe-comm-overlap

Pinnedopenbooklet.com/s/moe-comm-overlap@1.0.0

APIGET /api/v1/skills/moe-comm-overlap

MoE expert-parallel communication overlap in Megatron Bridge. Use when the user asks about overlap_moe_expert_parallel_comm, MoE dispatch overlap, flex dispatcher, DeepEP overlap, or expert wgrad scheduling.

10 skills from this repoNVIDIA-NeMo/Megatron-Bridge

moe-comm-overlapviewing

cuda-graphsskills/perf-techniques/cuda-graphs/SKILL.md

Validate and use CUDA graph capture in Megatron Bridge, including local full-iteration graphs and Transformer Engine scoped graphs for attention, MLP, and MoE modules.

expert-parallel-overlapskills/perf-techniques/expert-parallel-overlap/SKILL.md

Operational guide for enabling MoE expert-parallel communication overlap in Megatron-Bridge, including config knobs, code anchors, pitfalls, and verification.

hybrid-context-parallelskills/perf-techniques/hybrid-context-parallel/SKILL.md

Operational guide for enabling hierarchical context parallelism in Megatron-Bridge, including config knobs, code anchors, pitfalls, and verification. Use when the user asks about hierarchical_context_parallel_sizes, a2a+p2p, CP scaling beyond KV heads, or multi-level context parallelism.

megatron-fsdpskills/perf-techniques/megatron-fsdp/SKILL.md

Operational guide for enabling Megatron FSDP in Megatron-Bridge, including config knobs, code anchors, pitfalls, and verification.

packed-sequences-long-contextskills/perf-techniques/packed-sequences-long-context/SKILL.md

Sequence packing and long-context training in Megatron Bridge. Use when the user asks about packed sequences, sequence packing, long context training, PackedSequenceSpecs, pack_sequences_in_batch, or CP with packing.

parallelism-strategiesskills/perf-techniques/parallelism-strategies/SKILL.md

Operational guide for choosing and combining parallelism strategies in Megatron Bridge, including sizing rules, hardware topology mapping, and combined parallelism configuration.

resiliencyskills/resiliency/SKILL.md

Resiliency features in Megatron Bridge including fault tolerance, straggler detection, in-process restart, preemption, and re-run state machine. Use when the user asks about fault tolerance, straggler detection, hang detection, automatic restart, preemption, in-process restart, checkpoint recovery, or nvidia-resiliency-ext.

sequence-packingskills/perf-techniques/sequence-packing/SKILL.md

Operational guide for enabling packed sequences and long-context config paths in Megatron-Bridge, including config knobs, code anchors, pitfalls, and verification.

tp-dp-comm-overlapskills/perf-techniques/tp-dp-comm-overlap/SKILL.md

Operational guide for enabling TP, DP, and PP communication overlap in Megatron-Bridge, including config knobs, code anchors, pitfalls, and verification.

Auto-indexed from NVIDIA-NeMo/Megatron-Bridge

Are you the author? Claim this skill to take ownership and manage it.

Related Skills

@openbooklet

graceful-error-recovery

Use this skill when a tool call, command, or API request fails. Diagnose the root cause systematically before retrying or changing approach. Do not retry the same failing call without first understanding why it failed.

1.1K0

@openbooklet

audience-aware-communication

Use this skill when writing any explanation, documentation, or response that will be read by someone else. Match vocabulary, depth, and format to the audience's expertise level before writing.

1.1K0

@openbooklet

Refactoring Expert

Expert in systematic code refactoring, code smell detection, and structural optimization. Use PROACTIVELY when encountering duplicated code, long methods, complex conditionals, or any code quality issues. Detects code smells and applies proven refactoring techniques without changing external behavior.

600

@openbooklet

Research Expert

Specialized research expert for parallel information gathering. Use for focused research tasks with clear objectives and structured output requirements.

600

@openbooklet

clarify-ambiguous-requests

Use this skill when the user's request is ambiguous, under-specified, or could be interpreted in multiple ways. If proceeding with a wrong assumption would waste significant work, always ask exactly one focused clarifying question before doing anything.

1.1K0

@openbooklet

structured-step-by-step-reasoning

Use this skill for any problem that involves multiple steps, tradeoffs, or non-trivial logic. Think out loud before answering to improve accuracy and transparency. Apply whenever the answer is not immediately obvious.

1.1K0