[AutoParallel] FillZeroForEmpty* support AutoParallel #58716
Conversation
… grad_node_info_autoparallel_1
paddle/fluid/eager/grad_node_info.h
Outdated
@@ -178,6 +190,8 @@ class GradSlotMeta {
  // Save the dist attr of the forward input Tensor for proper resharding
  // operation when compute the input Tensor's gradient
  phi::distributed::TensorDistAttr dist_attr_;
  phi::DDim dist_tensor_global_dims_;
  bool is_dist_meta{false};
Should this member have a trailing underscore, i.e. is_dist_meta_?
Good catch, that was an oversight.
if (!IsRunAutoParallel()) {{
  egr::EagerUtils::FillZeroForEmptyGradInput(&grads[{fwd_position}], input_metas[{fwd_position}]);
}}
egr::EagerUtils::FillZeroForEmptyGradInput(&grads[{fwd_position}], input_metas[{fwd_position}]);
FillZeroForEmpty* creates an all-zero grad Tensor that occupies GPU memory, but under pipeline parallelism the forward/backward of non-compute ops must not allocate any GPU memory. We need a guard on whether this function is entered:
bool rank_is_in_current_mesh = true;
if (IsRunAutoParallel()) {{
  auto mesh = std::static_pointer_cast<phi::distributed::DistTensor>(grads[{fwd_position}].impl())->dist_attr().process_mesh();
  rank_is_in_current_mesh = phi::distributed::IsCurRankInMesh(mesh);
}}
if (rank_is_in_current_mesh) {{
  egr::EagerUtils::FillZeroForEmptyGradInput(&grads[{fwd_position}], input_metas[{fwd_position}]);
}}
Oh, this case is already handled by the following logic:
if (tensor_meta.dims.size() != -1) {
  auto tensor_with_zero =
      paddle::experimental::full(phi::vectorize(tensor_meta.dims),
                                 0.0,
                                 tensor_meta.dtype,
                                 grad_in_meta.GetPlace());
  *(static_cast<phi::distributed::DistTensor*>(in_grad->impl().get())
        ->unsafe_mutable_value()) =
      *(static_cast<phi::DenseTensor*>(tensor_with_zero.impl().get()));
}
…into grad_node_info_autoparallel_1
LGTM
…8716) * grad_node_info.cc support autoparallel 1
PR types
Others
PR changes
Others
Description
FillZeroForEmpty* support AutoParallel
Local test on 8 GPUs passed.
Pcard-73145