You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
PR types
New features
PR changes
APIs
Description
使得下图所示计算图中的quant dequant weight dequant 和 matmul_v2 以及 elementwise_add 融合成 quant_linear 以便进行 int8 的推理


融合后结果:
目前实现方法:
先直接调用 delete_quant_dequant_linear_op_pass 和 delete_weight_dequant_linear_op_pass 的内容把 quant dequant 以及 weight_dequant 删掉,然后再调用自己的实现来融合剩下的 matmul_v2 以及 elementwise_add 算子变成 quant_linear
想请教的地方:
我不大清楚哪些 attributes 是必须的,我看 quant_linear op 的实现里有这些 attributes,请问是否这些 attributes 都要添加?
