【Hackathon 6th No.10】Add isposinf / isneginf / isreal / isin API to Paddle #834

NKNaN · 2024-03-22T08:23:45Z

新增 paddle.isposinf，paddle.isneginf，paddle.isreal，paddle.isin API的设计文档

paddle-bot · 2024-03-22T08:23:50Z

你的PR提交成功，感谢你对开源项目的贡献!
请检查PR提交格式和内容是否完备，具体请参考示例和模版。
Your PR has been submitted. Thanks for your contribution!
Please check its format and content. For this, you can refer to Template and Demo.

luotao1 · 2024-03-29T09:53:28Z

rfcs/APIs/20240322_api_design_for_isposinf_isneginf_isreal_isin.md

+PyTorch 中的 torch.isposinf API文档 (https://pytorch.org/docs/stable/generated/torch.isposinf.html#torch-isposinf)
+PyTorch 中的 torch.isneginf API文档 (https://pytorch.org/docs/stable/generated/torch.isneginf.html#torch-isneginf)
+PyTorch 中的 torch.isreal API文档 (https://pytorch.org/docs/stable/generated/torch.isreal.html#torch-isreal)
+PyTorch 中的 torch.isin API文档 (https://pytorch.org/docs/stable/generated/torch.isin.html#torch-isin)


这一段格式有点乱

luotao1 · 2024-03-29T09:54:13Z

rfcs/APIs/20240322_api_design_for_isposinf_isneginf_isreal_isin.md

+
+## API实现方案
+1. paddle.isposinf
+利用 paddle.isinf 与 paddle.signbit 组合实现 **(目前 paddle.signbit 中调用了 Tensor.numpy() 只能用于动态图，若需 paddle.isposinf 也能用于静态图，需要升级 paddle.signbit)**


需要升级 paddle.signbit

luotao1 · 2024-03-29T09:55:22Z

rfcs/APIs/20240322_api_design_for_isposinf_isneginf_isreal_isin.md

+利用Tensor数据类型判断和 paddle.imag 实现
+
+4. paddle.isin
+参考 pytorch 在 _decompose 中的设计：当test_elements元素个数较少时直接进行暴力搜索，较多时则采取基于排序的算法（利用 flatten，concat，index_put_，searchsorted等API组合实现）。暂时去掉 assume_unique 参数，因为**当前 paddle 的 argsort kernel 使用的是 std::sort 的不稳定排序，与 pytorch 和 numpy 的结果就会存在差异。若后期需要加 assume_unique 参数并用 argsort 实现 isin，则需要先实现 stable 的 argsort。**


则需要先实现 stable 的 argsort

这一块工作量有多大？

我可以先试试看，我看 argsort 的 kernel 里面是用的 std::sort ，可以给 kernel 加个参数 stable 默认false，然后等于 true 的时候把 std::sort 换成 std::stable_sort 。这样的话就需要把用到 argsort kernel 相关的 python api 都要稍微改一下，包括 paddle.sort 和 paddle.argsort 。

这样的话就需要把用到 argsort kernel 相关的 python api 都要稍微改一下，包括 paddle.sort 和 paddle.argsort 。

好的，等你试完后再评估下，应该不影响原有paddle.sort和paddle.argsort的行为

@NKNaN 在API易用性升级工作中，也有一个sort/argsort支持stable稳定排序的工作，先做这个API升级的工作吧，然后再做is_in API新增，你可以在先假定已经满足了你计划实现的 sort/argsort支持稳定排序这个前提下，来编写这篇设计文档。

@NKNaN 可以先把其他三个API的PR弄好，merge。等【sort/argsort支持stable稳定排序的工作】完成后，再开始isin的工作

@NKNaN 在API易用性升级工作中，也有一个sort/argsort支持stable稳定排序的工作，先做这个API升级的工作吧，然后再做is_in API新增，你可以在先假定已经满足了你计划实现的 sort/argsort支持稳定排序这个前提下，来编写这篇设计文档。

@NKNaN 可以先把其他三个API的PR弄好，merge。等【sort/argsort支持stable稳定排序的工作】完成后，再开始isin的工作

好的

luotao1 · 2024-03-29T09:56:02Z

rfcs/APIs/20240322_api_design_for_isposinf_isneginf_isreal_isin.md

+- 正确性验证：可以与 NumPy 的结果对齐；
+  - 不同 shape；
+  - 前向计算；
+  - 计算dtype类型：验证 `float64`，`int32`等；


需要写明每个API都支持哪些数据类型，paddle.isreal 下同

luotao1 · 2024-04-01T02:58:10Z

rfcs/APIs/20240322_api_design_for_isposinf_isneginf_isreal_isin.md

+- 正确性验证：可以与 NumPy 的结果对齐；
+  - 不同 shape；
+  - 前向计算；
+  - 计算dtype类型：验证 `float32`，`float64`，`int32`，`int64`（paddle.signbit 在 CPU 上的 kernel 没有注册 `float16`）；


paddle.isposinf，paddle.isneginf 这里的数据类型支持的不够全，是因为什么原因呢？

这两个需要用 isinf 和 signbit 组合实现。isinf 目前支持的是 float16、float32、float64、int32、int64，然后 signbit 目前调用了 paddle.sign，写测试案例的时候发现 sign kernel 在 CPU 上没有注册 float16 的数据类型。

如果要升级 signbit 的话可能是得再写个 signbit 的 kernel 了，可以绕过 paddle.sign。

写测试案例的时候发现 sign kernel 在 CPU 上没有注册 float16 的数据类型。

这个没问题，不需要再写个 signbit 的 kernel 了绕过 paddle.sign。sign的数据类型是上次黑客松的时候评估过的。

isinf 目前支持的是 float16、float32、float64、int32、int64

isinf 可以扩展数据类型么。paddle.signbit支持 int8，int16，int32，int64，float16，float32 或 float64。

isinf 可以扩展数据类型么。paddle.signbit支持 int8，int16，int32，int64，float16，float32 或 float64。

可以的，isinf kernel 注册的时候加一下 uint8，int8，int16 就行

这个没问题，不需要再写个 signbit 的 kernel 了绕过 paddle.sign。sign的数据类型是上次黑客松的时候评估过的。

好的，那 signbit 对静态图的兼容就用 paddle.static.nn.py_func 可以吗，因为现在最核心的部分用的是 np.copysign ，要把 Tensor 转换成 numpy 来处理

可以的，可以提个PR上来看下

luotao1 · 2024-04-01T03:00:24Z

rfcs/APIs/20240322_api_design_for_isposinf_isneginf_isreal_isin.md

+利用Tensor数据类型判断和 paddle.imag 实现
+
+4. paddle.isin
+参考 pytorch 在 _decompose 中的设计：当test_elements元素个数较少时直接进行暴力搜索，较多时则采取基于排序的算法（利用 flatten，concat，index_put_，searchsorted等API组合实现）。暂时去掉 assume_unique 参数，因为**当前 paddle 的 argsort kernel 使用的是 std::sort 的不稳定排序，与 pytorch 和 numpy 的结果就会存在差异。若后期需要加 assume_unique 参数并用 argsort 实现 isin，则需要先实现 stable 的 argsort。**


这样的话就需要把用到 argsort kernel 相关的 python api 都要稍微改一下，包括 paddle.sort 和 paddle.argsort 。

好的，等你试完后再评估下，应该不影响原有paddle.sort和paddle.argsort的行为

luotao1 · 2024-04-01T03:01:18Z

rfcs/APIs/20240322_api_design_for_isposinf_isneginf_isreal_isin.md

+
+## API实现方案
+1. paddle.isposinf
+利用 paddle.isinf 与 paddle.signbit 组合实现。**(目前 paddle.signbit 中调用了 Tensor.numpy() 只能用于动态图，需先升级 paddle.signbit 使其也能用于静态图)**


可以单独提PR来升级paddle.signbit 使其也能用于静态图

NKNaN · 2024-04-09T09:45:25Z

argsort的kernel目前应该是cpu下是用的不稳定排序算法，gpu下是用的基数排序，应该是稳定的，这个还需要改吗，如果确定要改的话，修改的地方就是在cpu下加一个稳定排序的算法，然后在gpu下加一个不稳定排序的算法，但是这样改可能会影响现有的paddle.sort和paddle.argsort，因为现有的这个两排序都不区分稳不稳定，在cpu下是不稳定的，在gpu下是稳定的

NKNaN · 2024-04-09T09:47:16Z

（gpu的kernel排序也不是完全稳定的，根据输入的形状有一种情况会调用thrust::sort_by_key，也是不稳定的，其他情况都是调用的基数排序）

luotao1 · 2024-04-18T03:59:58Z

@NKNaN 要不你把前三个RFC单独提一个PR，先合入？这样review PR的时候，保证已经有RFC在了。

NKNaN · 2024-04-18T04:01:10Z

@NKNaN 要不你把前三个RFC单独提一个PR，先合入？这样review PR的时候，保证已经有RFC在了。

好的

add rfcs

47855a0

paddle-bot bot added the contributor label Mar 22, 2024

NKNaN added 3 commits March 22, 2024 16:26

update

c7a713c

update

6b48584

update

17c3075

luotao1 mentioned this pull request Mar 25, 2024

【Hackathon 6th】开源贡献个人挑战赛 PaddlePaddle/Paddle#62905

Closed

luotao1 self-assigned this Mar 25, 2024

update

b225463

NKNaN mentioned this pull request Mar 27, 2024

【Hackathon 6th NO.10】Add isposinf / isneginf / isreal / isin API to Paddle PaddlePaddle/Paddle#63042

Closed

luotao1 reviewed Mar 29, 2024

View reviewed changes

revise rfs

2f4c6b5

luotao1 reviewed Apr 1, 2024

View reviewed changes

NKNaN mentioned this pull request Apr 3, 2024

【Hackthon 6th No. 10】Upgrade isinf to support int8 int16 uint8 -part PaddlePaddle/Paddle#63222

Merged

luotao1 mentioned this pull request Apr 18, 2024

【Hackathon 6th No.10】Add isposinf / isneginf / isreal API to Paddle - part PaddlePaddle/Paddle#63523

Merged

NKNaN mentioned this pull request Apr 18, 2024

【Hackathon 6th No.10】Add isposinf / isneginf / isreal / isin API to Paddle -part #876

Merged

luotao1 closed this Apr 29, 2024

NKNaN deleted the isposinf branch August 27, 2024 11:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

【Hackathon 6th No.10】Add isposinf / isneginf / isreal / isin API to Paddle #834

【Hackathon 6th No.10】Add isposinf / isneginf / isreal / isin API to Paddle #834

NKNaN commented Mar 22, 2024

paddle-bot bot commented Mar 22, 2024

luotao1 Mar 29, 2024

NKNaN Mar 29, 2024

luotao1 Mar 29, 2024

NKNaN Mar 29, 2024

luotao1 Mar 29, 2024

NKNaN Mar 29, 2024

luotao1 Apr 1, 2024

NKNaN Apr 1, 2024

zhwesky2010 Apr 9, 2024 •

edited

Loading

luotao1 Apr 10, 2024

NKNaN Apr 10, 2024

luotao1 Mar 29, 2024

NKNaN Mar 29, 2024

luotao1 Apr 1, 2024

NKNaN Apr 1, 2024

luotao1 Apr 1, 2024 •

edited

Loading

NKNaN Apr 1, 2024

luotao1 Apr 2, 2024

luotao1 Apr 1, 2024

luotao1 Apr 1, 2024

NKNaN Apr 1, 2024

NKNaN commented Apr 9, 2024

NKNaN commented Apr 9, 2024

luotao1 commented Apr 18, 2024

NKNaN commented Apr 18, 2024

【Hackathon 6th No.10】Add isposinf / isneginf / isreal / isin API to Paddle #834

【Hackathon 6th No.10】Add isposinf / isneginf / isreal / isin API to Paddle #834

Conversation

NKNaN commented Mar 22, 2024

paddle-bot bot commented Mar 22, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

zhwesky2010 Apr 9, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

luotao1 Apr 1, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

NKNaN commented Apr 9, 2024

NKNaN commented Apr 9, 2024

luotao1 commented Apr 18, 2024

NKNaN commented Apr 18, 2024

zhwesky2010 Apr 9, 2024 •

edited

Loading

luotao1 Apr 1, 2024 •

edited

Loading