i-robot
3e141dd499
!26667 md5 master 1123
...
Merge pull request !26667 from mindspore_ding/md5_master_1115
2021-11-24 07:36:05 +00:00
Margaret_wangrui
f9a384456a
Add the check of function return None.
2021-11-24 15:30:35 +08:00
i-robot
d461433648
!26723 full quant support skip node
...
Merge pull request !26723 from yeyunpeng2020/quant_bak2
2021-11-24 07:16:05 +00:00
Xiaoda Zhang
23387e154c
using multiple operators to implement the sematic of a large transpose
2021-11-24 15:13:47 +08:00
wenbean
31053edbe4
Use Allocator and workspace pre allocat mem in GPU
2021-11-24 14:47:12 +08:00
i-robot
2cc51099b1
!26718 [ME][Fallback] Add some fallback numpy test cases
...
Merge pull request !26718 from Margaret_wangrui/fallback
2021-11-24 06:43:13 +00:00
i-robot
dfc7d164b8
!25954 Add file storage module
...
Merge pull request !25954 from zyli2020/ps_cache_feature
2021-11-24 06:32:10 +00:00
i-robot
b472850a75
!26594 Replace std::unordered_map/set with robin-hood-hashing
...
Merge pull request !26594 from hewei/use_robin_hood
2021-11-24 06:30:36 +00:00
i-robot
2785b59b43
!26615 modify api comments for parallel
...
Merge pull request !26615 from yangzhenzhang/modify-api-comments-for-parallel
2021-11-24 06:23:34 +00:00
zhengyuanhua
c0332b4fc7
310 support image size
2021-11-24 14:15:43 +08:00
i-robot
30d182ac18
!26626 fix reduce ops axis multiple bug in GPU
...
Merge pull request !26626 from zhangbuxue/fix_reduce_ops_axis_multiple_bug_in_GPU
2021-11-24 06:15:10 +00:00
i-robot
519f14a909
!26006 slice recompute activation
...
Merge pull request !26006 from yao_yf/add_transformer_slice_activation_config
2021-11-24 06:14:28 +00:00
yeyunpeng2020
35e0526b51
full quant support skip node
2021-11-24 14:05:27 +08:00
zengxianglong
229b94e8a7
fix npu compatibility bugs and strided_slice-transpose decrease bugs
2021-11-24 11:45:09 +08:00
i-robot
de8b6db07c
!26585 add warnings for set_dump API, see I4AUIR for details
...
Merge pull request !26585 from wenkai/wk1120setdumpwarning1
2021-11-24 03:42:25 +00:00
wangshuide2020
399680a711
optimizes the error description of Transpose and get_broadcast_shape.
2021-11-24 11:35:28 +08:00
l00591931
21df240f23
Enable mindir to load initialize weight from python
2021-11-24 11:33:43 +08:00
yuchaojie
c3eea22ab4
make trace source lines more accurate
2021-11-24 11:22:49 +08:00
i-robot
66b9de8e70
!25764 [Dataset] Fix lite cv resize bilinear core dump problem
...
Merge pull request !25764 from xiefangqi/md_fix_resize_bilinear_fuzz_issue
2021-11-24 03:20:48 +00:00
i-robot
9f29ab20ba
!25764 [Dataset] Fix lite cv resize bilinear core dump problem
...
Merge pull request !25764 from xiefangqi/md_fix_resize_bilinear_fuzz_issue
2021-11-24 03:20:09 +00:00
Margaret_wangrui
16ac3638d3
[ME][Fallback] Add some fallback numpy test cases
2021-11-24 11:20:06 +08:00
He Wei
41dcac9c49
Replace std::unordered_map/set with robin-hood-hashing
...
Robin-hood-hashing (https://github.com/martinus/robin-hood-hashing )
is considered faster then std::unordered_map/set,
so we use it to improve mindspore performance.
1. robin_hood head file in `third_party/robin_hood/include`;
2. In `utils/hash_map.h` and `utils/hash_set.h`, we define:
- mindspore::HashMap as an alias of robin_hood::unordered_map;
- mindspore::HashSet as an alias of robin_hood::unordered_set;
3. Replace:
- `#include <unordered_map>` --> `#include "utils/hash_map.h"`;
- `#include <unordered_set>` --> `#include "utils/hash_set.h"`;
- `std::unordered_map` --> `mindspore::HashMap`;
- `std::unordered_set` --> `mindspore::HashSet`;
- `map.insert(std::pair(key, value))` --> `map.emplace(key, value)`;
- `[] (const std::pair<K, V> &p) {..} ` --> `[] (const auto &p) {..} `;
4. Fix issues found by switch to robin_hood:
- AnfNodeConfig hash and equal;
- Fix a bug in `Slice::operator==()`;
- Fix a bug in `CNode::HasPrimalAttr()`;
- Fix map.erase() usage bugs: `map.erase(iter++)` --> `iter = map.erase(iter)`;
- Fix some iterator invalidated problem;
5. Some std::unordered_map/set can not replace by robin_hood:
- As parameter of functions that exposed to python by pybind11;
- Use bad hash that cause robin_hood::map over_flow, such as AbstractBasePtrListHasher;
6. Update cpp unit tests;
7. Add build option '-F' to enable robin_hood, default on.
2021-11-24 10:47:40 +08:00
lizhenyu
a736171895
add file storage module
2021-11-24 10:43:05 +08:00
limingqi107
f016fe0775
unified runtime support the empty kernel graph in control flow
2021-11-24 10:37:22 +08:00
i-robot
354b7d44df
!26703 fix converter input shape
...
Merge pull request !26703 from yeyunpeng2020/converter
2021-11-24 02:33:11 +00:00
i-robot
4deba2a463
!26685 [Numpy-Native] fix searchsorted
...
Merge pull request !26685 from huangmengxi/fix_numpy
2021-11-24 02:23:04 +00:00
i-robot
a17849b669
!26567 fix allreduce notify bug
...
Merge pull request !26567 from hwjiaorui/fix-stream-label
2021-11-24 02:07:51 +00:00
i-robot
5b4dfee77c
!26704 Fix Embedding APi
...
Merge pull request !26704 from huangxinjing/code_docs_fix_embedding
2021-11-24 01:59:15 +00:00
i-robot
87cbffc456
!26641 Change the parallel strategy data to JSON format
...
Merge pull request !26641 from ougongchang/trans_json
2021-11-24 01:49:50 +00:00
i-robot
9a5fd32bd2
!26280 fix mindir export's error when using _encrypt
...
Merge pull request !26280 from lianliguang/master
2021-11-24 01:48:32 +00:00
wzw
86c5ad20c8
parallel_ut_refactoring1
2021-11-24 09:41:52 +08:00
i-robot
d996ad5e1e
!26642 [MS][LITE][develop] add c api
...
Merge pull request !26642 from sunsuodong/add_c_api_master
2021-11-24 01:33:22 +00:00
i-robot
3d0b785241
!26680 Decouple GraphKernelCluster from ME backend
...
Merge pull request !26680 from DeshiChen/1122_cluster
2021-11-24 01:32:50 +00:00
huangxinjing
55efaabe2a
Fix api doc error
2021-11-24 09:24:40 +08:00
i-robot
9ce3acff4d
!26693 Calibrator support setting symmetry
...
Merge pull request !26693 from yeyunpeng2020/quant_bak
2021-11-24 01:18:17 +00:00
i-robot
08982f4888
!26648 add more log to locate op compile failed reason
...
Merge pull request !26648 from liubuyu/SB
2021-11-24 01:15:56 +00:00
i-robot
b1d878ca6b
!26648 add more log to locate op compile failed reason
...
Merge pull request !26648 from liubuyu/SB
2021-11-24 01:15:48 +00:00
yeyunpeng2020
dc44f76ac2
fix converter input shape
2021-11-24 09:04:19 +08:00
i-robot
c4a48ee77b
!26669 Fix MD Profiler connector size order
...
Merge pull request !26669 from Mohammad Motallebi/fix_profiler_save_sampling
2021-11-24 00:37:48 +00:00
mohammad
d6c50e0f26
fix connector size order + autotune thread + minor changes
2021-11-23 11:46:48 -05:00
i-robot
3fc995a6ae
!26664 Add support float64 as input type for ReduceProd GPU op.
...
Merge pull request !26664 from hezhenhao1/add_prod
2021-11-23 15:06:05 +00:00
i-robot
46e53a51c9
!26570 [Auto-par][d-rec] Change Onehot OP type to increase partitioning quality
...
Merge pull request !26570 from petitquentin/Gather_version_update
2021-11-23 14:31:56 +00:00
luon
5485ff0cb6
[feat] [assistant] [I40GGH] add new ascend operator Cummin
2021-11-23 21:57:23 +08:00
wenkai
c493f19b6b
add warnings for set_dump API, see I4AUIR for details
2021-11-23 21:55:36 +08:00
ougongchang
c01dbf25c9
Change the parallel strategt data to JSON format
...
Because Protobuf parses the pb file data slowly, in a matter of seconds, it is changed to JSON data
2021-11-23 21:42:04 +08:00
i-robot
875f35d6d8
!26541 Fix file name and field type changes generated by HCCL in profiler.
...
Merge pull request !26541 from casgj/master_hccl
2021-11-23 13:16:28 +00:00
sunsuodong
1615edc359
add c api
2021-11-23 05:11:45 -08:00
i-robot
df25ee8c68
!26675 Optimize oom log
...
Merge pull request !26675 from tanghuikang/oom_nullptr_log
2021-11-23 13:07:49 +00:00
i-robot
20be757f18
!26430 replace short-circuit eval with deferred evaluation of backward prop function.
...
Merge pull request !26430 from xychow/replace-shortcurit-eval-with-lazy-eval
2021-11-23 12:53:23 +00:00
lby
47dbd2dd9c
add more log to locate compile failed reason
2021-11-23 20:44:02 +08:00