Commit Graph

34517 Commits

Author SHA1 Message Date
i-robot 3e141dd499 !26667 md5 master 1123
Merge pull request !26667 from mindspore_ding/md5_master_1115
2021-11-24 07:36:05 +00:00
Margaret_wangrui f9a384456a Add the check of function return None. 2021-11-24 15:30:35 +08:00
i-robot d461433648 !26723 full quant support skip node
Merge pull request !26723 from yeyunpeng2020/quant_bak2
2021-11-24 07:16:05 +00:00
Xiaoda Zhang 23387e154c using multiple operators to implement the sematic of a large transpose 2021-11-24 15:13:47 +08:00
wenbean 31053edbe4 Use Allocator and workspace pre allocat mem in GPU 2021-11-24 14:47:12 +08:00
i-robot 2cc51099b1 !26718 [ME][Fallback] Add some fallback numpy test cases
Merge pull request !26718 from Margaret_wangrui/fallback
2021-11-24 06:43:13 +00:00
i-robot dfc7d164b8 !25954 Add file storage module
Merge pull request !25954 from zyli2020/ps_cache_feature
2021-11-24 06:32:10 +00:00
i-robot b472850a75 !26594 Replace std::unordered_map/set with robin-hood-hashing
Merge pull request !26594 from hewei/use_robin_hood
2021-11-24 06:30:36 +00:00
i-robot 2785b59b43 !26615 modify api comments for parallel
Merge pull request !26615 from yangzhenzhang/modify-api-comments-for-parallel
2021-11-24 06:23:34 +00:00
zhengyuanhua c0332b4fc7 310 support image size 2021-11-24 14:15:43 +08:00
i-robot 30d182ac18 !26626 fix reduce ops axis multiple bug in GPU
Merge pull request !26626 from zhangbuxue/fix_reduce_ops_axis_multiple_bug_in_GPU
2021-11-24 06:15:10 +00:00
i-robot 519f14a909 !26006 slice recompute activation
Merge pull request !26006 from yao_yf/add_transformer_slice_activation_config
2021-11-24 06:14:28 +00:00
yeyunpeng2020 35e0526b51 full quant support skip node 2021-11-24 14:05:27 +08:00
zengxianglong 229b94e8a7 fix npu compatibility bugs and strided_slice-transpose decrease bugs 2021-11-24 11:45:09 +08:00
i-robot de8b6db07c !26585 add warnings for set_dump API, see I4AUIR for details
Merge pull request !26585 from wenkai/wk1120setdumpwarning1
2021-11-24 03:42:25 +00:00
wangshuide2020 399680a711 optimizes the error description of Transpose and get_broadcast_shape. 2021-11-24 11:35:28 +08:00
l00591931 21df240f23 Enable mindir to load initialize weight from python 2021-11-24 11:33:43 +08:00
yuchaojie c3eea22ab4 make trace source lines more accurate 2021-11-24 11:22:49 +08:00
i-robot 66b9de8e70 !25764 [Dataset] Fix lite cv resize bilinear core dump problem
Merge pull request !25764 from xiefangqi/md_fix_resize_bilinear_fuzz_issue
2021-11-24 03:20:48 +00:00
i-robot 9f29ab20ba !25764 [Dataset] Fix lite cv resize bilinear core dump problem
Merge pull request !25764 from xiefangqi/md_fix_resize_bilinear_fuzz_issue
2021-11-24 03:20:09 +00:00
Margaret_wangrui 16ac3638d3 [ME][Fallback] Add some fallback numpy test cases 2021-11-24 11:20:06 +08:00
He Wei 41dcac9c49 Replace std::unordered_map/set with robin-hood-hashing
Robin-hood-hashing (https://github.com/martinus/robin-hood-hashing)
is considered faster then std::unordered_map/set,
so we use it to improve mindspore performance.

1. robin_hood head file in `third_party/robin_hood/include`;
2. In `utils/hash_map.h` and `utils/hash_set.h`, we define:
 - mindspore::HashMap as an alias of robin_hood::unordered_map;
 - mindspore::HashSet as an alias of robin_hood::unordered_set;
3. Replace:
 - `#include <unordered_map>` --> `#include "utils/hash_map.h"`;
 - `#include <unordered_set>` --> `#include "utils/hash_set.h"`;
 - `std::unordered_map` --> `mindspore::HashMap`;
 - `std::unordered_set` --> `mindspore::HashSet`;
 - `map.insert(std::pair(key, value))` --> `map.emplace(key, value)`;
 - `[] (const std::pair<K, V> &p) {..} ` --> `[] (const auto &p) {..} `;
4. Fix issues found by switch to robin_hood:
 - AnfNodeConfig hash and equal;
 - Fix a bug in `Slice::operator==()`;
 - Fix a bug in `CNode::HasPrimalAttr()`;
 - Fix map.erase() usage bugs: `map.erase(iter++)` --> `iter = map.erase(iter)`;
 - Fix some iterator invalidated problem;
5. Some std::unordered_map/set can not replace by robin_hood:
 - As parameter of functions that exposed to python by pybind11;
 - Use bad hash that cause robin_hood::map over_flow, such as AbstractBasePtrListHasher;
6. Update cpp unit tests;
7. Add build option '-F' to enable robin_hood, default on.
2021-11-24 10:47:40 +08:00
lizhenyu a736171895 add file storage module 2021-11-24 10:43:05 +08:00
limingqi107 f016fe0775 unified runtime support the empty kernel graph in control flow 2021-11-24 10:37:22 +08:00
i-robot 354b7d44df !26703 fix converter input shape
Merge pull request !26703 from yeyunpeng2020/converter
2021-11-24 02:33:11 +00:00
i-robot 4deba2a463 !26685 [Numpy-Native] fix searchsorted
Merge pull request !26685 from huangmengxi/fix_numpy
2021-11-24 02:23:04 +00:00
i-robot a17849b669 !26567 fix allreduce notify bug
Merge pull request !26567 from hwjiaorui/fix-stream-label
2021-11-24 02:07:51 +00:00
i-robot 5b4dfee77c !26704 Fix Embedding APi
Merge pull request !26704 from huangxinjing/code_docs_fix_embedding
2021-11-24 01:59:15 +00:00
i-robot 87cbffc456 !26641 Change the parallel strategy data to JSON format
Merge pull request !26641 from ougongchang/trans_json
2021-11-24 01:49:50 +00:00
i-robot 9a5fd32bd2 !26280 fix mindir export's error when using _encrypt
Merge pull request !26280 from lianliguang/master
2021-11-24 01:48:32 +00:00
wzw 86c5ad20c8 parallel_ut_refactoring1 2021-11-24 09:41:52 +08:00
i-robot d996ad5e1e !26642 [MS][LITE][develop] add c api
Merge pull request !26642 from sunsuodong/add_c_api_master
2021-11-24 01:33:22 +00:00
i-robot 3d0b785241 !26680 Decouple GraphKernelCluster from ME backend
Merge pull request !26680 from DeshiChen/1122_cluster
2021-11-24 01:32:50 +00:00
huangxinjing 55efaabe2a Fix api doc error 2021-11-24 09:24:40 +08:00
i-robot 9ce3acff4d !26693 Calibrator support setting symmetry
Merge pull request !26693 from yeyunpeng2020/quant_bak
2021-11-24 01:18:17 +00:00
i-robot 08982f4888 !26648 add more log to locate op compile failed reason
Merge pull request !26648 from liubuyu/SB
2021-11-24 01:15:56 +00:00
i-robot b1d878ca6b !26648 add more log to locate op compile failed reason
Merge pull request !26648 from liubuyu/SB
2021-11-24 01:15:48 +00:00
yeyunpeng2020 dc44f76ac2 fix converter input shape 2021-11-24 09:04:19 +08:00
i-robot c4a48ee77b !26669 Fix MD Profiler connector size order
Merge pull request !26669 from Mohammad Motallebi/fix_profiler_save_sampling
2021-11-24 00:37:48 +00:00
mohammad d6c50e0f26 fix connector size order + autotune thread + minor changes 2021-11-23 11:46:48 -05:00
i-robot 3fc995a6ae !26664 Add support float64 as input type for ReduceProd GPU op.
Merge pull request !26664 from hezhenhao1/add_prod
2021-11-23 15:06:05 +00:00
i-robot 46e53a51c9 !26570 [Auto-par][d-rec] Change Onehot OP type to increase partitioning quality
Merge pull request !26570 from petitquentin/Gather_version_update
2021-11-23 14:31:56 +00:00
luon 5485ff0cb6 [feat] [assistant] [I40GGH] add new ascend operator Cummin 2021-11-23 21:57:23 +08:00
wenkai c493f19b6b add warnings for set_dump API, see I4AUIR for details 2021-11-23 21:55:36 +08:00
ougongchang c01dbf25c9 Change the parallel strategt data to JSON format
Because Protobuf parses the pb file data slowly, in a matter of seconds, it is changed to JSON data
2021-11-23 21:42:04 +08:00
i-robot 875f35d6d8 !26541 Fix file name and field type changes generated by HCCL in profiler.
Merge pull request !26541 from casgj/master_hccl
2021-11-23 13:16:28 +00:00
sunsuodong 1615edc359 add c api 2021-11-23 05:11:45 -08:00
i-robot df25ee8c68 !26675 Optimize oom log
Merge pull request !26675 from tanghuikang/oom_nullptr_log
2021-11-23 13:07:49 +00:00
i-robot 20be757f18 !26430 replace short-circuit eval with deferred evaluation of backward prop function.
Merge pull request !26430 from xychow/replace-shortcurit-eval-with-lazy-eval
2021-11-23 12:53:23 +00:00
lby 47dbd2dd9c add more log to locate compile failed reason 2021-11-23 20:44:02 +08:00