hesham
cf3ff77cfd
AutoTune support non-sink mode
...
Add time sts to iterator tracing
2021-11-24 17:51:25 -05:00
i-robot
eef5f3f8d4
!22925 [assistant][ops] add operator Cummin
...
Merge pull request !22925 from yangwm/cummin
2021-11-24 11:46:23 +00:00
i-robot
d251800c62
!26705 Add the check of function return None.
...
Merge pull request !26705 from Margaret_wangrui/return_none
2021-11-24 11:09:27 +00:00
i-robot
cfc6ea32ff
!24714 replace rtmemcpyxx with acl memcpy
...
Merge pull request !24714 from jjfeing/br_replace_rtmemcpyxx_with_acl_api
2021-11-24 07:47:43 +00:00
Margaret_wangrui
f9a384456a
Add the check of function return None.
2021-11-24 15:30:35 +08:00
i-robot
2cc51099b1
!26718 [ME][Fallback] Add some fallback numpy test cases
...
Merge pull request !26718 from Margaret_wangrui/fallback
2021-11-24 06:43:13 +00:00
i-robot
b472850a75
!26594 Replace std::unordered_map/set with robin-hood-hashing
...
Merge pull request !26594 from hewei/use_robin_hood
2021-11-24 06:30:36 +00:00
i-robot
519f14a909
!26006 slice recompute activation
...
Merge pull request !26006 from yao_yf/add_transformer_slice_activation_config
2021-11-24 06:14:28 +00:00
i-robot
de8b6db07c
!26585 add warnings for set_dump API, see I4AUIR for details
...
Merge pull request !26585 from wenkai/wk1120setdumpwarning1
2021-11-24 03:42:25 +00:00
l00591931
21df240f23
Enable mindir to load initialize weight from python
2021-11-24 11:33:43 +08:00
Margaret_wangrui
16ac3638d3
[ME][Fallback] Add some fallback numpy test cases
2021-11-24 11:20:06 +08:00
He Wei
41dcac9c49
Replace std::unordered_map/set with robin-hood-hashing
...
Robin-hood-hashing (https://github.com/martinus/robin-hood-hashing)
is considered faster than std::unordered_map/set,
so we use it to improve mindspore performance.
1. robin_hood header file in `third_party/robin_hood/include`;
2. In `utils/hash_map.h` and `utils/hash_set.h`, we define:
- mindspore::HashMap as an alias of robin_hood::unordered_map;
- mindspore::HashSet as an alias of robin_hood::unordered_set;
3. Replace:
- `#include <unordered_map>` --> `#include "utils/hash_map.h"`;
- `#include <unordered_set>` --> `#include "utils/hash_set.h"`;
- `std::unordered_map` --> `mindspore::HashMap`;
- `std::unordered_set` --> `mindspore::HashSet`;
- `map.insert(std::pair(key, value))` --> `map.emplace(key, value)`;
- `[] (const std::pair<K, V> &p) {..} ` --> `[] (const auto &p) {..} `;
4. Fix issues found by switching to robin_hood:
- AnfNodeConfig hash and equal;
- Fix a bug in `Slice::operator==()`;
- Fix a bug in `CNode::HasPrimalAttr()`;
- Fix map.erase() usage bugs: `map.erase(iter++)` --> `iter = map.erase(iter)`;
- Fix some iterator invalidation problems;
5. Some std::unordered_map/set uses cannot be replaced by robin_hood:
- As parameters of functions that are exposed to python by pybind11;
- Those using a bad hash that causes robin_hood::map overflow, such as AbstractBasePtrListHasher;
6. Update cpp unit tests;
7. Add build option '-F' to enable robin_hood, default on.
2021-11-24 10:47:40 +08:00
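The `map.erase(iter++)` --> `iter = map.erase(iter)` fix listed in the commit above can be illustrated with a minimal sketch. It assumes `std::unordered_map` as a stand-in for `robin_hood::unordered_map`, so that it builds without the third-party header; the `HashMap` alias here and the `EraseNegative` helper are hypothetical names for illustration, not code from the commit. The point is only that the loop continues from the iterator returned by `erase()` instead of from one incremented before the erase.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>

// Stand-in alias (assumption): the commit maps mindspore::HashMap to
// robin_hood::unordered_map; std::unordered_map is used here so the sketch
// compiles with the standard library alone.
template <typename K, typename V>
using HashMap = std::unordered_map<K, V>;

// Hypothetical helper: remove every entry whose value is negative and
// return how many entries were removed.
inline int EraseNegative(HashMap<std::string, int> *map) {
  assert(map != nullptr);
  int removed = 0;
  for (auto iter = map->begin(); iter != map->end();) {
    if (iter->second < 0) {
      // erase() returns the iterator following the removed element, so the
      // loop never reuses an iterator computed before the erase.
      iter = map->erase(iter);
      ++removed;
    } else {
      ++iter;
    }
  }
  return removed;
}
```

With a node-based container both forms happen to work, which may be why the older form went unnoticed; a table that is allowed to move elements when one is erased only guarantees the returned iterator.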
i-robot
9a5fd32bd2
!26280 fix mindir export's error when using _encrypt
...
Merge pull request !26280 from lianliguang/master
2021-11-24 01:48:32 +00:00
wzw
86c5ad20c8
parallel_ut_refactoring1
2021-11-24 09:41:52 +08:00
luon
5485ff0cb6
[feat] [assistant] [I40GGH] add new ascend operator Cummin
2021-11-23 21:57:23 +08:00
wenkai
c493f19b6b
add warnings for set_dump API, see I4AUIR for details
2021-11-23 21:55:36 +08:00
i-robot
875f35d6d8
!26541 Fix file name and field type changes generated by HCCL in profiler.
...
Merge pull request !26541 from casgj/master_hccl
2021-11-23 13:16:28 +00:00
casgj
b15c09db6d
Fix file name and field type changes generated by HCCL.
2021-11-23 12:48:59 +08:00
i-robot
76206a8894
!26162 Add automatic detection of changes to python scripts when compilation cache is enabled
...
Merge pull request !26162 from YuJianfeng/compile_cache
2021-11-23 02:54:38 +00:00
i-robot
310bbcebe1
!26539 [GraphKernel] support lite_adapter for graph kernel.
...
Merge pull request !26539 from chenlei_autodiff/lite_callback
2021-11-23 02:44:13 +00:00
jjfeing
05485d991c
replace api with acl api
2021-11-23 09:06:28 +08:00
yujianfeng
f2bd7de442
Add automatic detection of changes to python scripts when compile cache is enabled
2021-11-22 21:27:02 +00:00
i-robot
4f55a3f52f
!26275 [API] Tensor python float trans to ms float32
...
Merge pull request !26275 from kingxian/master
2021-11-22 11:13:15 +00:00
chenlei_autodiff
4c0d5dcfe7
[GraphKernel] support lite_adapter for graph kernel.
2021-11-22 15:10:30 +08:00
lianliguang
a6e5dcfb22
if the parameter total is larger than 1T, split to another file
2021-11-22 11:17:33 +08:00
kingxian
545dc83fcb
fix: `cell` does not support deep copy, tensor default dtype -> float32
2021-11-22 09:08:32 +08:00
i-robot
1b8c2ff0e9
!26414 fault_recover_by_mirror_group_fix_opt_shard
...
Merge pull request !26414 from yao_yf/fault_recover_by_mirror_group_fix_opt_shard
2021-11-20 03:06:45 +00:00
i-robot
4f34c32c6d
!26235 AutoTune v1
...
Merge pull request !26235 from danishfarid/autotune
2021-11-19 23:40:40 +00:00
Rescue
da986710f0
Updated concat-zip check and UT.
2021-11-19 17:38:16 +00:00
hesham
40797c40da
AutoTune
2021-11-19 09:27:12 -05:00
i-robot
70363899e7
!24080 [assistant][ops] Add new audio operator ComputeDeltas
...
Merge pull request !24080 from Rainfor/compute_detlas
2021-11-19 12:29:49 +00:00
i-robot
c6821bde0a
!21648 [assistant][ops] Add new data loading operator YesNoDataset
...
Merge pull request !21648 from 杨旭华/YesNoDataset
2021-11-19 09:05:07 +00:00
vddong
f465cd0242
[feat][assistant][I3T96L]add new dataset operator YesNo
2021-11-19 13:36:24 +08:00
i-robot
f38df5c888
!26234 New dataset config option, auto_offload
...
Merge pull request !26234 from markuskunej/offload_config
2021-11-19 03:44:56 +00:00
i-robot
55463892e4
!26402 add mem offload strategy
...
Merge pull request !26402 from kisnwang/add-mem-offload-strategy
2021-11-19 03:42:52 +00:00
yao_yf
188d39da83
slice_activation_in_recompute
...
slice recompute activation
2021-11-19 10:51:54 +08:00
i-robot
7559d5b798
!26494 [Auto parallel] Adjusting sharding propagation
...
Merge pull request !26494 from Xiaoda/102-adjusting-sharding-propagation
2021-11-19 02:00:46 +00:00
yao_yf
01dc4bbdf9
fix fault recover in optimizer shard
2021-11-19 09:46:27 +08:00
i-robot
3c39afad11
!24547 [MS][crowdfunding]New operator implementation, AutoAugment
...
Merge pull request !24547 from yangwm/autoaugment
2021-11-19 01:00:51 +00:00
markuskunej
29cf442521
Added dataset.config.set_auto_offload(), which will automatically attempt to offload dataset map operations if set to True.
2021-11-18 21:13:20 +00:00
i-robot
117df75afc
!26126 MD Profiling - Add Start Stop Python UT
...
Merge pull request !26126 from cathwong/ckw_ut_prof_startstop
2021-11-18 14:05:00 +00:00
i-robot
ce00ee1ad1
!25367 use acl api to control profiling
...
Merge pull request !25367 from yanghaitao/yht_condation_start_profiler
2021-11-18 12:36:14 +00:00
zhuchuang
4c3931f5c9
[feat] [assistant] [I3CKEE] add new audio operator ComputeDeltas
2021-11-18 17:18:43 +08:00
i-robot
9d6248194e
!26310 MindSpore supports loading custom aicpu kernels.
...
Merge pull request !26310 from linqingke/aicpu
2021-11-18 08:09:26 +00:00
Xiaoda Zhang
df67e74eaf
making sharding_propagation smooth, add a reshape justification:
...
1) when propagating a sharding strategy from one op to another, try to find the strategy with zero communication cost;
2) if there is no such strategy, find the strategy with minimum communication cost, and raise a warning;
2021-11-18 15:55:50 +08:00
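The two-step selection rule described in the commit above (prefer a zero-communication strategy, otherwise take the cheapest one and warn) can be sketched as follows. This is an illustration under stated assumptions, not the code in the auto-parallel module: `Candidate`, `Choice`, and `PickStrategy` are hypothetical names, and communication costs are assumed to be non-negative, so the minimum-cost candidate is the zero-cost one whenever such a candidate exists.

```cpp
#include <cassert>
#include <cstddef>
#include <iostream>
#include <vector>

// Hypothetical candidate: a strategy id and the communication cost of
// propagating it to the next operator (assumed to be >= 0).
struct Candidate {
  int strategy_id;
  double comm_cost;
};

// Result of the selection: the chosen strategy and whether a warning was raised.
struct Choice {
  int strategy_id;
  bool warned;
};

// Pick the candidate with the lowest communication cost. If even the best
// candidate has a non-zero cost, emit a warning, mirroring step 2) above.
inline Choice PickStrategy(const std::vector<Candidate> &candidates) {
  assert(!candidates.empty());
  std::size_t best = 0;
  for (std::size_t i = 1; i < candidates.size(); ++i) {
    if (candidates[i].comm_cost < candidates[best].comm_cost) {
      best = i;
    }
  }
  const bool warned = candidates[best].comm_cost > 0.0;
  if (warned) {
    std::cerr << "warning: no zero-communication strategy, using cost "
              << candidates[best].comm_cost << std::endl;
  }
  return {candidates[best].strategy_id, warned};
}
```

Returning the warning flag alongside the strategy id keeps the sketch easy to check; a real implementation would more likely report through the project's logging facility.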
kswang
7c312bd38c
add mem offload strategy
2021-11-18 15:54:16 +08:00
i-robot
9f52343a6a
!26350 add check for resizenearestneighbor parallel op
...
Merge pull request !26350 from yangzhenzhang/add-check-for-resize-op
2021-11-18 05:45:57 +00:00
despicablemme
09897ce7ec
[feat] [assistant] [I40GZP] add new data op AutoAugment
2021-11-18 11:38:18 +08:00
i-robot
707e7fbbf9
!26453 [Fallback] Add some fallback numpy test cases
...
Merge pull request !26453 from Margaret_wangrui/fallback_tests
2021-11-18 03:01:47 +00:00
Margaret_wangrui
c6582b9390
[Fallback] Add some fallback numpy test cases
2021-11-18 10:18:13 +08:00
Cathy Wong
b1094b9d81
MD Profiling - Add Start Stop Python UT
2021-11-17 13:49:43 -05:00
yanghaitao1
c94aa6b872
use profiler acl api instead
2021-11-17 09:33:31 -05:00
i-robot
4e88f2e5e8
!26345 add examples of function grad, jvp and vjp
...
Merge pull request !26345 from chenzhuo/jvp
2021-11-17 08:35:38 +00:00
linqingke
bef2923acf
MindSpore supports loading custom aicpu ops.
2021-11-17 14:58:50 +08:00
i-robot
36517de69c
!24519 [assistant][ops] Add new audio operator SlidingWindowCmn.
...
Merge pull request !24519 from Rainfor/SlidingWindowCmn
2021-11-17 06:40:43 +00:00
i-robot
9f2e1edc00
!26279 neighborexchangev2 fix bug
...
Merge pull request !26279 from TuDouNi/neighborexchangev2_bug
2021-11-17 02:29:21 +00:00
wangkc123
706f9e2cbb
[feat] [assistant] [I3CKEH] add new audio operator SlidingWindowCmn
2021-11-16 22:00:09 +08:00
yangzhenzhang
ba99e4c505
add check for resize op
2021-11-16 15:39:52 +08:00
i-robot
58b69a05ee
!20576 [assistant][ops] add new dataset loading operator LJSpeechDataset
...
Merge pull request !20576 from 杨旭华/LJSpeechDataset
2021-11-16 07:00:20 +00:00
chenzhuo
d07f24462c
add examples of grad, jvp and vjp
2021-11-16 14:15:26 +08:00
ttudu
33ac1de062
fix bug
2021-11-16 11:14:58 +08:00
i-robot
7cdef3da6e
!26334 [MD] fix occasional failure in random sampler ut
...
Merge pull request !26334 from liyong126/fix_random_sampler
2021-11-16 02:49:59 +00:00
i-robot
6bdd38399a
!25811 fault_recover_by_mirror_group
...
Merge pull request !25811 from yao_yf/fault_recover_by_mirror_group
2021-11-16 02:20:58 +00:00
liyong
9663884db3
fix occasional failure in random sampler
2021-11-16 09:53:38 +08:00
i-robot
5af8572cd7
!25488 add the set_dump API
...
Merge pull request !25488 from wenkai/wk1027setdump1
2021-11-16 01:47:32 +00:00
i-robot
0113b3eee1
!26236 Add MD Profiler save()
...
Merge pull request !26236 from Mohammad Motallebi/add_profiler_save
2021-11-16 01:04:53 +00:00
mohammad
5c8ab5f60c
add MD Profiler Save()
2021-11-15 13:18:36 -05:00
wenkai
d2eca80d40
add the set_dump API, see I4AUIR for details
2021-11-15 21:28:40 +08:00
i-robot
9dfe08b805
!25909 [Fallback] Add global Py param when processing global namespace info
...
Merge pull request !25909 from huangbingjian/update_global_params
2021-11-15 11:43:35 +00:00
i-robot
9522ee9686
!25703 [assistant][ops] Add math operator LpNorm
...
Merge pull request !25703 from 孟权令/LpNorm
2021-11-15 02:36:54 +00:00
vddong
6fe2d35ae2
[feat][assistant][I3T96D] add new dataset loading operator LJSpeechDataset
2021-11-14 14:52:15 +08:00
yao_yf
501b978d16
find data parallel common group in auto parallel
2021-11-13 10:23:45 +08:00
i-robot
04da5c2808
!26140 Add callback support to BatchOp Datasets to enable Add/Remove workers
...
Merge pull request !26140 from h.farahat/cb_batch
2021-11-12 17:23:19 +00:00
i-robot
e154dca4b1
!22193 [assistant][ops] Add new loader operator DBpedia
...
Merge pull request !22193 from 杨旭华/DBpedia
2021-11-12 09:02:28 +00:00
i-robot
0f07408425
!26034 F.grad support sens_param and fix graph_mode bug
...
Merge pull request !26034 from zhang_sss/grad
2021-11-12 06:38:15 +00:00
i-robot
e773f0621e
!25918 add Function jvp and vjp
...
Merge pull request !25918 from chenzhuo/jvp
2021-11-12 06:37:46 +00:00
i-robot
5869e02fa1
!26135 Enable tuple input of optimizer as variable
...
Merge pull request !26135 from LiangZhibo/broaden3
2021-11-12 06:29:35 +00:00
i-robot
6c587dc2d3
!25391 Support profiling of parallel strategy
...
Merge pull request !25391 from ougongchang/profiling_stategy
2021-11-12 06:07:30 +00:00
Carry955
0d26c38693
[feat][assistant][I40GXT] add new loader DBpedia
2021-11-11 19:02:22 -08:00
i-robot
7a73bae5c3
!26036 add output strategy for matmul operator
...
Merge pull request !26036 from yangzhenzhang/add-output-strategy-for-op-init
2021-11-12 01:22:22 +00:00
i-robot
e03544685b
!26049 [Auto parallel] Support reshape in sharding propagation
...
Merge pull request !26049 from Xiaoda/99-ad-reshape-adjusting-in-sharding-propagation
2021-11-12 01:14:18 +00:00
chenzhuo
f36edec3ff
add function jvp
2021-11-12 08:44:41 +08:00
hesham
2662b6d5c3
Add callback to Batch op
2021-11-11 10:58:01 -05:00
zhang__sss
0dd68aa7eb
grad
2021-11-11 23:18:35 +08:00
i-robot
1792d5f8a4
!26131 Offload Hardware Acceleration Computing
...
Merge pull request !26131 from alashkari/hw_accel_move_copy
2021-11-11 13:18:07 +00:00
ougongchang
9229f1c1ff
profiler supports collecting parallel strategy info
...
Forcibly splitting the SetNodeOutputType function into multiple functions would reduce readability, so lizard scans are blocked for it
2021-11-11 20:11:42 +08:00
i-robot
b910870ecc
!19986 [assistant][ops] Add new dataset loading operator AGNEWS
...
Merge pull request !19986 from 杨旭华/AGNEWS
2021-11-11 06:22:25 +00:00
Xiaoda Zhang
a772767265
support reshape in sharding propagation:
...
1) using 'swc index of strategy_cost_' as reshape's selected strategy;
2) when encountering reshape in BFS, select the 'swc index' with zero communication cost;
3) when encountering a reshape that is already visited, check whether there exists communication between reshape and current operator. It is OK if communication happens between two configured operators;
4) currently, two consecutive reshapes are not supported;
5) adjusting BFS structure in graph_costmodel.cc;
6) adjusting some code in step_auto_parallel.cc to avoid cyclomatic complexity.
2021-11-11 11:35:43 +08:00
l00591931
9353bd10a4
Enable tuple as variable in optimizer
2021-11-11 11:31:05 +08:00
zetongzhao
2de8142c44
profiler test
2021-11-10 11:55:19 -05:00
Megalomania
c34bbabed9
[feat] [assistant] [I3T96M] add new loader AGNEWS
2021-11-09 23:51:19 -08:00
yangzhenzhang
8431ba616c
add output strategy for op init
2021-11-10 15:35:45 +08:00
rescue
22e03ab12e
Adding offload hardware accelerator.
2021-11-10 07:03:43 +00:00
i-robot
8bf7e28fa6
!25410 add dump flag for fusion nodes
...
Merge pull request !25410 from yuchaojie/ir_fusion3
2021-11-10 02:25:08 +00:00
i-robot
d4c8dde6d2
!25923 Move IsRealKernel and IsRealCNodeKernel from AnfAlgo to AnfUtils
...
Merge pull request !25923 from DeshiChen/1101_anfalgo
2021-11-10 01:56:07 +00:00
i-robot
2e174427c9
!25963 [MD][Autotune] Add WaitFor
...
Merge pull request !25963 from harshvardhangupta/hesh_tree_mod
2021-11-09 22:43:00 +00:00
i-robot
b7244ac987
!25852 MD C++ UT: Remove old tests part 2
...
Merge pull request !25852 from hetshah/hs_remove_old_tests
2021-11-09 22:07:59 +00:00
hetshah
5c703015e4
remove old tests and move coverage
2021-11-09 21:27:37 +00:00
i-robot
61b53fee3a
!25980 Add callback support to Mappable Datasets to enable Add/Remove workers
...
Merge pull request !25980 from h.farahat/cb_mappable
2021-11-09 20:05:01 +00:00
i-robot
f1f7731fb2
!25964 Fix serdes multithreading bug
...
Merge pull request !25964 from zetongzhao/serdes_multithreading
2021-11-09 15:48:06 +00:00