i-robot
8c58c2dc1d
!26730 [Auto parallel] [MoE] Changing large transpose for performace in MoE
...
Merge pull request !26730 from Xiaoda/109-change-moe-transpose-for-performance
2021-11-25 02:22:54 +00:00
i-robot
a92d97adb2
!25850 Add CANN tensor data callback support for ascend async dump - part2
...
Merge pull request !25850 from TinaMengtingZhang/cann_api_adapt
2021-11-25 02:11:28 +00:00
i-robot
8571b6d7d1
!26608 Add Chinese API docs for Audio and Callbacks
...
Merge pull request !26608 from xiaotianci/code_docs_add_cn_api
2021-11-25 02:07:24 +00:00
i-robot
7332c935d0
!26721 optimizes the error description of Transpose and get_broadcast_shape.
...
Merge pull request !26721 from wangshuide/wsd_master_new
2021-11-25 02:05:42 +00:00
i-robot
953acc0335
!26672 Use GPU mem Allocator and workspace instead of self allocator
...
Merge pull request !26672 from wuwenbing/master
2021-11-25 01:49:14 +00:00
ms_yan
556f1e5bc0
add part of dataset api
2021-11-25 09:46:28 +08:00
Xiaoda Zhang
6d7eaea884
1) fix the int64_t and size_t mixup problem; 2) avoid the <= in std::sort
2021-11-25 09:26:15 +08:00
i-robot
a78b4fd60f
!26722 vm err log modify
...
Merge pull request !26722 from zhaosida/code_clean_master
2021-11-25 01:18:18 +00:00
ZPaC
e01e67b921
Adapt dlopen macro to windows.
2021-11-25 09:04:01 +08:00
i-robot
c6745ecda6
!26758 Fix for GPU dump dataset_sin_mode=True issue
...
Merge pull request !26758 from parastooashtari/gpu_dump_sink_issue
2021-11-25 01:02:04 +00:00
hesham
cf3ff77cfd
AutoTune support non-sink mode
...
Add time sts to iterator tracing
2021-11-24 17:51:25 -05:00
Parastoo Ashtari
8f12b496b4
fix GPU dump sink=True issue
2021-11-24 14:33:09 -05:00
TinaMengtingZhang
07b653103e
Support Cann callback api for ascend async dump
2021-11-24 10:35:29 -05:00
i-robot
a0ab39248d
!26649 move a segment ops infer function to inner
...
Merge pull request !26649 from lianliguang/add-functoll-api-for-primitive
2021-11-24 12:19:24 +00:00
i-robot
049d83cbb6
!25702 [assistant][ops]New operator implementation, include ApplyGradientDescent
...
Merge pull request !25702 from 张璇/ApplyGradientDescent
2021-11-24 11:48:05 +00:00
i-robot
eef5f3f8d4
!22925 [assistant][ops] add operator Cummin
...
Merge pull request !22925 from yangwm/cummin
2021-11-24 11:46:23 +00:00
i-robot
d251800c62
!26705 Add the check of function return None.
...
Merge pull request !26705 from Margaret_wangrui/return_none
2021-11-24 11:09:27 +00:00
lilei
f8827a09fb
modify parallel API note for master
2021-11-24 17:44:35 +08:00
huangbingjian
00f3fa99e2
add fallback testcases
2021-11-24 17:39:55 +08:00
dayschan
174e4ea3ce
Bugfix in ConvertNonscalarTensorToParameter
...
In the last commit (015d7354c7
), I deleted the check on whether different ValueNode have
same tensor value, but forgot the situation that several nodes use the same ValueNode,
in this case, the function will create several parameter for the same ValueNode, but all
ValueNode is replaced with the first parameter, and the remaining parameters are not used.
This will result in a "parameter has no user" error.
Use a std::set for the ValueNodes can resolve this problem.
2021-11-24 17:34:01 +08:00
Liu_Xuu
7176613889
[MSLITE] fix mem leak and bugs for diverse networks compatibility in tensorrt delegate
2021-11-24 17:28:23 +08:00
i-robot
fa5ea7b3a6
!26370 DynamicRNNGrad support `hidden_size not multiple of 16` scene
...
Merge pull request !26370 from yuchaojie/ir_fusion4
2021-11-24 09:20:30 +00:00
Xiao Tianci
a06f4a92c8
add chinese api
2021-11-24 17:17:44 +08:00
songyuanwei
8d212d4812
add loss landscape
2021-11-24 17:10:57 +08:00
i-robot
43fabaeef8
!26668 add api cn
...
Merge pull request !26668 from yingchen/code_docs_api6
2021-11-24 08:50:51 +00:00
i-robot
289ff6518b
!26681 [MS][LITE] fix npu compatibility bugs and strided_slice-transpose decrease bugs
...
Merge pull request !26681 from XianglongZeng/myms_new
2021-11-24 08:44:58 +00:00
zhangxuetong
9ef0fa84bb
support parameter cache and distribution predict
2021-11-24 16:38:46 +08:00
yingchen
6f7a104e7c
update api cn
2021-11-24 16:34:48 +08:00
zhaosida
8759ac41ad
fix err log
2021-11-24 16:34:29 +08:00
i-robot
6c0547106e
!26697 make trace source lines more accurate
...
Merge pull request !26697 from yuchaojie/ir_fusion
2021-11-24 08:30:17 +00:00
limingqi107
79765ceb7f
fix the bug of single thread execution stack overflow
2021-11-24 16:24:04 +08:00
i-robot
1b1401e73d
!26577 update dataset chinese comment
...
Merge pull request !26577 from luoyang/code_docs_chinese
2021-11-24 08:21:16 +00:00
VectorSL
710289a72d
add tensor array
2021-11-24 16:12:04 +08:00
i-robot
cfc6ea32ff
!24714 replace rtmemcpyxx to acl memcpy
...
Merge pull request !24714 from jjfeing/br_replace_rtmemcpyxx_with_acl_api
2021-11-24 07:47:43 +00:00
i-robot
3e141dd499
!26667 md5 master 1123
...
Merge pull request !26667 from mindspore_ding/md5_master_1115
2021-11-24 07:36:05 +00:00
Margaret_wangrui
f9a384456a
Add the check of function return None.
2021-11-24 15:30:35 +08:00
i-robot
d461433648
!26723 full quant support skip node
...
Merge pull request !26723 from yeyunpeng2020/quant_bak2
2021-11-24 07:16:05 +00:00
Xiaoda Zhang
23387e154c
using multiple operators to implement the sematic of a large transpose
2021-11-24 15:13:47 +08:00
wenbean
31053edbe4
Use Allocator and workspace pre allocat mem in GPU
2021-11-24 14:47:12 +08:00
i-robot
2cc51099b1
!26718 [ME][Fallback] Add some fallback numpy test cases
...
Merge pull request !26718 from Margaret_wangrui/fallback
2021-11-24 06:43:13 +00:00
i-robot
dfc7d164b8
!25954 Add file storage module
...
Merge pull request !25954 from zyli2020/ps_cache_feature
2021-11-24 06:32:10 +00:00
i-robot
b472850a75
!26594 Replace std::unordered_map/set with robin-hood-hashing
...
Merge pull request !26594 from hewei/use_robin_hood
2021-11-24 06:30:36 +00:00
i-robot
2785b59b43
!26615 modify api comments for parallel
...
Merge pull request !26615 from yangzhenzhang/modify-api-comments-for-parallel
2021-11-24 06:23:34 +00:00
zhengyuanhua
c0332b4fc7
310 support image size
2021-11-24 14:15:43 +08:00
i-robot
30d182ac18
!26626 fix reduce ops axis multiple bug in GPU
...
Merge pull request !26626 from zhangbuxue/fix_reduce_ops_axis_multiple_bug_in_GPU
2021-11-24 06:15:10 +00:00
i-robot
519f14a909
!26006 slice recompute activation
...
Merge pull request !26006 from yao_yf/add_transformer_slice_activation_config
2021-11-24 06:14:28 +00:00
yeyunpeng2020
35e0526b51
full quant support skip node
2021-11-24 14:05:27 +08:00
zengxianglong
229b94e8a7
fix npu compatibility bugs and strided_slice-transpose decrease bugs
2021-11-24 11:45:09 +08:00
i-robot
de8b6db07c
!26585 add warnings for set_dump API, see I4AUIR for details
...
Merge pull request !26585 from wenkai/wk1120setdumpwarning1
2021-11-24 03:42:25 +00:00
wangshuide2020
399680a711
optimizes the error description of Transpose and get_broadcast_shape.
2021-11-24 11:35:28 +08:00