Commit Graph

34449 Commits

Author SHA1 Message Date
i-robot 8c58c2dc1d !26730 [Auto parallel] [MoE] Changing large transpose for performace in MoE
Merge pull request !26730 from Xiaoda/109-change-moe-transpose-for-performance
2021-11-25 02:22:54 +00:00
i-robot a92d97adb2 !25850 Add CANN tensor data callback support for ascend async dump - part2
Merge pull request !25850 from TinaMengtingZhang/cann_api_adapt
2021-11-25 02:11:28 +00:00
i-robot 8571b6d7d1 !26608 Add Chinese API docs for Audio and Callbacks
Merge pull request !26608 from xiaotianci/code_docs_add_cn_api
2021-11-25 02:07:24 +00:00
i-robot 7332c935d0 !26721 optimizes the error description of Transpose and get_broadcast_shape.
Merge pull request !26721 from wangshuide/wsd_master_new
2021-11-25 02:05:42 +00:00
i-robot 953acc0335 !26672 Use GPU mem Allocator and workspace instead of self allocator
Merge pull request !26672 from wuwenbing/master
2021-11-25 01:49:14 +00:00
ms_yan 556f1e5bc0 add part of dataset api 2021-11-25 09:46:28 +08:00
Xiaoda Zhang 6d7eaea884 1) fix the int64_t and size_t mixup problem; 2) avoid the <= in std::sort 2021-11-25 09:26:15 +08:00
i-robot a78b4fd60f !26722 vm err log modify
Merge pull request !26722 from zhaosida/code_clean_master
2021-11-25 01:18:18 +00:00
ZPaC e01e67b921 Adapt dlopen macro to windows. 2021-11-25 09:04:01 +08:00
i-robot c6745ecda6 !26758 Fix for GPU dump dataset_sin_mode=True issue
Merge pull request !26758 from parastooashtari/gpu_dump_sink_issue
2021-11-25 01:02:04 +00:00
hesham cf3ff77cfd AutoTune support non-sink mode
Add time sts to iterator tracing
2021-11-24 17:51:25 -05:00
Parastoo Ashtari 8f12b496b4 fix GPU dump sink=True issue 2021-11-24 14:33:09 -05:00
TinaMengtingZhang 07b653103e Support Cann callback api for ascend async dump 2021-11-24 10:35:29 -05:00
i-robot a0ab39248d !26649 move a segment ops infer function to inner
Merge pull request !26649 from lianliguang/add-functoll-api-for-primitive
2021-11-24 12:19:24 +00:00
i-robot 049d83cbb6 !25702 [assistant][ops]New operator implementation, include ApplyGradientDescent
Merge pull request !25702 from 张璇/ApplyGradientDescent
2021-11-24 11:48:05 +00:00
i-robot eef5f3f8d4 !22925 [assistant][ops] add operator Cummin
Merge pull request !22925 from yangwm/cummin
2021-11-24 11:46:23 +00:00
i-robot d251800c62 !26705 Add the check of function return None.
Merge pull request !26705 from Margaret_wangrui/return_none
2021-11-24 11:09:27 +00:00
lilei f8827a09fb modify parallel API note for master 2021-11-24 17:44:35 +08:00
huangbingjian 00f3fa99e2 add fallback testcases 2021-11-24 17:39:55 +08:00
dayschan 174e4ea3ce Bugfix in ConvertNonscalarTensorToParameter
In the last commit (015d7354c7), I deleted the check on whether different ValueNode have
same tensor value, but forgot the situation that several nodes use the same ValueNode,
in this case, the function will create several parameter for the same ValueNode, but all
ValueNode is replaced with the first parameter, and the remaining parameters are not used.
This will result in a "parameter has no user" error.
Use a std::set for the ValueNodes can resolve this problem.
2021-11-24 17:34:01 +08:00
Liu_Xuu 7176613889 [MSLITE] fix mem leak and bugs for diverse networks compatibility in tensorrt delegate 2021-11-24 17:28:23 +08:00
i-robot fa5ea7b3a6 !26370 DynamicRNNGrad support `hidden_size not multiple of 16` scene
Merge pull request !26370 from yuchaojie/ir_fusion4
2021-11-24 09:20:30 +00:00
Xiao Tianci a06f4a92c8 add chinese api 2021-11-24 17:17:44 +08:00
songyuanwei 8d212d4812 add loss landscape 2021-11-24 17:10:57 +08:00
i-robot 43fabaeef8 !26668 add api cn
Merge pull request !26668 from yingchen/code_docs_api6
2021-11-24 08:50:51 +00:00
i-robot 289ff6518b !26681 [MS][LITE] fix npu compatibility bugs and strided_slice-transpose decrease bugs
Merge pull request !26681 from XianglongZeng/myms_new
2021-11-24 08:44:58 +00:00
zhangxuetong 9ef0fa84bb support parameter cache and distribution predict 2021-11-24 16:38:46 +08:00
yingchen 6f7a104e7c update api cn 2021-11-24 16:34:48 +08:00
zhaosida 8759ac41ad fix err log 2021-11-24 16:34:29 +08:00
i-robot 6c0547106e !26697 make trace source lines more accurate
Merge pull request !26697 from yuchaojie/ir_fusion
2021-11-24 08:30:17 +00:00
limingqi107 79765ceb7f fix the bug of single thread execution stack overflow 2021-11-24 16:24:04 +08:00
i-robot 1b1401e73d !26577 update dataset chinese comment
Merge pull request !26577 from luoyang/code_docs_chinese
2021-11-24 08:21:16 +00:00
VectorSL 710289a72d add tensor array 2021-11-24 16:12:04 +08:00
i-robot cfc6ea32ff !24714 replace rtmemcpyxx to acl memcpy
Merge pull request !24714 from jjfeing/br_replace_rtmemcpyxx_with_acl_api
2021-11-24 07:47:43 +00:00
i-robot 3e141dd499 !26667 md5 master 1123
Merge pull request !26667 from mindspore_ding/md5_master_1115
2021-11-24 07:36:05 +00:00
Margaret_wangrui f9a384456a Add the check of function return None. 2021-11-24 15:30:35 +08:00
i-robot d461433648 !26723 full quant support skip node
Merge pull request !26723 from yeyunpeng2020/quant_bak2
2021-11-24 07:16:05 +00:00
Xiaoda Zhang 23387e154c using multiple operators to implement the sematic of a large transpose 2021-11-24 15:13:47 +08:00
wenbean 31053edbe4 Use Allocator and workspace pre allocat mem in GPU 2021-11-24 14:47:12 +08:00
i-robot 2cc51099b1 !26718 [ME][Fallback] Add some fallback numpy test cases
Merge pull request !26718 from Margaret_wangrui/fallback
2021-11-24 06:43:13 +00:00
i-robot dfc7d164b8 !25954 Add file storage module
Merge pull request !25954 from zyli2020/ps_cache_feature
2021-11-24 06:32:10 +00:00
i-robot b472850a75 !26594 Replace std::unordered_map/set with robin-hood-hashing
Merge pull request !26594 from hewei/use_robin_hood
2021-11-24 06:30:36 +00:00
i-robot 2785b59b43 !26615 modify api comments for parallel
Merge pull request !26615 from yangzhenzhang/modify-api-comments-for-parallel
2021-11-24 06:23:34 +00:00
zhengyuanhua c0332b4fc7 310 support image size 2021-11-24 14:15:43 +08:00
i-robot 30d182ac18 !26626 fix reduce ops axis multiple bug in GPU
Merge pull request !26626 from zhangbuxue/fix_reduce_ops_axis_multiple_bug_in_GPU
2021-11-24 06:15:10 +00:00
i-robot 519f14a909 !26006 slice recompute activation
Merge pull request !26006 from yao_yf/add_transformer_slice_activation_config
2021-11-24 06:14:28 +00:00
yeyunpeng2020 35e0526b51 full quant support skip node 2021-11-24 14:05:27 +08:00
zengxianglong 229b94e8a7 fix npu compatibility bugs and strided_slice-transpose decrease bugs 2021-11-24 11:45:09 +08:00
i-robot de8b6db07c !26585 add warnings for set_dump API, see I4AUIR for details
Merge pull request !26585 from wenkai/wk1120setdumpwarning1
2021-11-24 03:42:25 +00:00
wangshuide2020 399680a711 optimizes the error description of Transpose and get_broadcast_shape. 2021-11-24 11:35:28 +08:00