i-robot
a9cdbd5ae8
!32005 fix bugs of moe
...
Merge pull request !32005 from bichaoyang/master
2022-03-29 01:30:10 +00:00
larrygld
aeb0b2090b
[assistant][ops] Add array operators Triu
2022-03-28 21:25:29 +08:00
王超
0cf25e180f
[assistant] [ops] [I48O8K] add new array operator RightShift
2022-03-28 18:51:44 +08:00
i-robot
c2e0796bdb
!25339 [assistant][ops] Add new Tril
...
Merge pull request !25339 from 张凯磊/Tril
2022-03-28 09:26:13 +00:00
b00518648
93da6bab46
fix bugs of moe: only use a fewer dp in moe
2022-03-28 17:22:48 +08:00
i-robot
5c095e75fa
!32006 Fix exception when sending large messages
...
Merge pull request !32006 from chengang/fix_send_large_msg
2022-03-28 06:27:19 +00:00
Parallels
f660b3c58e
Add cluster topology state
2022-03-28 05:48:11 +00:00
i-robot
8add6a926c
!31925 modify interface of shard function
...
Merge pull request !31925 from wangjun/interface_modify_0325
2022-03-28 02:24:16 +00:00
i-robot
a15f682803
!31603 support str.format
...
Merge pull request !31603 from lianliguang/supported-str-format
2022-03-28 01:33:58 +00:00
lilei
452362332e
xmodify stridedslice for master
2022-03-28 09:24:13 +08:00
Parallels
7bcf3f7b03
Fix exception when sending large messages
2022-03-27 02:10:40 +00:00
Carry955
3d182380d8
[feat][assistant][I4862P] add new aicpu operator Tril
2022-03-26 19:36:29 +08:00
i-robot
5a4e25db92
!26226 [assistant][ops] Add NonDeterministicInts
...
Merge pull request !26226 from yyxhgg/NonDeterministicInts
2022-03-26 09:42:53 +00:00
i-robot
d66af6eb86
!26399 [assistant][ops] Add CropAndResizeGradBoxes
...
Merge pull request !26399 from wangyantao/CropAndResizeGradBoxes
2022-03-26 09:41:38 +00:00
lianliguang
6304c2ef3d
support str.format
2022-03-26 14:53:06 +08:00
lianliguang
424c16d116
fix abs(tensor) when tesnor data is a 0-dim number
2022-03-26 14:20:03 +08:00
wangjun
789539cbaa
modify interface name for shard
2022-03-26 09:10:14 +08:00
i-robot
3f18180606
!31742 [MD][Autotune] Change autotune config
...
Merge pull request !31742 from harshvardhangupta/at_config_suggestion
2022-03-25 21:00:12 +00:00
wangyantao
668cdb936f
[feat] [assistant] [I471CP] add new operator CropAndResizeGradBoxes
2022-03-26 00:30:34 +08:00
ZCX
0cb72a21f3
[feat] [assistant] [I471DI] add new Ascend operator NonDeterministicInts
2022-03-25 22:29:47 +08:00
harshvardhangupta
0086f4ee05
Modify the Dataset AutoTune config #31742
2022-03-25 09:03:38 -04:00
i-robot
40015e003f
!31896 fix isolated node error
...
Merge pull request !31896 from lianliguang/fix-isolated-node-bug
2022-03-25 11:52:21 +00:00
lizhenyu
812127940d
add ut for recovery
2022-03-25 16:58:08 +08:00
lianliguang
3d7c56c4cc
fix isolate node bug when using fallback interpret node
2022-03-25 16:41:16 +08:00
i-robot
c6da65ecfa
!31879 Training support recovery
...
Merge pull request !31879 from zyli2020/worker_failover_bp
2022-03-25 07:06:55 +00:00
i-robot
1af8b365de
!31880 [MD] add OBSMindDataset
...
Merge pull request !31880 from liyong126/add_obs_mindrecord_dataset
2022-03-25 06:50:58 +00:00
i-robot
8589335879
!31889 Add some ut pipeline test case for C++ dataset
...
Merge pull request !31889 from liucunwei/dataset-lcw-0324
2022-03-25 03:27:37 +00:00
i-robot
5f343bcd09
!31824 add waring suggestion log and map op error check
...
Merge pull request !31824 from guozhijian/fix_more_info
2022-03-25 03:24:07 +00:00
lizhenyu
275f81c47b
failover support exit process when timeout occur
2022-03-25 11:16:04 +08:00
liyong
419109cf0a
add OBSMindDataset
2022-03-25 11:03:53 +08:00
liucunwei
03b32e4eab
Add some ut pipeline test case for C++ dataset
2022-03-25 10:25:19 +08:00
i-robot
3c83d3f600
!31668 [Auto-Par] [D-Rec] Modify the name of the functions and variables of Parameter shared User strategy treatment & Add new treatment case of MatMul/Gather
...
Merge pull request !31668 from 王浩然/D-Rec-deliver
2022-03-25 02:11:41 +00:00
jonyguo
317c48a933
add log warning and op error check
2022-03-25 09:26:29 +08:00
7347157+joylvliang@user.noreply.gitee.com
39e70e8519
fix_bug_of_tensor_copy_canot_work_in_int64_with_D
2022-03-24 20:47:42 +08:00
i-robot
0b6f330d7e
!31666 [MS][LITE] new core ops api and lite adapter new api
...
Merge pull request !31666 from luoyuan/core2
2022-03-24 08:50:40 +00:00
i-robot
aca631a73b
!31714 [MD] fix GIL acquire issue
...
Merge pull request !31714 from liyong126/fix_two_level_pipeline
2022-03-24 08:31:12 +00:00
yanghaoran
e0ee78497d
takedown tcptest.sendlargemessages to ensure gate stability
2022-03-24 15:20:30 +08:00
haoran.wang
fdfbe2dedc
Modify the name of the funtions and variables of Parameter shared User strategy treatment
2022-03-24 06:16:55 +00:00
luoyuan
3e6e4bfdd3
add core ops api and adapter new mindapi
2022-03-24 10:47:40 +08:00
liyong
bff0fbc76f
fix GIL acquire Misuse
2022-03-24 08:45:30 +08:00
i-robot
4110c3d6d6
!31646 [dataset][bugfix] 修复config 接口typeerror的问题
...
Merge pull request !31646 from xiefangqi/md_fix_interface_comments
2022-03-23 02:45:01 +00:00
i-robot
f59de99ed6
!31654 [ME]Support raise in constant scenarios.
...
Merge pull request !31654 from Margaret_wangrui/raise
2022-03-23 02:28:37 +00:00
Margaret_wangrui
6d10adc9c5
[ME]Support raise in constant scenarios.
2022-03-22 19:33:47 +08:00
i-robot
a50223f93a
!31680 [assistant][Fade] bug fix
...
Merge pull request !31680 from QingfengLi/fade
2022-03-22 07:58:59 +00:00
i-robot
ecde0749bb
!31576 dynamic shape CPU & GPU
...
Merge pull request !31576 from TuDouNi/dynamic_shape_gpu_cpu_2
2022-03-22 07:27:50 +00:00
i-robot
32ecb8cbea
!31664 fix grpc not found in gpu serving
...
Merge pull request !31664 from zhoufeng/xiu-ba-ge
2022-03-22 06:10:24 +00:00
i-robot
200a77c3d1
!31393 fix csr/coo docs and checks
...
Merge pull request !31393 from wangrao124/pr_0316
2022-03-22 05:15:38 +00:00
ttudu
adb18b82ca
dynamic shape CPU & GPU
2022-03-22 13:03:31 +08:00
chenx2ovo
c50a4611ec
[fix][assistant][I3CEGF] fix can not compute bool tensor
2022-03-22 10:20:01 +08:00
xiefangqi
f81c7967ec
fix config interface comments
2022-03-22 10:04:12 +08:00
i-robot
3b24d825bf
!31578 Fix exception for sending big message using TCP client.
...
Merge pull request !31578 from chengang/fix_rpc_big_msg
2022-03-22 01:27:29 +00:00
i-robot
9bfa6ebc58
!31611 code check clean
...
Merge pull request !31611 from yuchaojie/code-clean
2022-03-22 01:20:44 +00:00
zhoufeng
55f6d075e8
fix grpc not found in serving
...
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
2022-03-21 22:06:46 +08:00
i-robot
264bed91a4
!31594 Add ps execution mode implementation
...
Merge pull request !31594 from ZPaC/add-dist-execution-mode
2022-03-21 13:07:24 +00:00
i-robot
4bb2fbdb5f
!30695 Support third-party modules
...
Merge pull request !30695 from huangbingjian/import_module
2022-03-21 08:36:39 +00:00
yuchaojie
42818b17f6
code check clean
2022-03-21 16:32:22 +08:00
i-robot
4dd3a94d27
!31571 clean code
...
Merge pull request !31571 from kisnwang/clean_code
2022-03-21 08:31:38 +00:00
ZPaC
506989abb8
Add ps execution mode implementation
2022-03-21 14:35:25 +08:00
huangbingjian
a62484c5ad
Added recognition of third-party libraries
2022-03-21 14:10:23 +08:00
i-robot
91d9585df7
!31346 add vmap ut and st
...
Merge pull request !31346 from Erpim/vmap_v12
2022-03-21 03:49:08 +00:00
kswang
70b85a16cb
clean code
2022-03-21 09:54:06 +08:00
Erpim
0e7e1f8400
add vmap st and ut
2022-03-21 09:38:38 +08:00
i-robot
af1f1b346d
!31509 [MD] fix mindrecord commit issue
...
Merge pull request !31509 from liyong126/fix_mindrecord_issue
2022-03-20 11:16:31 +00:00
i-robot
048d089f9a
!31520 test ut
...
Merge pull request !31520 from gaoyong10/dynamic_shape_01
2022-03-20 04:24:23 +00:00
Parallels
d3bcc66d15
Fix sending large message using rpc.
2022-03-20 10:42:52 +08:00
gaoyong10
89c98f8b73
Add ut for dynamic shape and control flow.
2022-03-19 19:48:44 +08:00
zhoufeng
da11e0f150
change debug.a to object
...
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
2022-03-19 15:39:49 +08:00
liyong
00371d523f
fix mindrecord issue
2022-03-19 12:04:27 +08:00
i-robot
dc5f77f585
!31298 build libmindspore_backend.so
...
Merge pull request !31298 from zhoufeng/backend-frontend-decoupling
2022-03-18 09:41:51 +00:00
wangrao124
5e5415f617
fix csr/coo
2022-03-18 16:46:38 +08:00
i-robot
c0c5b8c143
!31480 Reduce UT CPP profiler tests time
...
Merge pull request !31480 from zetongzhao/reduce_profiler_test_time
2022-03-17 19:10:57 +00:00
zetongzhao
694ede05cf
change set_dataset
2022-03-17 13:13:01 -04:00
i-robot
d37ee6e729
!31173 Stream Assign Refactor
...
Merge pull request !31173 from jiaorui/refactor-stream
2022-03-17 11:39:11 +00:00
zhoufeng
b7bb53ff73
build libmindspore_backend.so
...
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
2022-03-17 19:14:45 +08:00
i-robot
2e516c735b
!28640 [assistant] [ops] Add the FillV2 operator
...
Merge pull request !28640 from 爱科学的小松鼠/OpFill
2022-03-17 09:48:14 +00:00
i-robot
c2a50f81df
!31398 fix code check for minddata
...
Merge pull request !31398 from guozhijian/fix_code_check
2022-03-17 06:49:43 +00:00
hu-daiwang
3b093696e7
[feat] [assistant] [I48O6L] add the fillV2 operator
2022-03-17 11:19:59 +08:00
jiaorui
91bd673538
refactor
2022-03-17 10:50:12 +08:00
i-robot
e6dadfd999
!31159 Add st and ut testcase for pass tuple_transform
...
Merge pull request !31159 from chenfei_mindspore/master-develop
2022-03-17 02:01:44 +00:00
i-robot
c3d15a079e
!31325 Training supports failover
...
Merge pull request !31325 from zyli2020/worker_failover_bp
2022-03-17 01:37:30 +00:00
jonyguo
11079eabbb
fix: code check
2022-03-17 08:23:12 +08:00
i-robot
8e1f6d6b9b
!31268 Move UT CPP Helper Functions
...
Merge pull request !31268 from zetongzhao/ut_cpp_helper
2022-03-16 18:20:24 +00:00
i-robot
1d6834ce4d
!31334 [MD][Autotune] Modify the generated autotune config to improve readability
...
Merge pull request !31334 from harshvardhangupta/modify_at_config
2022-03-16 18:18:57 +00:00
lizhenyu
545707b80c
Training supports failover
2022-03-16 23:34:29 +08:00
zetongzhao
ab5f79f522
move common helper function
2022-03-16 10:30:43 -04:00
chenfei
3872e29c3b
add ut for tuple trans
2022-03-16 21:33:54 +08:00
i-robot
88691d054e
!31321 Convert AbstractList to AbstractTuple recursively
...
Merge pull request !31321 from hewei/fix_opt_perf
2022-03-16 07:09:52 +00:00
i-robot
c2212f88b4
!31164 Fix the global norm missing insert allreduce
...
Merge pull request !31164 from huangxinjing/fx_global_norm_error
2022-03-16 06:46:13 +00:00
i-robot
72ea797e4d
!30339 [BUG][MD][FUNC]RandomInvert
...
Merge pull request !30339 from yangwm/randominvert
2022-03-16 06:39:53 +00:00
i-robot
8e8391dadb
!30429 BUG][MD][FUNC]RandomAutoContrast
...
Merge pull request !30429 from yangwm/autocontrast
2022-03-16 06:38:54 +00:00
He Wei
51c147ab12
Convert AbstractList to AbstractTuple recursively
2022-03-16 09:59:27 +08:00
i-robot
bf03f0e030
!31252 Implementation of element wise parallel ops
...
Merge pull request !31252 from liuluobin/element_wise_ops
2022-03-16 01:32:29 +00:00
i-robot
5f75c701d5
!31332 Fix bug in terminating a failed pipeline
...
Merge pull request !31332 from h.farahat/fix_reset
2022-03-15 20:17:05 +00:00
i-robot
272d25b291
!31137 Test SkipFirstEpochSampler
...
Merge pull request !31137 from zetongzhao/test_skip_first_epoch_sampler
2022-03-15 18:26:28 +00:00
harshvardhangupta
accc32da6c
modify the at config file
2022-03-15 13:49:00 -04:00
i-robot
12d8906e2e
!30253 Support more ops for dump flag
...
Merge pull request !30253 from huanghui/enhance-dump-flag
2022-03-15 16:49:11 +00:00
hesham
3d16a4dee2
Fix bug in terminating a failed pipeline
2022-03-15 11:35:28 -04:00
i-robot
059c939854
!31269 Fix ut debug mode compile
...
Merge pull request !31269 from qujianwei/master
2022-03-15 12:35:38 +00:00
liuluobin
f13d342986
Implementation of element wise parallel ops
2022-03-15 19:24:17 +08:00
i-robot
313a247ae8
!31178 upgrade Ascend package 11 Mar 22
...
Merge pull request !31178 from shenwei41/upgrade_ascend_20220311
2022-03-15 11:08:32 +00:00
i-robot
3be552ca39
!31156 test centos ut
...
Merge pull request !31156 from luoyang/test_centos
2022-03-15 09:38:15 +00:00
shenwei41
f1fde6a262
upgrade Ascend package 11 Mar 22
2022-03-15 16:31:27 +08:00
huangxinjing
31f55b6525
1. The main gol: Fix mixing inserting the AllReduce when where is no mirror appeared
...
2. remove pattern match error as the origin pattern match will find no operator if there is only one parameter
2022-03-15 15:54:06 +08:00
qujianwei
d5eb5090d9
fix ut compile in debug mode
2022-03-15 11:04:37 +08:00
i-robot
01078763df
!31227 don't handle reshape when using adafactor optimizer in auto parallel
...
Merge pull request !31227 from yangzhenzhang/adafactor-parallel-skip-handle-reshape
2022-03-15 02:47:31 +00:00
huanghui
4482acf586
Support API set_dump for more type of ops
2022-03-15 10:24:15 +08:00
i-robot
d4fc47ef0c
!31211 Clean iterators before forking
...
Merge pull request !31211 from h.farahat/pytest_problem
2022-03-15 01:47:28 +00:00
i-robot
4b15a46a49
!30858 Refactor cpu kernel factory.
...
Merge pull request !30858 from TronZhang/refactor_cpu_factory
2022-03-15 01:33:14 +00:00
yangzhenzhang
1f98ffb79c
adafactor parallel skip handle reshape
2022-03-15 09:18:19 +08:00
i-robot
6be7c42052
!31191 [MD] more tests for failover reset
...
Merge pull request !31191 from Mohammad Motallebi/failover_more_tests
2022-03-14 21:50:37 +00:00
zetongzhao
9d4c8e29f9
add test for skip first epoch sampler
2022-03-14 15:01:08 -04:00
h.farahat
318e6b8803
fix segfault
2022-03-14 14:55:21 -04:00
mohammad
583eb32723
[MD] Add more UT and a ST for failover reset
2022-03-14 13:52:09 -04:00
luoyang
74d8969b91
ut failed with pillow>=9.0.0, need more validation
2022-03-14 23:58:09 +08:00
i-robot
a33611ce27
!31192 Support sync and async message sending mode for TCP client
...
Merge pull request !31192 from chengang/add_send_sync
2022-03-14 07:55:49 +00:00
i-robot
dcb5cd670c
!30953 Dynamic Weight Decay
...
Merge pull request !30953 from wanyiming/dynamic_wd
2022-03-14 06:03:24 +00:00
i-robot
c2c02b094a
!31204 modify virtualdataset bug for master
...
Merge pull request !31204 from lilei/modify_virtualdataset_for_master
2022-03-14 03:33:25 +00:00
TronZhang
db2931516e
refactor cpu factory
2022-03-14 10:05:56 +08:00
i-robot
c62112ba89
!30938 mindspore support compile with gcc-9 and gcc-8
...
Merge pull request !30938 from xulei/ms_gcc-9
2022-03-14 01:12:25 +00:00
lilei
690c58ebcf
modify virtualdataset bug for master
2022-03-13 10:49:15 +08:00
Parallels
6f31f270e9
support sync and async message sending mode for tcp client
2022-03-12 21:14:35 +08:00
i-robot
8b7d7a1e1a
!31141 [MD][Offload] Offload RandomSharpness Op Pure Sharpness Support & UT
...
Merge pull request !31141 from alashkari/fix-random-sharpness
2022-03-11 21:06:13 +00:00
Rescue
face5e51a0
Added sharpness without randomness support and sharpnes op ut.
...
fix pad issue
2022-03-11 17:30:49 +00:00
i-robot
603ff25afb
!31177 Add ut for fallback python buildin function with tensor input
...
Merge pull request !31177 from LiangZhibo/syntax
2022-03-11 08:38:53 +00:00
i-robot
3c0e34ada0
!30855 Support user-defined classes by ms_class decorators
...
Merge pull request !30855 from huangbingjian/ms_class_dev
2022-03-11 08:37:12 +00:00
xulei
bb10249d49
support gcc-8 and gcc-9
2022-03-11 16:12:41 +08:00
liangzhibo
a3f183bdad
Add test case for fallback buildin function with constant tensor
2022-03-11 15:35:29 +08:00
i-robot
f3b753807b
!30929 Add AkgKernel for lite_adapter
...
Merge pull request !30929 from DeshiChen/0307_akgkernel
2022-03-11 07:01:10 +00:00
i-robot
3375003da9
!31158 Add registration and heartbeat message from compute graph node to meta server node
...
Merge pull request !31158 from chengang/add_pb_to_node
2022-03-11 06:55:03 +00:00
i-robot
c2a5cc1486
!31040 Produce parallel operators for Argmin/max, SquareSumAll and UnsortedSegmentProd
...
Merge pull request !31040 from Bert0108/reduce_operators_arg
2022-03-11 06:15:56 +00:00
shenyu
443da2280d
[FIX][BUG][MD][FUNC] RandomInvert
2022-03-11 11:53:04 +08:00
Parallels
5fe6d42fe1
Add registration and heartbeat message from compute graph node to meta server node
2022-03-11 11:42:08 +08:00
shenyu
c14a8084c7
[FIX][BUG][MD][FUNC] RandomAuoContrast
2022-03-11 10:29:49 +08:00
i-robot
a064d0855b
!30728 add ci test for unify backend
...
Merge pull request !30728 from xiaoyao/master
2022-03-11 01:50:03 +00:00
i-robot
eecf6db426
!31121 [MD] revert OBSMindDataset
...
Merge pull request !31121 from liyong126/revert_OBSMindDataset
2022-03-11 01:21:10 +00:00
i-robot
216e7c6a92
!31041 add check for conv2d
...
Merge pull request !31041 from yangzhenzhang/add-check-for-conv2d
2022-03-11 01:20:52 +00:00
i-robot
67d10ce3be
!31066 Fix a bug where ROIAlign and CropAndResize distributed op do not support GPU
...
Merge pull request !31066 from liuluobin/fix_roialign
2022-03-11 01:16:46 +00:00
huangbingjian
4f7c9cb27f
Support user-defined classes through ms_class decorators.
2022-03-10 20:12:54 +08:00
i-robot
99910ef27f
!31088 convert C++ data to python when run fallback function
...
Merge pull request !31088 from lianliguang/add-fallback-builtin-function
2022-03-10 11:56:01 +00:00
yangzhenzhang
c00d29f223
rebase
2022-03-10 19:17:06 +08:00
liyong
50fc6f5736
Revert "!30810 [MD] add OBSMindDataset "
...
This reverts commit 9fcab9184e
, reversing
changes made to afb33fff62
.
2022-03-10 19:10:11 +08:00
dayschan
601fb25cb7
Add AkgKernel for lite_adapter
...
Implements a temporary solution of InferShape for GraphKernel ops by
saving shape/type/format in custom attributes and decode them in InferShape function.
2022-03-10 18:19:28 +08:00
liuluobin
8f045d02e3
Fix a bug where ROIAlign and CropAndResize distributed op do not support GPU
2022-03-10 17:31:36 +08:00
lianliguang
d9390afad1
convert MS C++ dtype to MS python
2022-03-10 16:09:34 +08:00
xiao_yao1994
94df0869ad
uni_backend: add ut ge test
2022-03-10 16:08:36 +08:00
i-robot
872cb74d3f
!30982 Vocab C++ Interface Alignment and SentencePieceVocab C++ interface alignment and Python interface refactoring
...
Merge pull request !30982 from 刘勇琪/master-vocab-sentencepiecevocab
2022-03-10 07:58:41 +00:00
i-robot
9fcab9184e
!30810 [MD] add OBSMindDataset
...
Merge pull request !30810 from liyong126/add_obs_mindrecord_dataset
2022-03-10 07:28:46 +00:00
Bert0108
bfc5e4345c
add distributed operators for argmax/min sqauresumall and unsortedsetmentprod
2022-03-10 14:44:37 +08:00
liu-yongqi-63
457ee03082
Vocab C++ Interface Alignment and SentencePieceVocab C++ interface alignment and Python interface refactoring
2022-03-10 11:33:00 +08:00
i-robot
662a7f8408
!31044 add cpp ut for combine_like_graphs
...
Merge pull request !31044 from huanghui/add-ut
2022-03-10 03:28:56 +00:00
liyong
10ada68fa9
add OBSMindDataset
2022-03-10 11:18:59 +08:00
i-robot
059ac45ae8
!30920 Call Draw function directly
...
Merge pull request !30920 from huanghui/fix-draw
2022-03-10 03:15:56 +00:00
wanyiming
a124ec4de7
add dynamic_decay
2022-03-10 11:02:27 +08:00
huanghui
1077601e26
add cpp ut for jit_config
2022-03-10 10:56:18 +08:00
i-robot
b32daf3cb4
!31000 add fallback example of function
...
Merge pull request !31000 from lianliguang/add-fallback-builtin-function
2022-03-10 01:57:37 +00:00
i-robot
3f1a7f45bb
!30942 Add hwc_to_chw operation on lite
...
Merge pull request !30942 from shenwei41/hwc_to_chw
2022-03-10 01:21:54 +00:00
i-robot
f57f862d7b
!30782 [MD] pushdown skip node in MD pipeline
...
Merge pull request !30782 from Mohammad Motallebi/failover_pushdown_skipop
2022-03-10 01:12:46 +00:00
mohammad
260cebf650
[MD] skip node pushdown optimization pass for Reset
2022-03-09 09:52:25 -05:00
shenwei41
7f7bae8bfe
add hwc_to_chw operation on lite
2022-03-09 15:43:13 +08:00
i-robot
698b99aaa3
!18945 [assistant][ops]New operator implementation, include OmniglotDataset
...
Merge pull request !18945 from Wangsong95/omniglot_dataset
2022-03-09 06:51:15 +00:00
lianliguang
f3b1e3291c
add jit fallback example
2022-03-09 11:25:03 +08:00
i-robot
cfda27a6c7
!30985 Add TCP communication module into compute graph nodes and meta server node
...
Merge pull request !30985 from chengang/topology_2
2022-03-09 01:51:23 +00:00
Parallels
e2125108b8
Add tcp communication into compute graph nodes and meta server node
2022-03-09 00:02:25 +08:00
qkeys
ce7597f500
[feat][assistant][I3J6VB] add new data operator OmniglotDataset
2022-03-08 21:36:22 +08:00
i-robot
0b207fce31
!29735 unique support multi_dim tensor
...
Merge pull request !29735 from jiangzhenguang/unique_multi_tensor
2022-03-08 08:30:57 +00:00
i-robot
63d7254bd3
!30952 add encryption to lite
...
Merge pull request !30952 from yeyunpeng2020/cryption
2022-03-08 07:30:41 +00:00
i-robot
f73987dcf2
!30949 Change the Parameter type name from 'ParameterTensor' to 'Parameter'
...
Merge pull request !30949 from 张清华/opt
2022-03-08 06:17:31 +00:00
jiangzhenguang
28f89f36b4
unique support multi-dim tensor
2022-03-08 14:09:19 +08:00
Zhang Qinghua
973008ebcf
Change the Parameter type name from 'ParameterTensor' to 'Parameter'
2022-03-08 11:23:56 +08:00
i-robot
82bc2b722d
!30417 add history and lambda callbacks
...
Merge pull request !30417 from liutongtong9/add_callbacks
2022-03-08 02:17:46 +00:00
i-robot
75e1939f50
!30899 Enable python buildin function type for graph fallback and add some ut for python buildin function for graph fallback
...
Merge pull request !30899 from LiangZhibo/syntax
2022-03-08 01:35:47 +00:00
i-robot
64544b142a
!30340 develop maxpool3d, avgpool3d and conv3d on CPU(including farward and backward)
...
Merge pull request !30340 from zhangbuxue/develop_conv3d_maxpool3d_and_avgpool3d_on_cpu
2022-03-08 01:31:54 +00:00
yeyunpeng2020
f670a635f0
add encryption to lite
2022-03-08 09:15:10 +08:00
i-robot
2655d64720
!30865 [MD] Modify Batch number of workers with Python multiprocessing
...
Merge pull request !30865 from cathwong/ckw_batch_workers_pymultip
2022-03-07 18:21:16 +00:00
Cathy Wong
bc29dd9d72
[MD] Modify Batch number of workers with Python multiprocessing
2022-03-07 10:39:05 -05:00
i-robot
335ef1c270
!30459 Add ut validate function for parallel
...
Merge pull request !30459 from liuluobin/ut_master
2022-03-07 15:23:14 +00:00
buxue
50f3ad00ab
develop maxpool3d, avgpool3d and conv3d on CPU(including forward and backward)
2022-03-07 20:44:38 +08:00
i-robot
4a9a65d2a6
!30919 Produce parallel operators for ReduceAll and ReduceProd
...
Merge pull request !30919 from Bert0108/reduce_operators_merge
2022-03-07 11:30:37 +00:00
liuluobin
b797a410cc
Add validate function for parallel ut
2022-03-07 16:19:52 +08:00
huanghui
ce0f500a3f
convert draw_ to call Draw function derectly
2022-03-07 16:12:52 +08:00
Bert0108
dfc92f1791
add distributed parallel operators for reduceall and reduceprod
2022-03-07 15:40:11 +08:00
liangzhibo
1a81031d92
Add testcases for fallback python buildin function
2022-03-07 14:20:24 +08:00
lianliguang
55e7f2d761
modify DynamicShape to TensorShape
2022-03-07 14:19:12 +08:00
i-robot
960d551b8f
!30889 parse core type from pre-build for tbe kernel compile
...
Merge pull request !30889 from liubuyu/master
2022-03-07 03:53:48 +00:00
i-robot
a2303a92bd
!30741 [dataset][dfx] 易用性-Python多进程统一整改专项
...
Merge pull request !30741 from xiefangqi/md_add_multiprocessing_optimization
2022-03-07 01:29:21 +00:00
i-robot
c4cc0e6f26
!30742 auto parallel support not ony power of 2 device numbers
...
Merge pull request !30742 from yao_yf/auto_parallel_support_not_only_2_power
2022-03-07 01:09:01 +00:00
yao_yf
b60e54e0d5
support not only power of 2
2022-03-06 18:08:46 +08:00
lby
2b1b539b36
parse core type from pre-build for tbe kernel compile
2022-03-06 15:23:50 +08:00
xiefangqi
2de6c8908b
add a clean up subprocess to clean subprocess when main process exit
2022-03-05 17:24:11 +08:00
liutongtong
7afcdfd211
add history and lambda callbacks
2022-03-04 17:24:57 +08:00
i-robot
fdf7aebd78
!30746 Fix recv actor stuck issue.
...
Merge pull request !30746 from ZPaC/sync-route-table
2022-03-04 03:01:03 +00:00
i-robot
b90cf43562
!30553 Support dataset reset() to recover after failure
...
Merge pull request !30553 from h.farahat/reset
2022-03-04 02:19:45 +00:00
i-robot
f01d841113
!27943 [MD][Autotune] Save/Load Autotune config
...
Merge pull request !27943 from harshvardhangupta/save_load_at_config
2022-03-03 20:42:23 +00:00
i-robot
7b26a32e98
!22608 [assistant][ops]New operator implementation, include KITTIDataset
...
Merge pull request !22608 from Wangsong95/kitti_dataset
2022-03-03 07:05:40 +00:00
h.farahat
a3dec34833
Dataset failover reset
2022-03-02 23:53:56 -05:00
i-robot
1eb2975ac2
!30753 Support random port for TCP server
...
Merge pull request !30753 from chengang/support_random_port
2022-03-03 02:08:28 +00:00
harshvardhangupta
bd61adbb72
Implement save/load of autotune dataset pipeline configuration #27943
2022-03-02 15:08:08 -05:00
i-robot
70e61e9014
!30781 [MD] test_pyfunc_multiprocess_autotune.py - enable AutoTune
...
Merge pull request !30781 from cathwong/ckw_ut_fixup_map_python_multiproc
2022-03-02 19:50:29 +00:00
i-robot
edcc6b790d
!30714 [MD][Offload] Add TypeCast op to offload
...
Merge pull request !30714 from markuskunej/offload_typecast
2022-03-02 19:48:58 +00:00
Cathy Wong
eb931c7f88
[MD] test_pyfunc_multiprocess_autotune.py - enable AutoTune
2022-03-02 13:25:35 -05:00
i-robot
4aa82cc21e
!30679 [MD] Add tests for Python Multiprocessing with AutoTune
...
Merge pull request !30679 from cathwong/ckw_ut_map_python_multiproc
2022-03-02 14:26:09 +00:00
Parallels
5e09380eb3
add random port support for tcp server
2022-03-02 21:27:33 +08:00
zx
93617ce91e
[feat][assistant][I3J6VO] add new data operator KITTI
2022-03-02 19:29:17 +08:00
ZPaC
4c74b600ea
Fix recv actor stuck issue.
2022-03-02 19:26:45 +08:00
i-robot
6bc31df410
!30672 delete ut test libgraph and liberror_manager dependency
...
Merge pull request !30672 from qujianwei/master
2022-03-02 02:18:24 +00:00
Cathy Wong
b151f90b8d
[MD] Add tests for Python Multiprocessing with AutoTune
2022-03-01 20:20:59 -05:00
markuskunej
6de37045ff
Added TypeCast op in dataset offload.
2022-03-01 21:55:27 +00:00
qujianwei
7e1aebc2c9
delete ut libgraph.so and liberror_manager.so dependency
2022-03-01 20:50:45 +08:00
wangjun
46612fabfb
add st for shard
2022-03-01 15:32:37 +08:00
i-robot
ad9757ccf0
!30661 [Auto parallel] [MoE] Fix an error of configuring MoE parallel
...
Merge pull request !30661 from Xiaoda/124-moe-changes
2022-03-01 02:26:37 +00:00
i-robot
4367377000
!30611 Fix rpc route bugs
...
Merge pull request !30611 from ZPaC/sync-route-table
2022-02-28 15:50:35 +00:00
ZPaC
dc80f9b8d1
Fix rpc route bugs
2022-02-28 19:18:42 +08:00
i-robot
789c1d6bd3
!30614 [AutoParallel] Fix Overflow As the cast is inserted before Mirror
...
Merge pull request !30614 from huangxinjing/fix_cast_error
2022-02-28 09:04:46 +00:00
Xiaoda Zhang
81e5abe580
fix an error of configuring parallel
2022-02-28 17:03:53 +08:00
i-robot
a92c54b206
!30496 [Fallback] Suppport scipy module.
...
Merge pull request !30496 from huangbingjian/support_scipy
2022-02-28 08:33:34 +00:00
i-robot
a8686ae3d9
!18827 [assistant][ops]New operator implementation, include LFWDataset
...
Merge pull request !18827 from Wangsong95/lfw_dataset
2022-02-28 08:31:11 +00:00
zx
2132f62d98
[feat][assistant][I3J6VQ] add new data operator LFW
2022-02-28 13:43:05 +08:00
huangxinjing
896daee845
[AutoParallel]Fix insert error for the mirror
2022-02-26 17:54:42 +08:00
i-robot
330cacf905
!30573 Catch abnormals from sto functions in debug
...
Merge pull request !30573 from maning202007/master
2022-02-26 08:52:22 +00:00
i-robot
de95400c52
!30587 check platform for resize bilinear
...
Merge pull request !30587 from yangzhenzhang/check-platform-for-resize-bilinear
2022-02-26 06:46:44 +00:00
i-robot
909183a2ef
!30317 extract common as an independent shared library
...
Merge pull request !30317 from zhoufeng/backend-compile-independently
2022-02-26 06:08:17 +00:00
i-robot
17accc12cf
!30238 [GraphKernel] replace nodes in lite and move const_input_to_attr pass.
...
Merge pull request !30238 from chenlei_autodiff/lite_callback
2022-02-26 03:56:57 +00:00
maning202007
9328041de4
Catch abnormals from sto functions in debug
...
Fix type errors
Fix bugs
Fix cycle depth
2022-02-26 11:03:52 +08:00
zhoufeng
f49b195c39
extract common as an independent shared library
...
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
2022-02-26 10:55:51 +08:00
yangzhenzhang
43e6e16da3
check platform for resizebilinear
2022-02-26 09:11:32 +08:00
i-robot
5deccfe64b
!30260 [MD][AutoTune] Re-enable AT for non-sink models
...
Merge pull request !30260 from danishfarid/re-enable_AT
2022-02-25 22:26:14 +00:00
i-robot
c4843c4085
!29958 upgrade ascend 20220211
...
Merge pull request !29958 from shenwei41/upgrade_ascend_20220211
2022-02-25 14:55:12 +00:00
shenwei41
ff75314719
upgrade ascend 20220211
2022-02-25 21:19:45 +08:00
chenlei_autodiff
9fabf8ae0d
[GraphKernel] replace graph kernel node with custom in lite.
2022-02-25 17:09:24 +08:00
i-robot
0341d96dd6
!30469 add shard function to support part of the graph executed in auto_parallel under pynative mode
...
Merge pull request !30469 from wangjun/0223_pp
2022-02-25 06:52:24 +00:00
huangbingjian
a69d13bc44
[Fallback] Suppport scipy module.
2022-02-25 10:13:43 +08:00
i-robot
cfe0f76d2b
!30491 ut for allgather fusion
...
Merge pull request !30491 from jiahongQian/master
2022-02-25 01:30:24 +00:00
danishfarid
6c4697fc8b
reenable AT for non-sink
...
tests re-enable
code check fix 1
remove self
lint fixing
lint fix 2
remove test as per req
2022-02-25 00:02:58 +00:00
wangjun
24d448239c
add pynative_parallel
2022-02-24 21:07:51 +08:00
i-robot
ac1463a192
!30298 Package the cuda operators as a dynamic link library
...
Merge pull request !30298 from jinjiali-kali/cuda_ops
2022-02-24 11:36:20 +00:00
i-robot
981eae461a
!30118 自动优化器并行特性
...
Merge pull request !30118 from zhuyuxiao/I4S85V
2022-02-24 11:09:44 +00:00
jiahongQian
25f57505bf
ut for allgather fusion
2022-02-24 17:40:32 +08:00
i-robot
bbcfbce9e0
!29997 [Auto parallel] [MoE] Support data_parallel + expert_parallel
...
Merge pull request !29997 from Xiaoda/124-moe-changes
2022-02-24 09:23:47 +00:00
zhuyuxiao
d0e0e305d3
good
2022-02-24 16:10:18 +08:00
i-robot
f2130e7434
!30483 [AutoParallel]Pipeline Automatic detection Opt
...
Merge pull request !30483 from lichen/pipeline_opt_detection
2022-02-24 03:40:15 +00:00
yao_yf
e21f878e14
adasum ut fix
2022-02-24 09:31:04 +08:00
Xiaoda Zhang
b714451937
implementing expert_parallel+data_parallel in MoE:
...
1) extending _Linear's input as 4-dimension tensor: [outer_batch, expert_dim, -1, hidden], and _Liner's BatchMatMul becomes BatchMatMul(4_dim_tensor, 3_dim_tensor);
2) configuring the _Linear's BatchMatMul sharding strategy as [[dp, ep, 1, 1], [ep, 1, mp]];
3) introducing a new parameter 'expert_parallel' in TransformerOpParallelConfig, creating a new class MoEParallelConfig to include 'data_parallel', 'model_parallel' and 'expert_parallel';
4) changing parallel config for FeedForward, TransformerEncoderLayer, TransformerDecoderLayer.
2022-02-24 09:24:27 +08:00
wangshengnan12@huawei.com
acbefd80ea
pipeline_opt_detection
2022-02-24 09:23:17 +08:00
i-robot
2a00ffd3b1
!30259 [MD]Update set_autotune_enable API to add save filepath
...
Merge pull request !30259 from cathwong/ckw_at_save_api
2022-02-24 01:00:59 +00:00
Cathy Wong
46e223e569
[MD] Update set_autotune_enable API to add save filepath
2022-02-23 14:50:55 -05:00
i-robot
81260a2319
!30466 takedown test_auto_parallel_adasum.py to ensure stability, again
...
Merge pull request !30466 from yanghaoran/master
2022-02-23 09:15:58 +00:00
i-robot
14393503b7
!30431 allreduce allgather fusion
...
Merge pull request !30431 from jiahongQian/master
2022-02-23 08:52:40 +00:00
yanghaoran
71d6b7d506
takedown test_auto_parallel_adasum.py to ensure stability, again
2022-02-23 16:42:13 +08:00
i-robot
2e8eac8341
!30367 auto_parallel_adasum_support_data_parallel
...
Merge pull request !30367 from yao_yf/auto_parallel_adasum_support_data_parallel
2022-02-23 06:50:26 +00:00
jiahongQian
8a2151d8bb
allgather reducescatter fusion
2022-02-23 10:47:30 +08:00