i-robot
f785fabcf2
!66157 add feature dvm
...
Merge pull request !66157 from looop5/feature_dvm_3_7
2024-03-08 12:07:00 +00:00
zhunaipan
e1b550ce8b
Revert 'Pull Request !65980 : pijit one stage'
2024-03-08 02:38:04 +00:00
looop5
eb7b129656
add feature dvm
...
fix conflict
2024-03-07 22:35:05 +08:00
i-robot
447e604edf
!65121 No graph Automatic differentiation
...
Merge pull request !65121 from luochao60/pyboost-dev
2024-03-07 13:58:18 +00:00
i-robot
f97bc69373
!65980 pijit one stage
...
Merge pull request !65980 from r1chardf1d0/r2.3
2024-03-07 13:20:02 +00:00
luochao
21a422c2fe
fix bug
2024-03-07 12:07:23 +08:00
r1chardf1d0
4cc6add445
clean code
2024-03-07 11:00:39 +08:00
dayschan
2530f311c8
AKG-MLIR: fuse dynamic-shape operators in graphkernel
...
* add SymbolEngine to frontend for dynamic shape nodes. save the symbolic_shape and symbolic_value to Abstract.
* add symbol_ops_impl to core/ops, which implements the functions for inferring symbolic shape and/or symbolic value.
* use "enable_dynamic_shape_fusion" to fuse dynamic-shape nodes in graphkernel.
* add dynamic_akg_cpu_kernel_mod and dynamic_akg_gpu_kernel_mod for akg-mlir kernels.
* use symbol_engine_jit to optimize the infershape of dynamic akg kernels.
* optimize the frontend graph with symbol engine, in the pass "symbol_engine_optimizer"
* fuse the shape calculation nodes to a KernelPacket node, to optimize the runtime pipeline.
2024-03-06 22:54:32 +08:00
huangbingjian
be7d16423a
Implicit type conversion by signatures.
2024-03-04 23:14:02 +08:00
lilinjie
e76bb29d23
host api adapt: ReLU
2024-03-01 16:47:49 +08:00
i-robot
27ef7b4fcd
!63956 refactor cell call func
...
Merge pull request !63956 from chujinjin/refactor_cell_call_func
2024-02-29 07:07:43 +00:00
i-robot
0837b1d299
!65168 Add testcases for dataset_sink_mode
...
Merge pull request !65168 from maning202007/r2.3
2024-02-29 03:50:11 +00:00
maning202007
eb90c6cb37
Add testcases for dataset_sink_mode
...
Add testcase for hccl dump
fix random failure for ge dump testcases
2024-02-27 19:26:51 +08:00
panzhihui
b6722754f2
CpuKernel uses cann package library and header files
2024-02-24 14:36:39 +08:00
chujinjin
cb72044510
refactor cell call func
2024-02-22 16:50:05 +08:00
panzhihui
78a5f2b6e7
Remove cust_aicpu flag in ir
2024-02-21 10:11:28 +08:00
panzhihui
887810fd39
Add op ge support
2024-02-19 15:29:31 +08:00
hejianheng001
0b0f24c7aa
Revert "Revert 'Pull Request !65105 : merge pijit project'"
...
This reverts commit 9480bc3f7f.
fix python 3.8 ci
fix include
2024-02-17 21:19:34 +08:00
yanghaoran
9480bc3f7f
Revert 'Pull Request !65105 : merge pijit project'
2024-02-06 11:20:59 +00:00
i-robot
d299bcc422
!65103 kbk dump
...
Merge pull request !65103 from maning202007/feature-2.3-dump
2024-02-06 07:30:55 +00:00
maning202007
1a5e0da93b
json acl
2024-02-05 16:06:26 +08:00
yanghaoran
0cd72b78c3
upgrade ascend 20231101 r2.3
...
fix ms adapter bug
Revert "upgrade ascend 20231101 r2.3"
This reverts commit fc74ed6fc88bf3799dd26e59f6f2443b7f225976.
2024-02-05 11:04:19 +08:00
lianliguang
952c017b6c
add eval cnode function
...
add ut test for example
2024-02-05 11:04:19 +08:00
zhumingming
38e7654e8a
Add fft/ifft/idct/correlate kernel and func api
2024-01-26 17:16:03 +08:00
fengyixing
97c4a6e8b0
Revert 'Pull Request !64600 : add idct/fft/ifft at cpu/aicpu backend & correlate at cpu/aicpu/gpu backend'
2024-01-26 03:47:09 +00:00
zhumingming
ffb8171623
add idct at cpu/aicpu backend
2024-01-25 10:23:28 +08:00
jjfeing
e34475c4d9
use cann inc api
2024-01-23 19:38:40 +08:00
zjun
07e001ae6d
Opt grad execute by pyboost
...
Signed-off-by: zjun <zhangjun0@huawei.com>
2024-01-17 11:14:50 +08:00
i-robot
8598d2c510
!62796 aclnn ops: silu/silu_grad and sigmoid/sigmoid_grad
...
Merge pull request !62796 from jiaxueyu/pyboost
2024-01-16 02:25:01 +00:00
jiaxueyu
34c7048859
pyboost ops: silu/silu_grad and sigmoid/sigmoid_grad
2024-01-15 21:00:24 +08:00
lizhenyu
915fc71879
[Refactor] Delete deprecated interface in KernelTensor
2024-01-13 19:15:03 +08:00
hw_hz
414d867e0c
1. softmax/softmax_backward pyboost
...
2. replace origin softmax grad with SoftmaxBackward.
3. Softmax aclnnKernelMod
4. unify testcase of softmax
5. Format::ND no need to transpose to device shape, modify class Kernel
6. GetValue() in CreateInputAddress
7. SoftmaxCPU register fp16 and fp64
8. Softmax cpu implementation with parallel
9. Tensor.softmax
del softmax_
2024-01-09 15:03:17 +08:00
shaojunsong
4d633c5483
sync python cal_tuple_slice_mask to C++
2024-01-05 16:22:17 +08:00
hanhuifeng2020
b9738861f2
fix some bugs of to_enum in arg_handler
2024-01-01 23:10:41 +08:00
lyfe667
ec3d9f0855
linear sum assignment op support all dtypes
2023-12-22 10:31:32 +08:00
i-robot
116bd7e29d
!62962 add large model inference test case
...
Merge pull request !62962 from zhangminli/llama2-2.3
2023-12-19 09:28:34 +00:00
i-robot
370abc06b2
!63201 optimize trace in Cell
...
Merge pull request !63201 from NaCN/optimize_trace
2023-12-19 06:29:15 +00:00
zhang-minli
019e879e82
add large model inference test case
2023-12-19 09:33:33 +08:00
huangchengnuo
64a01fe2c4
optimize trace in Cell
2023-12-15 09:57:30 +08:00
shaojunsong
dba0ec3fc6
optimize setitem with tuple contains slice
2023-12-14 15:01:35 +08:00
i-robot
d0406cb6dc
!63045 [LITE] sync r2.2.10 ascendc code
...
Merge pull request !63045 from mengyuanli/r2.3
2023-12-13 06:26:49 +00:00
mengyuanli
58a648eaba
add PromptKVCache and DecoderKVCache ascend op
...
inplace update for ge
fix bug of prompt kv cache
fix bug of kernel sync problem
prompt kvcache add multi batch index
prompt kvcache and decoder kvcache support multi data type
decoder kvcache jump -1 seq_len
prompt kvcache jump -1 batch index
fix bug of decoder_kv_cache not supporting big batch size
prompt_kv_cache support valid_seq_len
prompt_kv_cache and decoder_kv_cache support BSD format
make prompt_kv_cache and decoder_kv_cache support BSD format
2023-12-13 10:04:58 +08:00
limingqi107
17742cd102
add runtime committer
2023-12-12 08:45:18 +08:00
liangcanli
7e698a5c9a
support override getattr
2023-12-08 09:15:57 +08:00
limingqi107
bb6ac254c7
add runtime committer
2023-11-21 10:39:47 +08:00
i-robot
8b75c2270c
!61642 Fix range problem and a in b problem
...
Merge pull request !61642 from LiangZhibo/range
2023-11-13 03:13:07 +00:00
i-robot
1da59bf782
!61455 Support starred_expression in graph mode.
...
Merge pull request !61455 from Margaret_wangrui/starred_expression_unpack_r2.3
2023-11-11 02:16:15 +00:00
liangzhibo
1484a1a608
Fix range with empty output
2023-11-10 23:10:17 +08:00
Margaret_wangrui
cacd854fed
Support starred_expression in graph mode.
2023-11-09 11:28:21 +08:00
ligan
2d7808f8ac
fix ut bug.
2023-11-08 22:35:08 +08:00