Commit Graph

43067 Commits

Author SHA1 Message Date
yanghaoran 09da09f216 fix gpu docker environment paths 2022-04-27 19:03:18 +08:00
i-robot 71d55684e1
!33615 fix log_matrix_determinant accuracy for gpu backend.
Merge pull request !33615 from zhuzhongrui/pub_master2
2022-04-27 05:55:57 +00:00
i-robot 47e91e5dbc
!33621 modify format
Merge pull request !33621 from 俞涵/code_docs_04271
2022-04-27 04:13:09 +00:00
i-robot 41b9363230
!33614 add win ST testcases back
Merge pull request !33614 from yanghaoran/master
2022-04-27 03:22:56 +00:00
i-robot 0966af7d3b
!33384 vmap unpack graph
Merge pull request !33384 from Erpim/vmap_v15
2022-04-27 03:11:26 +00:00
huodagu aeb67af089 modify format2function 2022-04-27 11:10:16 +08:00
i-robot b7cebc1ace
!33514 [MSLITE][GPU] fix gpu 16bit quant bug
Merge pull request !33514 from Greatpan/fix_gpu_16bit_quant_bug
2022-04-27 03:04:06 +00:00
i-robot 42168e9dc9
!33582 [lite]fix thread-distribution question
Merge pull request !33582 from 徐安越/master1
2022-04-27 03:01:43 +00:00
i-robot ed4102c960
!33450 gpu collective ops data generalize
Merge pull request !33450 from chenweifeng/gpu-collective-data-generalization
2022-04-27 02:32:37 +00:00
z00512249 653efcadf6 fix log_matrix_determinant accuracy for gpu backend. 2022-04-27 10:27:43 +08:00
i-robot f49dd1f0cd
!33555 Construct distributed graph for embeddinglooup on servers.
Merge pull request !33555 from ZPaC/server-embedding-table-slice
2022-04-27 02:18:10 +00:00
yanghaoran 7eca0916dd add win ST testcases back 2022-04-27 10:17:13 +08:00
i-robot 712be60c30
!33586 GraphKernel Fix softsign test case bug
Merge pull request !33586 from ZengZitao/windows_continue
2022-04-27 02:04:36 +00:00
i-robot 885db5d39a
!33564 GraphKernel Add CheckInputs in some expand dsl and Close expandfallback on Ascend
Merge pull request !33564 from ZengZitao/type_fix
2022-04-27 02:04:36 +00:00
i-robot 42141e991b
!33524 LockRuntime to stream level
Merge pull request !33524 from TuDouNi/stream_lock
2022-04-27 01:36:43 +00:00
i-robot 2b13573044
!33239 Adjust the import specification of initializer, context and train
Merge pull request !33239 from 冯一航/adjust_import_spec_replenish
2022-04-27 00:47:22 +00:00
i-robot 6c9590d13d
!32632 add new Ascend operator TruncatedNormal
Merge pull request !32632 from yyxhgg/TruncatedNormal
2022-04-26 13:04:03 +00:00
i-robot e48b8835cf
!33554 [MSLITE][DEVELOP] fix bug of layernorm
Merge pull request !33554 from yangruoqi713/layernorm
2022-04-26 13:02:13 +00:00
i-robot 2591147c31
!33536 [MSLITE][GPU] Opencl reuses opengl texture features, supported by default, without macro separation
Merge pull request !33536 from Greatpan/gl_define_opencl
2022-04-26 12:48:40 +00:00
i-robot deabcbe420
!33558 runtime performance optimizer-eliminate the string find when output data is to stack actor
Merge pull request !33558 from limingqi107/bug_fix4
2022-04-26 11:36:09 +00:00
i-robot 640f06518f
!33157 flatten concat fission
Merge pull request !33157 from kisnwang/split-flatten-concat
2022-04-26 11:22:15 +00:00
i-robot 2854745ec4
!33542 Support to split forward feed kernel.
Merge pull request !33542 from ZPaC/slice-server-embedding-table
2022-04-26 11:22:12 +00:00
i-robot c926660451
!32001 atomic_clean_real_shape
Merge pull request !32001 from zhupuxu/atomic_clean_real_shape
2022-04-26 11:13:03 +00:00
zengzitao f6969e02ae don't run test case on windows 2022-04-26 18:23:05 +08:00
Erpim 04fe405289 vmap unpack graph 2022-04-26 18:20:41 +08:00
zengzitao 751a9e0094 add checkinput in expand dsl and close expandfallback on ascend 2022-04-26 18:09:17 +08:00
i-robot 8ce6015ae6
!33324 [MD] Speed up SentencePiece Tokenizer in Eager mode
Merge pull request !33324 from Mohammad Motallebi/sentencepiece_eager_optimization
2022-04-26 09:56:18 +00:00
i-robot df72761ad9
!33562 modify release
Merge pull request !33562 from xumengjuan1/master
2022-04-26 09:36:40 +00:00
ttudu af7aa0f41c LockRuntime with stream 2022-04-26 17:21:45 +08:00
i-robot 0be8cf4271
!33583 takedown windows cpu testcases and add windows ST back to gate level0
Merge pull request !33583 from yanghaoran/master
2022-04-26 09:21:27 +00:00
yanghaoran 75d7cd8fce takedown windows cpu testcases and add windows ST back to gate level0 2022-04-26 17:16:57 +08:00
i-robot 88311d6095
!33434 Add C++ primitive and infer func for TensorScatterAdd/Sub/Max/Min
Merge pull request !33434 from liangzelang/dev_tensorscatterop_cpu
2022-04-26 08:56:26 +00:00
i-robot fcb0319747
!33270 [assistant][InverseMelScale]
Merge pull request !33270 from chenchen/InverseMelScale
2022-04-26 08:55:58 +00:00
xuanyue b096e53779 fix thread-distribution question 2022-04-26 16:41:07 +08:00
i-robot 359cf60144
!33577 fix docker scripts
Merge pull request !33577 from yanghaoran/master
2022-04-26 08:27:31 +00:00
greatpanc b8a298f4ff fix gpu 16bit quant bug 2022-04-26 16:19:38 +08:00
yanghaoran 411086f538 fix docker scripts 2022-04-26 16:19:17 +08:00
i-robot 69d5bd0320
!33563 add log matrix_determinant and matrix_determinant kernel for gpu backend.
Merge pull request !33563 from zhuzhongrui/pub_master
2022-04-26 08:17:02 +00:00
i-robot f75c8a195e
!33323 add Power Sign cpu kernel
Merge pull request !33323 from chujinjin/add_power_sign_kernel_for_cpu
2022-04-26 08:11:34 +00:00
ZPaC 0477f6a8f8 Construct distributed graph for embeddinglooup on servers. 2022-04-26 15:49:51 +08:00
kswang e4ef608d20 add flatten concat fission 2022-04-26 15:40:36 +08:00
i-robot 851a1fc9e5
!33539 Update check tensor logic
Merge pull request !33539 from zichun_ye/graph_mode_check_tensor
2022-04-26 06:50:55 +00:00
ZPaC 8248e21ab1 Support to split forward feed kernel. 2022-04-26 14:22:55 +08:00
i-robot c1707043f9
!33534 add dropout2d kernel for gpu backend.
Merge pull request !33534 from zhuzhongrui/pub_master3
2022-04-26 06:21:01 +00:00
i-robot 11ca560a98
!33547 Add rank id for node base
Merge pull request !33547 from chengang/replace_rank_id_api
2022-04-26 06:04:35 +00:00
i-robot ddb057d1ee
!33532 [assistant][ops] Add FractionalAvgPool
Merge pull request !33532 from 徐喻琳/FractionalAvgPool
2022-04-26 04:01:47 +00:00
z00512249 17e20ffde2 add log matrix_determinant and matrix_determinant kernel for gpu backend. 2022-04-26 11:51:18 +08:00
i-robot 4d72b77f43
!33537 refactor expander api
Merge pull request !33537 from r1chardf1d0/e17
2022-04-26 03:42:41 +00:00
i-robot 29fad8ef0a
!33553 Fix bug for cuda10
Merge pull request !33553 from hezhenhao1/fix_bug
2022-04-26 03:28:38 +00:00
i-robot 76f372ac4b
!33531 Add none abstract for nullptr.
Merge pull request !33531 from gaoyong10/dynamic_shape_01
2022-04-26 03:11:00 +00:00