Alex Rozanski
713b65df3e
add back Makefile and quantize.cpp in correct locations
2023-03-16 02:30:13 +01:00
Alex Rozanski
e0d648c817
remove quantize.cpp
2023-03-16 01:30:35 +01:00
Alex Rozanski
058377cb3e
add SPM support
2023-03-16 01:17:44 +01:00
Alex Rozanski
5f4905a256
separate Swift and C++/Objective-C++ code into separate libraries
2023-03-16 00:24:12 +01:00
Alex Rozanski
b44026b8db
remove Makefile and CMakeLists.txt
...
will pull in any changes from llama.cpp and adjust the Xcode configuration
to match this.
2023-03-15 10:49:00 +01:00
Alex Rozanski
56c19351db
Merge remote-tracking branch 'llama.cpp/master'
2023-03-15 10:36:15 +01:00
Ronsor
47857e564c
Don't use vdotq_s32 if it's not available (#139)
...
* Don't use vdotq_s32 if it's not available
`dotprod` extensions aren't available on some ARM CPUs (e.g. Raspberry Pi 4), so check for them and only use them if they're available.
Reintroduces the code removed in 84d9015 if `__ARM_FEATURE_DOTPROD` isn't defined.
* Update ggml.c
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-14 21:34:37 +02:00
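
The compile-time gate described in this commit can be sketched roughly as follows. This is an illustrative sketch only, not the actual code in ggml.c: `dot_i8` is a hypothetical helper name, and the scalar loop stands in for whatever fallback the commit reintroduced. It assumes the compiler defines `__ARM_FEATURE_DOTPROD` only when the `dotprod` extension is enabled for the target.

```c
#include <stddef.h>
#include <stdint.h>

#if defined(__ARM_FEATURE_DOTPROD)
#include <arm_neon.h>
#endif

/* Hypothetical helper (not taken from ggml.c): dot product of two int8 arrays.
 * The vdotq_s32 path is compiled only when the dotprod extension is available;
 * otherwise the whole array goes through the portable scalar loop. */
static int32_t dot_i8(const int8_t *a, const int8_t *b, size_t n) {
    int32_t sum = 0;
    size_t i = 0;
#if defined(__ARM_FEATURE_DOTPROD)
    /* Process 16 int8 lanes per iteration with the dotprod instruction. */
    int32x4_t acc = vdupq_n_s32(0);
    for (; i + 16 <= n; i += 16) {
        acc = vdotq_s32(acc, vld1q_s8(a + i), vld1q_s8(b + i));
    }
    /* Horizontal add of the four 32-bit partial sums. */
    sum = vgetq_lane_s32(acc, 0) + vgetq_lane_s32(acc, 1)
        + vgetq_lane_s32(acc, 2) + vgetq_lane_s32(acc, 3);
#endif
    /* Scalar fallback: handles everything without dotprod, or the tail with it. */
    for (; i < n; ++i) {
        sum += (int32_t)a[i] * (int32_t)b[i];
    }
    return sum;
}
```

On a CPU without the extension (the commit mentions the Raspberry Pi 4), a build targeting that CPU would leave the macro undefined, so the `vdotq_s32` call is never emitted and no illegal instruction can be reached.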
Radoslav Gerganov
60f819a2b1
Add section to README on how to run the project on Android (#130)
2023-03-14 15:30:08 +02:00
Alex Rozanski
8ce62d4135
improve README
2023-03-14 12:03:12 +01:00
Alex Rozanski
27afd915b8
update license file
2023-03-14 11:53:49 +01:00
Alex Rozanski
f5779567ed
update readme
2023-03-14 11:52:34 +01:00
Alex Rozanski
e69e116b82
improve output
2023-03-14 11:45:59 +01:00
Alex Rozanski
e99e6f1fbf
make sample app interactive
2023-03-14 11:45:59 +01:00
Alex Rozanski
81e97c8585
remove prints to stderr and return NSErrors
2023-03-14 11:45:59 +01:00
Alex Rozanski
2b27f14035
remove debug printf() statements
2023-03-14 11:45:59 +01:00
Alex Rozanski
60458cc580
improve swift API and test app
2023-03-14 11:45:59 +01:00
Alex Rozanski
e3df7bb91f
add basic error handling
2023-03-14 11:45:59 +01:00
Alex Rozanski
e04b99370c
pass events from C++ through to Swift
2023-03-14 11:45:59 +01:00
Alex Rozanski
b3404f6d27
execute prediction in NSOperation
2023-03-14 11:45:59 +01:00
Alex Rozanski
95722fb0e9
support reverse prompt in config
2023-03-14 11:45:59 +01:00
Alex Rozanski
9e54dc6d56
wire up LlamaRunner to cpp code
2023-03-14 11:45:59 +01:00
Alex Rozanski
406470dd3b
move cpp files to separate directory
2023-03-14 11:45:59 +01:00
Alex Rozanski
f33e7dbed3
fix file check in llamaTest
2023-03-14 11:45:59 +01:00
Alex Rozanski
2717aee3c1
add basic Objective-C runner and pass model path
2023-03-14 11:45:59 +01:00
Alex Rozanski
7e27bbc02e
add basic Xcode project and include cpp files
2023-03-14 11:45:59 +01:00
Georgi Gerganov
97ab2b2578
Add Misc section + update hot topics + minor fixes
2023-03-14 09:43:52 +02:00
Sebastián A
2f700a2738
Add Windows to the CI (#98)
2023-03-13 22:29:10 +02:00
Georgi Gerganov
c09a9cfb06
CMake build in Release by default (#75)
2023-03-13 21:22:15 +02:00
Georgi Gerganov
7ec903d3c1
Update contribution section, hot topics, limitations, etc.
2023-03-13 19:21:51 +02:00
Georgi Gerganov
4497ad819c
Print system information
2023-03-13 19:15:08 +02:00
Sebastián A
ed6849cc07
Initial support for CMake (#75)
2023-03-13 19:12:33 +02:00
Thomas Klausner
41be0a3b3d
Add NetBSD support. (#90)
2023-03-13 18:40:54 +02:00
Pavol Rusnak
671d5cac15
Use fprintf for diagnostic output (#48)
...
keep printf only for printing model output
one can now use ./main ... 2>/dev/null to suppress any diagnostic output
2023-03-13 18:39:56 +02:00
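
The split this commit describes, diagnostics on stderr and model output on stdout, might look roughly like the sketch below. The helper names `emit_diagnostic` and `emit_output` are hypothetical and are not taken from the project; they only illustrate the stream separation.

```c
#include <stdio.h>

/* Hypothetical helper: diagnostics (timings, model info, warnings) go to
 * stderr. Returns the number of characters written, as fprintf does. */
static int emit_diagnostic(const char *msg) {
    return fprintf(stderr, "%s\n", msg);
}

/* Hypothetical helper: generated model text goes to stdout, with no added
 * newline, so it can be piped or captured on its own. */
static int emit_output(const char *text) {
    int written = printf("%s", text);
    fflush(stdout); /* keep streamed tokens visible as they are produced */
    return written;
}
```

With this separation, redirecting file descriptor 2 (for example `2>/dev/null`) drops only the diagnostic lines and leaves the generated text on stdout.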
Georgi Gerganov
84d9015c4a
Use vdotq_s32 to improve performance (#67)
...
* 10% performance boost on ARM
* Back to original change
2023-03-13 18:36:44 +02:00
uint256_t
63fd76fbb0
Reduce model loading time (#43)
...
* Use buffering
* Use vector
* Minor
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-13 18:33:43 +02:00
Val Kharitonov
2a20f48efa
Fix UTF-8 handling (including colors) (#79)
2023-03-13 18:24:18 +02:00
Pavol Rusnak
d1f224712d
Add quantize script for batch quantization (#92)
...
* Add quantize script for batch quantization
* Indentation
* README for new quantize.sh
* Fix script name
* Fix file list on Mac OS
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-13 18:15:20 +02:00
Georgi Gerganov
1808ee0500
Add initial contribution guidelines
2023-03-13 09:42:26 +02:00
Matvey Soloviev
a169bb889c
Gate signal support on being on a unixoid system. (#74)
2023-03-13 04:08:01 +01:00
Matvey Soloviev
460c482540
Fix token count accounting
2023-03-13 01:04:41 +01:00
Georgi Gerganov
c80e2a8f2a
Revert "10% performance boost on ARM"
...
This reverts commit 113a9e83eb.
There are some reports of illegal instruction errors.
Moved this to the vdotq_s32 branch until resolved
2023-03-13 01:28:08 +02:00
Georgi Gerganov
54a0e66ea0
Check for vdotq_s32 availability
2023-03-13 01:21:03 +02:00
Georgi Gerganov
543c57e991
Amend to previous commit - forgot to update non-QRDMX branch
2023-03-13 01:05:24 +02:00
Georgi Gerganov
113a9e83eb
10% performance boost on ARM
2023-03-13 00:56:10 +02:00
Matvey Soloviev
404fac0d62
Fix color getting reset before prompt output done (#65)
...
(cherry picked from commit 7eb2987619feee04c40eff69b604017d09919cb6)
2023-03-13 00:07:34 +02:00
Georgi Gerganov
1a0a74300f
Update README.md
2023-03-12 23:39:01 +02:00
Matvey Soloviev
96ea727f47
Add interactive mode (#61)
...
* Initial work on interactive mode.
* Improve interactive mode. Make rev. prompt optional.
* Update README to explain interactive mode.
* Fix OS X build
2023-03-12 23:13:28 +02:00
Marc Köhlbrugge
9661954835
Fix typo in README (#45)
2023-03-12 22:30:08 +02:00
Ben Garney
f385f8dee8
Allow using prompt files (#59)
2023-03-12 22:28:36 +02:00
beiller
02f0c6fe7f
Add back top_k (#56)
...
* Add back top_k
* Update utils.cpp
* Update utils.h
---------
Co-authored-by: Bill Hamilton <bill.hamilton@shopify.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-12 22:23:15 +02:00