Laurent Mazare
0e2c8c17fb
UG metal integration. (#2580)
2024-10-27 15:20:37 +01:00
Laurent Mazare
594d984f9c
Support for UG kernels. (#2579)
* Support for UG kernels.
* Add a dedicated test.
2024-10-27 13:37:19 +01:00
Laurent Mazare
3699c1a053
Fix the repo name for llama 3.1. (#2576)
* Fix the repo name for llama 3.1.
* Fix the book.
2024-10-26 11:25:04 +02:00
Laurent Mazare
3a3c48b14b
Bump the crate version to 0.7.2. (#2517)
2024-09-29 10:56:50 +02:00
Laurent Mazare
8097559c1a
Move the candle version to 0.7.1. (#2495)
2024-09-22 20:44:39 +02:00
Laurent Mazare
829dcfa8dc
Update cudarc to 0.12.1. (#2494)
2024-09-22 20:32:29 +02:00
Laurent Mazare
c2fca0ca11
Bump the crate version. (#2491)
2024-09-21 15:13:12 +02:00
Laurent Mazare
aafa24ed93
Update cudarc to 0.12. (#2451)
* Update cudarc to 0.12.
* Some cudnn tweaks.
2024-08-27 10:10:30 +02:00
Laurent Mazare
6070278a31
Bump the version to 0.6.1. (#2438)
2024-08-22 09:23:52 +02:00
ivarflakstad
24d54d0ff9
Bump image crate version so ImageReader is available without aliasing (#2365)
2024-07-29 17:41:33 +02:00
Josh Collyer
d74fbed334
Pinning cudarc to 0.11.6 (#2332)
2024-07-15 15:29:08 +02:00
Laurent Mazare
f65e90e7ef
Bump the crate version. (#2248)
2024-06-05 15:49:15 +02:00
Eric Buehler
7abc3b8cd7
Bump cudarc version to 0.11.4 (#2230)
2024-06-01 08:18:35 +02:00
Laurent Mazare
46012ed31f
Another cudarc update. (#2229)
2024-05-30 22:27:06 +02:00
Laurent Mazare
f3fade3b03
Update cudarc to 0.11.2. (#2227)
2024-05-29 18:50:52 +02:00
Laurent Mazare
9b8537a62f
Remove the deprecated wav crate in favor of hound. (#2202)
2024-05-21 21:43:35 +02:00
Laurent Mazare
d9bc5ec151
Switch cudarc back to dynamic linking. (#2176)
2024-05-09 10:35:44 +02:00
Sidharth Rajaram
84328e2b60
Update cudarc requirement from 0.11.0 to 0.11.1 (#2174)
* Upgrading cudarc dependency from v0.11.0 to v0.11.1 due to that version having resolved a compile-time bug.
See: https://github.com/huggingface/candle/issues/2173
2024-05-08 20:40:36 +02:00
dependabot[bot]
82b641fd27
Update cudarc requirement from 0.10.0 to 0.11.0 (#2165)
* Update cudarc requirement from 0.10.0 to 0.11.0
Updates the requirements on [cudarc](https://github.com/coreylowman/cudarc) to permit the latest version.
- [Release notes](https://github.com/coreylowman/cudarc/releases)
- [Commits](https://github.com/coreylowman/cudarc/compare/v0.10.0...v0.10.0)
---
updated-dependencies:
- dependency-name: cudarc
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
* Use the default cuda version.
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: laurent <laurent.mazare@gmail.com>
2024-05-06 17:12:14 +02:00
Laurent Mazare
89f53b9d7b
Bump the version number to 0.5.1. (#2155)
* Bump the version number to 0.5.1.
* Fix clippy lints for 1.78.
* More clippy fixes.
2024-05-03 11:17:05 +02:00
dependabot[bot]
267601eec1
Update tokenizers requirement from 0.15.0 to 0.19.1 (#2104)
Updates the requirements on [tokenizers](https://github.com/huggingface/tokenizers) to permit the latest version.
- [Release notes](https://github.com/huggingface/tokenizers/releases)
- [Changelog](https://github.com/huggingface/tokenizers/blob/main/RELEASE.md)
- [Commits](https://github.com/huggingface/tokenizers/compare/v0.15.0...v0.15.2)
---
updated-dependencies:
- dependency-name: tokenizers
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-22 17:10:46 +02:00
dependabot[bot]
08a15cb79e
Update zip requirement from 0.6.6 to 1.1.1 (#2103)
* Update zip requirement from 0.6.6 to 1.1.1
---
updated-dependencies:
- dependency-name: zip
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
* Fix for the zip crate update.
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: laurent <laurent.mazare@gmail.com>
2024-04-22 16:23:27 +02:00
Laurent Mazare
f76bb7794a
Bumping the version number to 0.5.0. (#2009)
2024-04-04 17:48:45 +02:00
Laurent Mazare
3144150b8d
Move the tensor-tools binary in a separate crate. (#1969)
2024-03-30 15:49:37 +01:00
dependabot[bot]
356a170ae9
Update parquet requirement from 50.0.0 to 51.0.0 (#1867)
Updates the requirements on [parquet](https://github.com/apache/arrow-rs) to permit the latest version.
- [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG-old.md)
- [Commits](https://github.com/apache/arrow-rs/compare/50.0.0...50.0.0)
---
updated-dependencies:
- dependency-name: parquet
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-03-29 21:58:15 +01:00
Laurent Mazare
18036c6ccb
Update the image crate + use the re-exported version. (#1893)
* Update the image crate + use the re-exported version.
* Update to using ab_glyph.
2024-03-21 10:56:41 +01:00
Laurent Mazare
e7fc1daa21
Bump the crate versions to 0.4.2. (#1821)
2024-03-08 22:01:51 +01:00
Laurent Mazare
4fff5b51f5
Metavoice - first cut (#1717)
* Add the metavoice transformer.
* Sketch the speaker-encoder module.
* Adding to the metavoice model.
* Start adding the metavoice example.
* Get some logits out.
* Load the second stage model.
* Get the second step to run.
* Tweak the example.
* Add encodec tilting.
* Glue the different bits together.
* Fix a shape issue.
* Use a constant.
* BPE tokenization.
* Add a warning.
2024-03-02 18:50:01 +01:00
Laurent Mazare
5e526abc8c
Bump the version number to 0.4.1. (#1768)
* Fix the block size for some cuda kernels.
* Bump the version number to 0.4.1.
2024-02-27 14:19:59 +01:00
Laurent Mazare
a83ca2ece0
Bump the crate version to 0.4.0. (#1658)
2024-02-04 19:08:01 +01:00
Laurent Mazare
beadb1b434
Explicit candle version so that cargo publish can be used easily. (#1641)
2024-01-31 18:42:22 +01:00
dependabot[bot]
7e3349d7c3
Update parquet requirement from 45.0.0 to 50.0.0 (#1592)
Updates the requirements on [parquet](https://github.com/apache/arrow-rs) to permit the latest version.
- [Changelog](https://github.com/apache/arrow-rs/blob/master/CHANGELOG-old.md)
- [Commits](https://github.com/apache/arrow-rs/compare/45.0.0...45.0.0)
---
updated-dependencies:
- dependency-name: parquet
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-01-15 22:35:01 +01:00
dependabot[bot]
1257fc6719
Update safetensors requirement from 0.3.1 to 0.4.1 (#1591)
Updates the requirements on [safetensors](https://github.com/huggingface/safetensors) to permit the latest version.
- [Release notes](https://github.com/huggingface/safetensors/releases)
- [Changelog](https://github.com/huggingface/safetensors/blob/main/RELEASE.md)
- [Commits](https://github.com/huggingface/safetensors/compare/v0.3.1...v0.3.3)
---
updated-dependencies:
- dependency-name: safetensors
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-01-15 22:34:40 +01:00
Laurent Mazare
86b7c01b30
Update gemm to the latest version. (#1587)
2024-01-15 09:44:51 +01:00
dependabot[bot]
a897fda74e
Update memmap2 requirement from 0.7.1 to 0.9.3 (#1556)
Updates the requirements on [memmap2](https://github.com/RazrFalcon/memmap2-rs) to permit the latest version.
- [Changelog](https://github.com/RazrFalcon/memmap2-rs/blob/master/CHANGELOG.md)
- [Commits](https://github.com/RazrFalcon/memmap2-rs/compare/v0.7.1...v0.7.1)
---
updated-dependencies:
- dependency-name: memmap2
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-01-10 16:27:59 +01:00
dependabot[bot]
6e98cf2a92
Update cudarc requirement from 0.9.14 to 0.10.0 (#1559)
Updates the requirements on [cudarc](https://github.com/coreylowman/cudarc) to permit the latest version.
- [Release notes](https://github.com/coreylowman/cudarc/releases)
- [Commits](https://github.com/coreylowman/cudarc/compare/v0.9.14...v0.9.15)
---
updated-dependencies:
- dependency-name: cudarc
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-01-10 16:27:05 +01:00
dependabot[bot]
2cc1247999
Update tokenizers requirement from 0.13.4 to 0.15.0 (#1555)
Updates the requirements on [tokenizers](https://github.com/huggingface/tokenizers) to permit the latest version.
- [Release notes](https://github.com/huggingface/tokenizers/releases)
- [Changelog](https://github.com/huggingface/tokenizers/blob/main/RELEASE.md)
- [Commits](https://github.com/huggingface/tokenizers/commits)
---
updated-dependencies:
- dependency-name: tokenizers
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-01-10 16:26:53 +01:00
Nicolas Patry
b4cb982e49
Simplifying our internal cargo dependencies. (#1529)
2024-01-07 12:04:14 +01:00
Laurent Mazare
d35f0a1376
Bump the crate version to 0.3.3. (#1490)
2023-12-28 13:38:30 +01:00
Nicolas Patry
9fc210fae8
Merge pull request #1318 from huggingface/metal4
Starting to fix some tests.
2023-12-20 15:37:31 +01:00
Nicolas Patry
9b5e4843a6
Optimizing decode matmul (Phi at 28tok/s on M3).
Adding some benchmark in order to help checking out matmul performance.
2023-12-20 09:54:19 +01:00
Laurent Mazare
94817dac56
Bump the crate version to 0.3.2. (#1452)
2023-12-17 05:34:53 -06:00
Nicolas Patry
931432ed55
Fixing tests + matmul from MFA
2023-12-13 16:58:36 +01:00
Nicolas Patry
c66e5d4716
Fix comments.
2023-11-20 14:13:44 +01:00
Nicolas Patry
7cfffcac10
Debugging rope.
2023-11-20 14:12:57 +01:00
Nicolas Patry
38de52bc4b
Fixed matmul (display still broken without casting back to CPU first?)
2023-11-20 14:12:57 +01:00
Nicolas Patry
39406a6721
Adding the actual backend
2023-11-20 14:12:56 +01:00
Laurent Mazare
a209ce8ceb
Update for 0.3.1. (#1324)
2023-11-11 18:48:52 +00:00
Nicolas Patry
26c4e5bf1d
Metal part 1 - Scaffolding for metal. (#1308)
* Metal part 1 - Scaffolding for metal.
* Remove tracing.
2023-11-10 08:35:48 +01:00
Juarez Bochi
18d30005c5
Add support to UL2 model family (#1300)
* Add support to UL2 model family
* Update docs with UL2
* Create ActivationWithOptionalGating to avoid polluting activations
* Also refactor quantized t5
* Remove useless conversion
* Revert Activation::NewGelu name change
* Remove useless return
* Apply rustfmt and clippy recommendations
* Reuse t5::ActivationWithOptionalGating in quantized version
* (cosmetic change) use a match rather than ifs + avoid early returns.
---------
Co-authored-by: Laurent <laurent.mazare@gmail.com>
2023-11-09 18:55:09 +01:00