candle/candle-examples
Santiago Medina ace282e5c2
Add flag to run Moondream in f16 precision (#2015)
* moondream implementation

* add moondream example

* change config default activation

* Add assets and integrate phi mixformer with example

* Make use of kv cache and fix seq_len bug; Clean up example code

* Add README link to example

* Remove pos_embed scaling; Remove assets; Add to README; Expand VisionConfig

* Delete image

* Use apply instead of forward

* Use latest release special token; Fix token/s accuracy; Use GeluPytorchTanh in VisionConfig v2

* Add flag to use f16

* Avoid breaking the quantized version on cuda.

---------

Co-authored-by: laurent <laurent.mazare@gmail.com>
2024-04-05 07:03:33 +02:00
examples    Add flag to run Moondream in f16 precision (#2015)               2024-04-05 07:03:33 +02:00
src         Normalize loudness of the generated audio (#1795)                2024-03-03 14:00:42 +01:00
Cargo.toml  Update for pyo3 0.21. (#1985)                                    2024-04-01 17:07:02 +02:00
README.md   Add some missing readme files. (#304)                            2023-08-02 10:57:12 +01:00
build.rs    Pin the revision used for phi-v2 + make it the default. (#1572)  2024-01-12 09:19:30 +01:00

README.md

candle-examples