langchain4j

Commit Graph

Author	SHA1	Message	Date
LangChain4j	91db3d354a	bumped to 0.29.0-SNAPSHOT	2024-03-14 13:31:28 +01:00
LangChain4j	90fe3040b9	released 0.28.0 (#735 )	2024-03-11 20:08:55 +01:00
LangChain4j	1acb7a607f	EmbeddingStore (Metadata) Filter API (#610 ) ## New EmbeddingStore (metadata) `Filter` API Many embedding stores, such as [Pinecone](https://docs.pinecone.io/docs/metadata-filtering) and [Milvus](https://milvus.io/docs/boolean.md) support strict filtering (think of an SQL "WHERE" clause) during similarity search. So, if one has an embedding store with movies, for example, one could search not only for the most semantically similar movies to the given user query but also apply strict filtering by metadata fields like year, genre, rating, etc. In this case, the similarity search will be performed only on those movies that match the filter expression. Since LangChain4j supports (and abstracts away) many embedding stores, there needs to be an embedding-store-agnostic way for users to define the filter expression. This PR introduces a `Filter` interface, which can represent both simple (e.g., `type = "documentation"`) and composite (e.g., `type in ("documentation", "tutorial") AND year > 2020`) filter expressions in an embedding-store-agnostic manner. `Filter` currently supports the following operations: - Comparison: - `IsEqualTo` - `IsNotEqualTo` - `IsGreaterThan` - `IsGreaterThanOrEqualTo` - `IsLessThan` - `IsLessThanOrEqualTo` - `IsIn` - `IsNotIn` - Logical: - `And` - `Not` - `Or` These operations are supported by most embedding stores and serve as a good starting point. However, the list of operations will expand over time to include other operations (e.g., `Contains`) supported by embedding stores. Currently, the DSL looks like this: ```java Filter onlyDocs = metadataKey("type").isEqualTo("documentation"); Filter docsAndTutorialsAfter2020 = metadataKey("type").isIn("documentation", "tutorial").and(metadataKey("year").isGreaterThan(2020)); // or Filter docsAndTutorialsAfter2020 = and( metadataKey("type").isIn("documentation", "tutorial"), metadataKey("year").isGreaterThan(2020) ); ``` ## Filter expression as a `String` Filter expression can also be specified as a `String`. This might be necessary, for example, if the filter expression is generated dynamically by the application or by the LLM (as in [self querying](https://python.langchain.com/docs/modules/data_connection/retrievers/self_query/)). This PR introduces a `FilterParser` interface with a simple `Filter parse(String)` API, allowing for future support of multiple syntaxes (if this will be required). For the out-of-the-box filter syntax, ANSI SQL's `WHERE` clause is proposed as a suitable candidate for several reasons: - SQL is well-known among Java developers - There is extensive tooling available for SQL (e.g., parsers) - LLMs are pretty good at generating valid SQL, as there are tons of SQL queries on the internet, which are included in the LLM training datasets. There are also specialized LLMs that are trained for text-to-SQL task, such as [SQLCoder](https://huggingface.co/defog). The downside is that SQL's `WHERE` clause might not support all operations and data types that could be supported in the future by various embedding stores. In such case, we could extend it to a superset of ANSI SQL `WHERE` syntax and/or provide an option to express filters in the native syntax of the store. An out-of-the-box implementation of the SQL `FilterParser` is provided as a `SqlFilterParser` in a separate module `langchain4j-embedding-store-filter-parser-sql`, using [JSqlParser](https://github.com/JSQLParser/JSqlParser) under the hood. `SqlFilterParser` can parse SQL "SELECT" (or just "WHERE" clause) statement into a `Filter` object: - `SELECT * FROM fake_table WHERE userId = '123-456'` -> `metadataKey("userId").isEqualTo("123-456")` - `userId = '123-456'` -> `metadataKey("userId").isEqualTo("123-456")` It can also resolve `CURDATE()` and `CURRENT_DATE`/`CURRENT_TIME`/`CURRENT_TIMESTAMP`: `SELECT * FROM fake_table WHERE year = EXTRACT(YEAR FROM CURRENT_DATE` -> `metadataKey("year").isEqualTo(LocalDate.now().getYear())` ## Changes in `Metadata` API Until now, `Metadata` supported only `String` values. This PR expands the list of supported value types to `Integer`, `Long`, `Float` and `Double`. In the future, more types may be added (if needed). The method `String get(String key)` will be deprecated later in favor of: - `String getString(String key)` - `Integer getInteger(String key)` - `Long getLong(String key)` - etc New overloaded `put(key, value)` methods are introduced to support more value types: - `put(String key, int value)` - `put(String key, long value)` - etc ## Changes in `EmbeddingStore` API New method `search` is added that will become the main entry point for search in the future. All `findRelevant` methods will be deprecated later. New `search` method accepts `EmbeddingSearchRequest` and returns `EmbeddingSearchResult`. `EmbeddingSearchRequest` contains all search criteria (e.g. `maxResults`, `minScore`), including new `Filter`. `EmbeddingSearchResult` contains a list of `EmbeddingMatch`. ```java EmbeddingSearchResult search(EmbeddingSearchRequest request); ``` ## Changes in `EmbeddingStoreContentRetriever` API `EmbeddingStoreContentRetriever` can now be configured with a static `filter` as well as dynamic `dynamicMaxResults`, `dynamicMinScore` and `dynamicFilter` in the builder: ```java ContentRetriever contentRetriever = EmbeddingStoreContentRetriever.builder() .embeddingStore(embeddingStore) .embeddingModel(embeddingModel) ... .maxResults(3) // or .dynamicMaxResults(query -> 3) // You can define maxResults dynamically. The value could, for example, depend on the query or the user associated with the query. ... .minScore(0.3) // or .dynamicMinScore(query -> 0.3) ... .filter(metadataKey("userId").isEqualTo("123-456")) // Assuming your TextSegments contain Metadata with key "userId" // or .dynamicFilter(query -> metadataKey("userId").isEqualTo(query.metadata().chatMemoryId().toString())) ... .build(); ``` So now you can define `maxResults`, `minScore` and `filter` both statically and dynamically (they can depend on the query, user, etc.). These values will be propagated to the underlying `EmbeddingStore`. ## ["Self-querying"](https://python.langchain.com/docs/modules/data_connection/retrievers/self_query/) This PR also introduces `LanguageModelSqlFilterBuilder` in `langchain4j-embedding-store-filter-parser-sql` module which can be used with `EmbeddingStoreContentRetriever`'s `dynamicFilter` to automatically build a `Filter` object from the `Query` using language model and `SqlFilterParser`. For example: ```java TextSegment groundhogDay = TextSegment.from("Groundhog Day", new Metadata().put("genre", "comedy").put("year", 1993)); TextSegment forrestGump = TextSegment.from("Forrest Gump", new Metadata().put("genre", "drama").put("year", 1994)); TextSegment dieHard = TextSegment.from("Die Hard", new Metadata().put("genre", "action").put("year", 1998)); // describe metadata keys as if they were columns in the SQL table TableDefinition tableDefinition = TableDefinition.builder() .name("movies") .addColumn("genre", "VARCHAR", "one of [comedy, drama, action]") .addColumn("year", "INT") .build(); LanguageModelSqlFilterBuilder sqlFilterBuilder = new LanguageModelSqlFilterBuilder(model, tableDefinition); ContentRetriever contentRetriever = EmbeddingStoreContentRetriever.builder() .embeddingStore(embeddingStore) .embeddingModel(embeddingModel) .dynamicFilter(sqlFilterBuilder::build) .build(); String answer = assistant.answer("Recommend me a good drama from 90s"); // Forrest Gump ``` ## Which embedding store integrations will support `Filter`? In the long run, all (provided the embedding store itself supports it). In the first iteration, I aim to add support to just a few: - `InMemoryEmbeddingStore` - Elasticsearch - Milvus <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit ## Summary by CodeRabbit - New Features - Introduced filters for checking key's value existence in a collection for improved data handling. - Enhancements - Updated `InMemoryEmbeddingStoreTest` to extend a different class for improved testing coverage and added a new test method. - Refactor - Made minor formatting adjustments in the assertion block for better readability. - Documentation - Updated class hierarchy information for clarity. <!-- end of auto-generated comment: release notes by coderabbit.ai -->	2024-03-08 17:06:58 +01:00
LangChain4j	197b4af9d1	bumped version to 0.28.0-SNAPSHOT	2024-02-09 15:11:52 +01:00
LangChain4j	c1462c087f	release 0.27.1 (#621 )	2024-02-09 15:00:42 +01:00
LangChain4j	ad2fd90f32	bumped version to 0.28.0-SNAPSHOT	2024-02-09 08:12:28 +01:00
LangChain4j	a22d297104	Release 0.27.0 (#615 )	2024-02-09 08:00:34 +01:00
LangChain4j	8026f6013c	set langchain4j-core jacoco coverage 0.90	2024-02-05 14:43:20 +01:00
Antonio Goncalves	baac759766	Beautifying Maven output (#572 ) Looking at the Maven output I thought it could benefit from a little renaming. I just changed the `<name>` in the `pom.xml`, nothing more. The output is like this at the moment: ![Screenshot 2024-01-30 at 16 26 53](https://github.com/langchain4j/langchain4j/assets/729277/940886d1-565e-416f-a58e-91f609fc0c00) It could look like this if this PR is merged: ![Screenshot 2024-01-30 at 16 42 38](https://github.com/langchain4j/langchain4j/assets/729277/f8787af2-b869-4e95-90bd-72bce5622737) Just a personal taste. Let me know if you like it or not (or want to change it). If not, just discard it, it's fine ;o)	2024-01-30 16:54:54 +01:00
LangChain4j	fca8ca48f7	bump version to 0.27.0-SNAPSHOT	2024-01-30 16:18:40 +01:00
LangChain4j	3958e01738	release 0.26.1 (#570 )	2024-01-30 16:11:21 +01:00
LangChain4j	469699b944	bump version to 0.27.0-SNAPSHOT	2024-01-30 08:07:45 +01:00
LangChain4j	a8ad9e48d9	Automate release (#562 )	2024-01-30 07:20:20 +01:00
LangChain4j	14fb985de0	Foundation for advanced RAG (#538 ) So far, LangChain4j had only a simple (a.k.a., naive) RAG implementation: a single `Retriever` was invoked on each interaction with the LLM, and all retrieved `TextSegments` were appended to the end of the `UserMessage`. This approach was very limiting. This PR introduces support for much more advanced RAG use cases. The design and mental model are inspired by [this article](https://blog.langchain.dev/deconstructing-rag/) and [this paper](https://arxiv.org/abs/2312.10997), making it advisable to read the article. This PR introduces a `RetrievalAugmentor` interface responsible for augmenting a `UserMessage` with relevant content before sending it to the LLM. The `RetrievalAugmentor` can be used with both `AiServices` and `ConversationalRetrievalChain`, as well as stand-alone. A default implementation of `RetrievalAugmentor` (`DefaultRetrievalAugmentor`) is provided with the library and is suggested as a good starting point. However, users are not limited to it and can have more freedom with their own custom implementations. `DefaultRetrievalAugmentor` decomposes the entire RAG flow into more granular steps and base components: - `QueryTransformer` - `QueryRouter` - `ContentRetriever` (the old `Retriever` is now deprecated) - `ContentAggregator` - `ContentInjector` This modular design aims to separate concerns and simplify development, testing, and evaluation. Most (if not all) currently known and proven RAG techniques can be represented as one or multiple base components listed above. Here is how the decomposed RAG flow can be visualized: ![advanced-rag](https://github.com/langchain4j/langchain4j/assets/132277850/b699077d-dabf-4768-a241-3fcd9ab0286c) This mental and software model aims to simplify the thinking, reasoning, and implementation of advanced RAG flows. Each base component listed above has a sensible and simple default implementation configured in `DefaultRetrievalAugmentor` by default but can be overridden by more sophisticated implementations (provided by the library out-of-the-box) as well as custom ones. The list of implementations is expected to grow over time as we discover new techniques and implement existing proven ones. This PR also introduces out-of-the-box support for the following proven RAG techniques: - Query expansion - Query compression - Query routing using LLM - [Reciprocal Rank Fusion](https://learn.microsoft.com/en-us/azure/search/hybrid-search-ranking) - Re-ranking ([Cohere Rerank](https://docs.cohere.com/docs/reranking) integration is coming in a [separate PR](https://github.com/langchain4j/langchain4j/pull/539)).	2024-01-26 16:25:24 +01:00
Crutcher Dunnavant	2880fb023e	Unify langchang4j-core coverage to a 95% requirement. (#509 )	2024-01-15 18:43:56 +01:00
Crutcher Dunnavant	9e251fd7a3	Tighten coverage ratchet. (#483 )	2024-01-15 09:43:29 +01:00
Crutcher Dunnavant	c45bc235d4	Add jacoco coverage targets to langchain4j-core, attached to the `verify` stage. (#474 )	2024-01-09 11:13:43 +01:00
LangChain4j	7e5e82b7b2	updated to 0.26.0-SNAPSHOT	2023-12-22 18:08:19 +01:00
LangChain4j	2a5308b794	released 0.25.0	2023-12-22 18:02:04 +01:00
LangChain4j	b04a502a1d	Replace Mustache templates with simple implementation to reduce transitive dependencies (#408 ) One can provide their own prompt template implementation via SPI if needed.	2023-12-22 14:24:00 +01:00
LangChain4j	e1dddb33a2	bumped version to 0.25.0-SNAPSHOT (#369 )	2023-12-19 13:03:48 +01:00
deep-learning-dynamo	3882133f6e	moved EmbeddingStoreIngestor from langchain4j to langchain4j-core (#285 )	2023-12-14 16:33:02 +01:00
deep-learning-dynamo	e467beb64a	reducing duplication of *EmbeddingStoreIT	2023-11-18 19:20:26 +01:00
deep-learning-dynamo	00c6068de3	reducing duplication of *EmbeddingStoreIT	2023-11-18 18:29:44 +01:00
deep-learning-dynamo	16f60dbef9	reducing duplication of *EmbeddingStoreIT	2023-11-18 16:23:29 +01:00
deep-learning-dynamo	21dfc8b317	released 0.24.0	2023-11-12 18:58:31 +01:00
deep-learning-dynamo	315eab8641	released 0.23.0	2023-09-29 14:27:51 +02:00
deep-learning-dynamo	6b3a3eac7e	added missing license info	2023-09-28 18:21:01 +02:00
deep-learning-dynamo	c1cc5be1c7	released 0.22.0	2023-08-29 19:21:56 +02:00
kuraleta	88b56778f4	Integration with Google Vertex AI (#135 )	2023-08-28 21:30:18 +02:00
deep-learning-dynamo	db1f236ed2	released 0.21.0	2023-08-19 15:57:39 +02:00
jiangsier-xyz	d908f5158a	Integrate the Qwen series models via dashscope-sdk. (#99 ) Qwen series models are provided by Alibaba Cloud. They are much better in Asia languages then other LLMs. DashScope is a model service platform. Qwen models are its primary supported models. But it also supports other series like LLaMA2, Dolly, ChatGLM, BiLLa(based on LLaMA)...These may be integrated sometime in the future.	2023-08-18 20:49:50 +02:00
deep-learning-dynamo	d7b96ca9a6	released 0.20.0	2023-08-14 00:44:07 +02:00
deep-learning-dynamo	1541f214c1	released 0.19.0	2023-08-10 14:34:21 +02:00
deep-learning-dynamo	d4fca658c1	released 0.18.0	2023-07-26 21:19:24 +02:00
LangChain4j	529ef6b647	Added in-process embedding models (#41 ) - all-minilm-l6-v2 - all-minilm-l6-v2-q - e5-small-v2 - e5-small-v2-q The idea is to give users an option to embed documents/texts in the same Java process without any external dependencies. ONNX Runtime is used to run models inside JVM. Each model resides in it's own maven module (inside the jar).	2023-07-23 19:05:13 +02:00
deep-learning-dynamo	1976560aeb	released 0.16.0	2023-07-18 10:49:43 +02:00
deep-learning-dynamo	e439f96466	released 0.15.0	2023-07-18 00:13:08 +02:00
deep-learning-dynamo	14185653c7	released 0.14.0	2023-07-16 12:15:31 +02:00
deep-learning-dynamo	120c6a01d8	released 0.13.0	2023-07-15 17:53:10 +02:00
deep-learning-dynamo	52b7c3b441	released a hotfix for https://github.com/langchain4j/langchain4j/issues/23	2023-07-14 19:18:47 +02:00
Julien Perrochet	c451a220d9	[build] Introduce a parent pom (#15 ) Have a parent pom that contains most/all common things for the sub-projects. Note that it is separate from the root aggregator pom: not mixing the aggregator and the parents makes things slightly easier. If this change makes it harder to do releases, there might be a possibility to generate the effective poms for each subproject, but on the other hand releasing everything should not be too problematic.	2023-07-13 22:59:25 +02:00
deep-learning-dynamo	17654e31d0	released 0.11.0	2023-07-11 20:50:57 +02:00
Julien Perrochet	0534ec91e4	[CI] automated license check as part of CI (Apache 2.0/MIT/Eclipse) (#14 ) The title says it all. Relying on [this maven plugin](https://github.com/chonton/license-maven-plugin) for it. Note that this adds a separate build step because we need a more recent JDK to run the needed plugin.	2023-07-07 09:27:44 +02:00
deep-learning-dynamo	d645a8d5c7	released 0.10.0	2023-07-05 18:55:20 +02:00
deep-learning-dynamo	721a330228	released 0.9.0	2023-07-03 15:12:43 +02:00
deep-learning-dynamo	acb1e641c0	released 0.8.0	2023-07-02 23:13:13 +02:00
deep-learning-dynamo	c2d9298ce1	tests were not running during maven build facepalm	2023-07-02 22:07:27 +02:00
deep-learning-dynamo	6e85f7f06c	- added support for tools - released 0.7.0	2023-07-01 18:33:50 +02:00
deep-learning-dynamo	fa9646145d	Released 0.6.0	2023-06-29 22:15:38 +02:00

1 2

52 Commits