langchain4j

Commit Graph

Author	SHA1	Message	Date
Julien Dubois	d546c64511	Support for GitHub Models using the Azure AI Inference API (#1807 ) Fix #1719 This adds GitHub Models (see https://github.com/marketplace/models ) support with the new Azure AI Inference API Java SDK.	2024-09-24 19:18:42 +02:00
LangChain4j	42c958a458	Extract HtmlTextExtractor into its own module (#1811 ) ## Issue Closes #1049 ## Change Extracted `HtmlTextExtractor` into `langchain4j-document-transformer-jsoup` module. Renamed `HtmlToTextDocumentTransformer` into `HtmlTextExtractor`. Please import: ```xml <dependency> <groupId>dev.langchain4j</groupId> <artifactId>langchain4j-document-transformer-jsoup</artifactId> <version>0.35.0</version> </dependency> ``` ## General checklist - [ ] There are no breaking changes - [ ] I have added unit and integration tests for my change - [X] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [X] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green - [X] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) - [ ] I have added/updated [Spring Boot starter(s)](https://github.com/langchain4j/langchain4j-spring) (if applicable)	2024-09-24 15:08:23 +02:00
Martin7-1	9e63004370	Integration with Voyage (#1816 ) ## Issue Closes #1814 and #1813 ## Change 1. Integration `EmbeddingModel` and `ScoringModel` with `Voyage`. 2. Add document about `VoyageEmbeddingModel` and `VoyageScoringModel` Related PR: 1. [Voyage Example](https://github.com/langchain4j/langchain4j-examples/pull/109) 2. [Voyage Spring Boot](https://github.com/langchain4j/langchain4j-spring/pull/42) ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [x] There are no breaking changes - [x] I have added unit and integration tests for my change - [x] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [x] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [x] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [x] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) - [x] I have added/updated [Spring Boot starter(s)](https://github.com/langchain4j/langchain4j-spring) (if applicable) ## Checklist for adding new model integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml)	2024-09-24 08:59:01 +02:00
Guillaume Laforge	6bd851fefc	fix(Google AI Gemini) — Fixed wrong mapping between function execution requests and function execution responses. (#1802 ) Added a test using `AiServices`. Function execution requests were mapped to function execution responses.	2024-09-23 12:29:57 +02:00
1758225523	1c0671617d	add scoring onnx (#1769 ) ## Issue Closes #1549 ## Change OnnxScoringModel similar to OnnxEmbeddingModel ## General checklist - [X] There are no breaking changes - [X] I have added unit and integration tests for my change - [X] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [X] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green ## Checklist for adding new model integration - [X] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml)	2024-09-20 12:18:45 +02:00
ScriptShi	be7454a7c6	Add langchain4j-tablestore Integration: TablestoreEmbeddingStore/TablestoreChatMemoryStore (#1650 ) ## Change Add langchain4j-tablestore Integration: TablestoreEmbeddingStore / TablestoreChatMemoryStore ## General checklist - [x] There are no breaking changes - [x] I have added unit and integration tests for my change - [x] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [x] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [x] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) - [ ] I have added/updated [Spring Boot starter(s)](https://github.com/langchain4j/langchain4j-spring) (if applicable) ## Checklist for adding new embedding store integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added a `{NameOfIntegration}EmbeddingStoreIT` that extends from either `EmbeddingStoreIT` or `EmbeddingStoreWithFilteringIT` - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml)	2024-09-18 11:41:53 +02:00
LangChain4j	21d35e4434	changed version to 0.35.0-SNAPSHOT	2024-09-09 10:11:09 +02:00
LangChain4j	b0a8e6f45b	Release 0.34.0 (#1711 )	2024-09-05 16:49:39 +02:00
Guillaume Laforge	67898076d6	#1363 — Add support for the Gemini AI model (#1695 ) ## Issue Closes #1363 ## Change New module for the Google AI Gemini ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [x] There are no breaking changes - [x] I have added unit and integration tests for my change - [x] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [ ] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [ ] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) - [ ] I have added/updated [Spring Boot starter(s)](https://github.com/langchain4j/langchain4j-spring) (if applicable) ## Checklist for adding new model integration - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml)	2024-09-04 15:39:15 +02:00
Dmitrii Chechetkin	56d8d40875	Adds couchbase vector storage (#1482 ) ## Issue <!-- Please specify the ID of the issue this PR is addressing. For Closes #1481 ## Change Adds couchbase vector storage. ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [x] There are no breaking changes - [x] I have added unit and integration tests for my change - [x] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [x] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [x] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [x] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) - [ ] I have added/updated [Spring Boot starter(s)](https://github.com/langchain4j/langchain4j-spring) (if applicable) ## Checklist for adding new model integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) ## Checklist for adding new embedding store integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added a `{NameOfIntegration}EmbeddingStoreIT` that extends from either `EmbeddingStoreIT` or `EmbeddingStoreWithFilteringIT` - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml)	2024-09-02 15:11:36 +02:00
Michael McMahon	254dc79bad	Oracle Database Embedding Store (#1490 ) ## Issue Closes #1091 ## Change Added an `EmbeddingStore` integration for Oracle Database. ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [X] There are no breaking changes - [X] I have added unit and integration tests for my change - [X] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [X] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [ ] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) - [ ] I have added/updated [Spring Boot starter(s)](https://github.com/langchain4j/langchain4j-spring) (if applicable) ## Checklist for adding new model integration <!-- Please double-check the following points and mark them like this: [X] --> - [ ] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) ## Checklist for adding new embedding store integration <!-- Please double-check the following points and mark them like this: [X] --> - [X] I have added a `{NameOfIntegration}EmbeddingStoreIT` that extends from either `EmbeddingStoreIT` or `EmbeddingStoreWithFilteringIT` - [X] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) --------- Signed-off-by: Michael McMahon <michael.a.mcmahon@oracle.com> Co-authored-by: psilberk <pablo.silberkasten@oracle.com> Co-authored-by: Pablo Silberkasten <47338417+psilberk@users.noreply.github.com> Co-authored-by: LangChain4j <langchain4j@gmail.com> Co-authored-by: Fernanda Meheust <fernanda.meheust@oracle.com> Co-authored-by: Eddú Meléndez Gonzales <eddu.melendez@gmail.com>	2024-08-27 10:38:29 +02:00
Felipe Zambrin	99ed696f05	[FEATURE] Adds SearchApi as WebSearchEngine and Tool (#1216 ) ## Issue Closes #1132 ## Change Added SearchApi as a WebSearchEngine that also can be used as a tool. Currently using Google Search as default engine. It also allows for new engines to be implemented using the SearchApiRequestResponseHandler interface, and adding it to the SearchApiEngine enum so the user can choose which one to use. ## General checklist - [X] There are no breaking changes - [X] I have added unit and integration tests for my change - [x] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [ ] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green - [X] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [X] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) * The example is in the docs, I will open a new PR to the examples repo if it is ok @algora-pbc /claim #1132	2024-08-22 11:05:39 +02:00
Yellow--	cdaccd3547	support run integration tests for changed and dependent modules (#1185 ) ## Issue Closes #1000 ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [X] There are no breaking changes - [ ] I have added unit and integration tests for my change - [X] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [ ] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [ ] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features)	2024-08-16 13:52:42 +02:00
LangChain4j	1cccfdfa65	changed version to 0.34.0-SNAPSHOT	2024-07-26 15:12:26 +02:00
LangChain4j	822f09cb1c	Release 0.33.0 (#1514 )	2024-07-25 10:12:20 +02:00
Stéphane Philippart	4ef7d0a033	feat: ✨ add OVHcloud embedding models (#1355 ) ## Change With this PR I added the OVHcloud embedding models (see https://endpoints.ai.cloud.ovh.net for more details). ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [X] There are no breaking changes - [X] I have added unit and integration tests for my change - [X] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [X] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [x] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [x] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) - [ ] I have added/updated [Spring Boot starter(s)](https://github.com/langchain4j/langchain4j-spring) (if applicable) ## Checklist for adding new model integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml)	2024-07-23 16:57:34 +02:00
LangChain4j	fe50c88e77	changed version to 0.33.0-SNAPSHOT	2024-07-08 14:47:07 +02:00
LangChain4j	c2366a226c	Release 0.32.0 (#1409 )	2024-07-04 12:04:29 +02:00
Jake Luciani	ed21f61daf	Add Jlama support (#1379 ) ## Issue Implements https://github.com/langchain4j/langchain4j/issues/1350 ## Change Adds support for Jlama engine ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [X] There are no breaking changes - [X] I have added unit and integration tests for my change - [X] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [X] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [X] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) - [ ] I have added/updated [Spring Boot starter(s)](https://github.com/langchain4j/langchain4j-spring) (if applicable) ## Checklist for adding new model integration <!-- Please double-check the following points and mark them like this: [X] --> - [X] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml)	2024-07-02 09:20:46 +02:00
Cedrick Lunven	d1e4bdd634	Add support for Workers AI provider by CloudFlare (#1262 ) This PR introduces the different models implements by CloudFlare WorkersAI https://developers.cloudflare.com/workers-ai/ And especially: - [Text Embeddings](https://developers.cloudflare.com/workers-ai/models/#text-embeddings) - [Text Generation](https://developers.cloudflare.com/workers-ai/models/#text-generation) - [Text-to-Image](https://developers.cloudflare.com/workers-ai/models/#text-to-image)	2024-06-13 15:44:30 +02:00
Vadym Zhytkevych	36c175190f	Draft: Add Selenium document loader (#1166 ) ## Change <!-- Please describe the changes you made. --> This change adds the document loading process by integrating Selenium web automation. Unlike the existing UrlDocumentLoader, this method captures the complete version of a webpage, including all post-redirect content and dynamically loaded elements via JavaScript. This ensures that the full content is retrieved, providing a more accurate representation of the page as rendered in a browser. ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [x] There are no breaking changes - [x] I have added unit and integration tests for my change - [x] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [ ] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [ ] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) ## Checklist for adding new model integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) ## Checklist for adding new embedding store integration <!-- Please double-check the following points and mark them like this: [X] --> - [ ] I have added a `{NameOfIntegration}EmbeddingStoreIT` that extends from either `EmbeddingStoreIT` or `EmbeddingStoreWithFilteringIT` - [ ] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) ## Checklist for changing existing embedding store integration <!-- Please double-check the following points and mark them like this: [X] --> - [ ] I have manually verified that the `{NameOfIntegration}EmbeddingStore` works correctly with the data persisted using the latest released version of LangChain4j	2024-05-27 11:23:07 +02:00
LangChain4j	a1b733d96d	bumped version to 0.32.0-SNAPSHOT	2024-05-24 16:25:13 +02:00
LangChain4j	d9cb1e9b81	Release 0.31.0 (#1151 )	2024-05-23 17:40:52 +02:00
LangChain4j	240d34df89	Adds an embedding store for Azure Cosmos DB for NoSQL (#1115 )	2024-05-23 11:23:55 +02:00
Aayush Kataria	9e382d4f33	Adds an embedding store for Azure Cosmos DB for NoSQL (#1115 ) ## Issue This PR add supports for Azure Cosmos DB for NoSQL embedding store. ## Change - This PR adds an embedding store for Azure Cosmos DB for NoSql. The test cases and IT test case is also included. ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [ ] There are no breaking changes - [x] I have added unit and integration tests for my change - [x] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [ ] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [ ] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) ## Checklist for adding new model integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) ## Checklist for adding new embedding store integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added a `{NameOfIntegration}EmbeddingStoreIT` that extends from either `EmbeddingStoreIT` or `EmbeddingStoreWithFilteringIT` - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) ## Checklist for changing existing embedding store integration <!-- Please double-check the following points and mark them like this: [X] --> - [ ] I have manually verified that the `{NameOfIntegration}EmbeddingStore` works correctly with the data persisted using the latest released version of LangChain4j	2024-05-23 11:06:11 +02:00
LangChain4j	050e93b689	Jina AI Embedding model integration (#997 )	2024-05-22 13:52:09 +02:00
Pankaj Shukla	0ad92d5b0c	Jina AI Embedding model integration (#997 ) ## Context This pr is for integration of jina ai embedding model which is mentioned in the issue [973](https://github.com/langchain4j/langchain4j/issues/973) ## Change 1. Since no jina sdk was available for java hence built client for the same . 2. Added method of embedding generation for both single input and multiple inputs 3. Default model used _jina-embeddings-v2-base-en_ ## Checklist Before submitting this PR, please check the following points: - [x] I have added unit and integration tests for my change - [x] All unit and integration tests in the module I have added/changed are green - [x] All unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules are green - [ ] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) - [ ] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) (only when a new module is added) ## Checklist for adding new embedding store integration - [ ] I have added a {NameOfIntegration}EmbeddingStoreIT that extends from either EmbeddingStoreIT or EmbeddingStoreWithFilteringIT	2024-05-22 13:36:16 +02:00
LangChain4j	504aa173df	Experimental: RAG: SQL database content retriever (#1056 ) ## Issue https://github.com/langchain4j/langchain4j/issues/232 ## Change An experimental `SqlDatabaseContentRetriever` has been added. Simplest usage example: ```java ContentRetriever contentRetriever = SqlDatabaseContentRetriever.builder() .dataSource(dataSource) .chatLanguageModel(openAiChatModel) .build(); ``` In this case SQL dialect and table structure will be determined from the `DataSource`. But it can be customized: ```java ContentRetriever contentRetriever = SqlDatabaseContentRetriever.builder() .dataSource(dataSource) .sqlDialect("PostgreSQL") .databaseStructure(...) .promptTemplate(...) .chatLanguageModel(openAiChatModel) .maxRetries(2) .build(); ``` See `SqlDatabaseContentRetrieverIT` for a full example. ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [X] There are no breaking changes - [X] I have added unit and integration tests for my change - [X] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [X] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [ ] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [ ] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) ## Checklist for adding new model integration <!-- Please double-check the following points and mark them like this: [X] --> - [X] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml)	2024-05-21 16:49:02 +02:00
kuraleta	3457784a29	Tavily Web Search Engine (#676 )	2024-05-21 16:10:59 +02:00
LangChain4j	3897e565cc	[FEATURE] Google web search integration (#641 )	2024-05-21 15:37:30 +02:00
Carlos Zela Bueno	43274ff465	[FEATURE] Google web search integration (#641 ) Integrating [Google Custom Search](https://developers.google.com/custom-search) as a `WebSearchEngine` and as a `Tool` for function calling	2024-05-21 14:05:21 +02:00
Antonio Goncalves	b0f0a7bdef	Adding missing -azure-ai-search and azure-cosmos-mongo-vcore to the BOM (#1127 ) I've adding two missing artefacts in the BOM (azure-ai-search and azure-cosmos-mongo-vcore). And to make it easier to spot, I've sorted the `pom.xml` files by artifact id	2024-05-21 10:39:29 +02:00
Mohamed AIT ABDERRAHMAN	b27fd4d6ac	Extract Judge0 code execution engine as module (#1051 ) ## Issue https://github.com/langchain4j/langchain4j/issues/1048 ## Change I extract these classes as new module `langchain4j-code-execution-engine-judge0` : - `Judge0JavaScriptEngine` - `JavaScriptCodeFixer` - `Judge0JavaScriptExecutionTool` - `JavaScriptCodeFixerTest` and I moved the `com.squareup.okhttp3:okhttp` dependency from the main module to that new one. ## General checklist <!-- Please double-check the following points and mark them like this: [X] --> - [x] There are no breaking changes - [x] I have added unit and integration tests for my change - [x] I have manually run all the unit and integration tests in the module I have added/changed, and they are all green - [x] I have manually run all the unit and integration tests in the [core](https://github.com/langchain4j/langchain4j/tree/main/langchain4j-core) and [main](https://github.com/langchain4j/langchain4j/tree/main/langchain4j) modules, and they are all green <!-- Before adding documentation and example(s) (below), please wait until the PR is reviewed and approved. --> - [x] I have added/updated the [documentation](https://github.com/langchain4j/langchain4j/tree/main/docs/docs) - [x] I have added an example in the [examples repo](https://github.com/langchain4j/langchain4j-examples) (only for "big" features) ## Checklist for adding new model integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) ## Checklist for adding new embedding store integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have added a `{NameOfIntegration}EmbeddingStoreIT` that extends from either `EmbeddingStoreIT` or `EmbeddingStoreWithFilteringIT` - [x] I have added my new module in the [BOM](https://github.com/langchain4j/langchain4j/blob/main/langchain4j-bom/pom.xml) ## Checklist for changing existing embedding store integration <!-- Please double-check the following points and mark them like this: [X] --> - [x] I have manually verified that the `{NameOfIntegration}EmbeddingStore` works correctly with the data persisted using the latest released version of LangChain4j	2024-05-21 08:47:25 +02:00
LangChain4j	66c338c135	changed version to 0.31.0-SNAPSHOT	2024-04-29 11:21:00 +02:00
LangChain4j	1a340893ec	Release 0.30.0 (#945 )	2024-04-16 18:21:01 +02:00
LangChain4j	d1d9b45adc	bumped to 0.30.0-SNAPSHOT	2024-04-08 17:36:52 +02:00
LangChain4j	45b58ac993	released 0.29.1 (#857 )	2024-03-28 16:42:45 +01:00
LangChain4j	d1e3cc1693	Release 0.29.0 (#830 )	2024-03-26 11:54:43 +01:00
Aayush Kataria	7e5025c5de	Adding Azure cosmos mongo vCore as a new Embedding Store (#691 ) I have added Azure Cosmos DB Mongo vCore as an Embedding Store to be used by customers using LangChain4J and Cosmos DB for their LLM applications.	2024-03-22 14:51:11 +01:00
LangChain4j	2f425da9f7	POC: Easy RAG (#686 ) Implementing RAG applications is hard. Especially for those who are just getting started exploring LLMs and RAG. This PR introduces an "Easy RAG" feature that should help developers to get started with RAG as easy as possible. With it, there is no need to learn about chunking/splitting/segmentation, embeddings, embedding models, vector databases, retrieval techniques and other RAG-related concepts. This is similar to how one can simply upload one or multiple files into [OpenAI Assistants API](https://platform.openai.com/docs/assistants/overview) and the LLM will automagically know about their contents when answering questions. Easy RAG is using local embedding model running in your CPU (GPU support can be added later). Your files are ingested into an in-memory embedding store. Please note that "Easy RAG" will not replace manual RAG setups and especially [advanced RAG techniques](https://github.com/langchain4j/langchain4j/pull/538), but will provide an easier way to get started with RAG. The quality of an "Easy RAG" should be sufficient for demos, proof of concepts and for getting started. To use "Easy RAG", simply import `langchain4j-easy-rag` dependency that includes everything needed to do RAG: - Apache Tika document loader (to parse all document types automatically) - Quantized [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5) in-process embedding model which has an impressive (for it's size) 51.68 [score](https://huggingface.co/spaces/mteb/leaderboard) for retrieval Here is the proposed API: ```java List<Document> documents = FileSystemDocumentLoader.loadDocuments(directoryPath); // one can also load documents recursively and filter with glob/regex EmbeddingStore<TextSegment> embeddingStore = new InMemoryEmbeddingStore<>(); // we will use an in-memory embedding store for simplicity EmbeddingStoreIngestor.ingest(documents, embeddingStore); Assistant assistant = AiServices.builder(Assistant.class) .chatLanguageModel(model) .contentRetriever(EmbeddingStoreContentRetriever.from(embeddingStore)) .build(); String answer = assistant.chat("Who is Charlie?"); // Charlie is a carrot... ``` `FileSystemDocumentLoader` in the above code loads documents using `DocumentParser` available in classpath via SPI, in this case an `ApacheTikaDocumentParser` imported with the `langchain4j-easy-rag` dependency. The `EmbeddingStoreIngestor` in the above code: - splits documents into smaller text segments using a `DocumentSplitter` loaded via SPI from the `langchain4j-easy-rag` dependency. Currently it uses `DocumentSplitters.recursive(300, 30, new HuggingFaceTokenizer())` - embeds text segments using an `AllMiniLmL6V2QuantizedEmbeddingModel` loaded via SPI from the `langchain4j-easy-rag` dependency - stores text segments and their embeddings into the specified embedding store When using `InMemoryEmbeddingStore`, one can serialize/persist it into a JSON string on into a file. This way one can skip loading documents and embedding them on each application run. It is easy to customize the ingestion in the above code, just change ```java EmbeddingStoreIngestor.ingest(documents, embeddingStore); ``` into ```java EmbeddingStoreIngestor ingestor = EmbeddingStoreIngestor.builder() //.documentTransformer(...) // you can optionally transform (clean, enrich, etc) documents before splitting //.documentSplitter(...) // you can optionally specify another splitter //.textSegmentTransformer(...) // you can optionally transform (clean, enrich, etc) segments before embedding //.embeddingModel(...) // you can optionally specify another embedding model to use for embedding .embeddingStore(embeddingStore) .build(); ingestor.ingest(documents) ``` Over time, we can add an auto-eval feature that will find the most suitable hyperparametes for a given documents (e.g. which embedding model to use, which splitting method, possibly advanced RAG techniques, etc.) so that "easy RAG" can be comparable to the "advanced RAG". Related: https://github.com/langchain4j/langchain4j-embeddings/pull/16 --------- Co-authored-by: dliubars <dliubars@redhat.com>	2024-03-21 17:37:38 +01:00
LangChain4j	fcded2a81c	Revert "Delombok before JavaDoc. (#595 )" This reverts commit `3af1fe5ea2`.	2024-03-16 08:55:46 +01:00
LangChain4j	91db3d354a	bumped to 0.29.0-SNAPSHOT	2024-03-14 13:31:28 +01:00
LangChain4j	90fe3040b9	released 0.28.0 (#735 )	2024-03-11 20:08:55 +01:00
kuraleta	a46fd20807	WIP: integration with Anthropic (#727 ) <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit - New Features - Introduced classes and interfaces to facilitate chat interactions with the Anthropic API, enabling chat completion functionalities. - Developed a client class for seamless interaction with the AnthropicAI API, including authentication and request handling. - Implemented utility methods for message conversion and managing token usage in chat interactions. - Defined an enum to distinguish between user and assistant roles in chat scenarios. - Added logging interceptors for HTTP requests and responses to enhance debugging capabilities. - Created a model class for generating AI responses from chat messages using the Anthropic API. - Added a request model class for creating messages in the Anthropic system. - Introduced a class for representing image content with type and source details. - Included integration tests for the `AnthropicChatModel` class covering various functionalities. <!-- end of auto-generated comment: release notes by coderabbit.ai -->	2024-03-11 15:32:34 +01:00
二毛	8fcd1926db	add ZhipuAI integration (#558 ) ZhipuAI is a large model focusing on Chinese cognition <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit - New Features - Introduced serialization and deserialization for assistant messages. - Enhanced utility methods for data conversion in Zhipu AI processing. - Implemented JSON serialization/deserialization support. - Defined an interface for Zhipu AI service interactions. - Introduced classes for handling chat completions and embedding requests with Zhipu AI. - Provided structure for chat messages, choices, models, requests, and responses. - Added classes for function calls, parameters, tool interactions, and web searches within chats. - Established data structures for embedding information and requests. - Implemented builders for chat and embedding model instances. - Tests - Added integration tests for chat model, embedding model, and streaming chat model functionalities. <!-- end of auto-generated comment: release notes by coderabbit.ai -->	2024-03-11 07:58:26 +01:00
LangChain4j	1acb7a607f	EmbeddingStore (Metadata) Filter API (#610 ) ## New EmbeddingStore (metadata) `Filter` API Many embedding stores, such as [Pinecone](https://docs.pinecone.io/docs/metadata-filtering) and [Milvus](https://milvus.io/docs/boolean.md) support strict filtering (think of an SQL "WHERE" clause) during similarity search. So, if one has an embedding store with movies, for example, one could search not only for the most semantically similar movies to the given user query but also apply strict filtering by metadata fields like year, genre, rating, etc. In this case, the similarity search will be performed only on those movies that match the filter expression. Since LangChain4j supports (and abstracts away) many embedding stores, there needs to be an embedding-store-agnostic way for users to define the filter expression. This PR introduces a `Filter` interface, which can represent both simple (e.g., `type = "documentation"`) and composite (e.g., `type in ("documentation", "tutorial") AND year > 2020`) filter expressions in an embedding-store-agnostic manner. `Filter` currently supports the following operations: - Comparison: - `IsEqualTo` - `IsNotEqualTo` - `IsGreaterThan` - `IsGreaterThanOrEqualTo` - `IsLessThan` - `IsLessThanOrEqualTo` - `IsIn` - `IsNotIn` - Logical: - `And` - `Not` - `Or` These operations are supported by most embedding stores and serve as a good starting point. However, the list of operations will expand over time to include other operations (e.g., `Contains`) supported by embedding stores. Currently, the DSL looks like this: ```java Filter onlyDocs = metadataKey("type").isEqualTo("documentation"); Filter docsAndTutorialsAfter2020 = metadataKey("type").isIn("documentation", "tutorial").and(metadataKey("year").isGreaterThan(2020)); // or Filter docsAndTutorialsAfter2020 = and( metadataKey("type").isIn("documentation", "tutorial"), metadataKey("year").isGreaterThan(2020) ); ``` ## Filter expression as a `String` Filter expression can also be specified as a `String`. This might be necessary, for example, if the filter expression is generated dynamically by the application or by the LLM (as in [self querying](https://python.langchain.com/docs/modules/data_connection/retrievers/self_query/)). This PR introduces a `FilterParser` interface with a simple `Filter parse(String)` API, allowing for future support of multiple syntaxes (if this will be required). For the out-of-the-box filter syntax, ANSI SQL's `WHERE` clause is proposed as a suitable candidate for several reasons: - SQL is well-known among Java developers - There is extensive tooling available for SQL (e.g., parsers) - LLMs are pretty good at generating valid SQL, as there are tons of SQL queries on the internet, which are included in the LLM training datasets. There are also specialized LLMs that are trained for text-to-SQL task, such as [SQLCoder](https://huggingface.co/defog). The downside is that SQL's `WHERE` clause might not support all operations and data types that could be supported in the future by various embedding stores. In such case, we could extend it to a superset of ANSI SQL `WHERE` syntax and/or provide an option to express filters in the native syntax of the store. An out-of-the-box implementation of the SQL `FilterParser` is provided as a `SqlFilterParser` in a separate module `langchain4j-embedding-store-filter-parser-sql`, using [JSqlParser](https://github.com/JSQLParser/JSqlParser) under the hood. `SqlFilterParser` can parse SQL "SELECT" (or just "WHERE" clause) statement into a `Filter` object: - `SELECT * FROM fake_table WHERE userId = '123-456'` -> `metadataKey("userId").isEqualTo("123-456")` - `userId = '123-456'` -> `metadataKey("userId").isEqualTo("123-456")` It can also resolve `CURDATE()` and `CURRENT_DATE`/`CURRENT_TIME`/`CURRENT_TIMESTAMP`: `SELECT * FROM fake_table WHERE year = EXTRACT(YEAR FROM CURRENT_DATE` -> `metadataKey("year").isEqualTo(LocalDate.now().getYear())` ## Changes in `Metadata` API Until now, `Metadata` supported only `String` values. This PR expands the list of supported value types to `Integer`, `Long`, `Float` and `Double`. In the future, more types may be added (if needed). The method `String get(String key)` will be deprecated later in favor of: - `String getString(String key)` - `Integer getInteger(String key)` - `Long getLong(String key)` - etc New overloaded `put(key, value)` methods are introduced to support more value types: - `put(String key, int value)` - `put(String key, long value)` - etc ## Changes in `EmbeddingStore` API New method `search` is added that will become the main entry point for search in the future. All `findRelevant` methods will be deprecated later. New `search` method accepts `EmbeddingSearchRequest` and returns `EmbeddingSearchResult`. `EmbeddingSearchRequest` contains all search criteria (e.g. `maxResults`, `minScore`), including new `Filter`. `EmbeddingSearchResult` contains a list of `EmbeddingMatch`. ```java EmbeddingSearchResult search(EmbeddingSearchRequest request); ``` ## Changes in `EmbeddingStoreContentRetriever` API `EmbeddingStoreContentRetriever` can now be configured with a static `filter` as well as dynamic `dynamicMaxResults`, `dynamicMinScore` and `dynamicFilter` in the builder: ```java ContentRetriever contentRetriever = EmbeddingStoreContentRetriever.builder() .embeddingStore(embeddingStore) .embeddingModel(embeddingModel) ... .maxResults(3) // or .dynamicMaxResults(query -> 3) // You can define maxResults dynamically. The value could, for example, depend on the query or the user associated with the query. ... .minScore(0.3) // or .dynamicMinScore(query -> 0.3) ... .filter(metadataKey("userId").isEqualTo("123-456")) // Assuming your TextSegments contain Metadata with key "userId" // or .dynamicFilter(query -> metadataKey("userId").isEqualTo(query.metadata().chatMemoryId().toString())) ... .build(); ``` So now you can define `maxResults`, `minScore` and `filter` both statically and dynamically (they can depend on the query, user, etc.). These values will be propagated to the underlying `EmbeddingStore`. ## ["Self-querying"](https://python.langchain.com/docs/modules/data_connection/retrievers/self_query/) This PR also introduces `LanguageModelSqlFilterBuilder` in `langchain4j-embedding-store-filter-parser-sql` module which can be used with `EmbeddingStoreContentRetriever`'s `dynamicFilter` to automatically build a `Filter` object from the `Query` using language model and `SqlFilterParser`. For example: ```java TextSegment groundhogDay = TextSegment.from("Groundhog Day", new Metadata().put("genre", "comedy").put("year", 1993)); TextSegment forrestGump = TextSegment.from("Forrest Gump", new Metadata().put("genre", "drama").put("year", 1994)); TextSegment dieHard = TextSegment.from("Die Hard", new Metadata().put("genre", "action").put("year", 1998)); // describe metadata keys as if they were columns in the SQL table TableDefinition tableDefinition = TableDefinition.builder() .name("movies") .addColumn("genre", "VARCHAR", "one of [comedy, drama, action]") .addColumn("year", "INT") .build(); LanguageModelSqlFilterBuilder sqlFilterBuilder = new LanguageModelSqlFilterBuilder(model, tableDefinition); ContentRetriever contentRetriever = EmbeddingStoreContentRetriever.builder() .embeddingStore(embeddingStore) .embeddingModel(embeddingModel) .dynamicFilter(sqlFilterBuilder::build) .build(); String answer = assistant.answer("Recommend me a good drama from 90s"); // Forrest Gump ``` ## Which embedding store integrations will support `Filter`? In the long run, all (provided the embedding store itself supports it). In the first iteration, I aim to add support to just a few: - `InMemoryEmbeddingStore` - Elasticsearch - Milvus <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit ## Summary by CodeRabbit - New Features - Introduced filters for checking key's value existence in a collection for improved data handling. - Enhancements - Updated `InMemoryEmbeddingStoreTest` to extend a different class for improved testing coverage and added a new test method. - Refactor - Made minor formatting adjustments in the assertion block for better readability. - Documentation - Updated class hierarchy information for clarity. <!-- end of auto-generated comment: release notes by coderabbit.ai -->	2024-03-08 17:06:58 +01:00
Crutcher Dunnavant	3af1fe5ea2	Delombok before JavaDoc. (#595 )	2024-03-05 16:17:14 +01:00
LangChain4j	197b4af9d1	bumped version to 0.28.0-SNAPSHOT	2024-02-09 15:11:52 +01:00
LangChain4j	c1462c087f	release 0.27.1 (#621 )	2024-02-09 15:00:42 +01:00
LangChain4j	ad2fd90f32	bumped version to 0.28.0-SNAPSHOT	2024-02-09 08:12:28 +01:00

1 2 3

123 Commits