Contains Retrieval Augmented Generation related utilities for Azure Machine Learning and OSS interoperability.

These details have not been verified by PyPI

Project links

Homepage

Project description

AzureML Retrieval Augmented Generation Utilities

This package is in alpha stage at the moment, use at risk of breaking changes and unstable behavior.

It contains utilities for:

Processing text documents into chunks appropriate for use in LLM prompts, with metadata such is source url.
Embedding chunks with OpenAI or HuggingFace embeddings models, including the ability to update a set of embeddings over time.
Create MLIndex artifacts from embeddings, a yaml file capturing metadata needed to deserialize different kinds of Vector Indexes for use in langchain. Supported Index types:
- FAISS index (via langchain)
- Azure Cognitive Search index

Getting started

You can install AzureMLs RAG package using pip.

pip install azureml-rag

There are various extra installs you probably want to include based on intended use:

faiss: When using FAISS based Vector Indexes
cognitive_search: When using Azure Cognitive Search Indexes
hugging_face: When using Sentence Transformer embedding models from HuggingFace (local inference)
document_parsing: When cracking and chunking documents locally to put in an Index

MLIndex

MLIndex files describe an index of data + embeddings and the embeddings model used in yaml.

embeddings:
  dimension: 768
  kind: hugging_face
  model: sentence-transformers/all-mpnet-base-v2
  schema_version: '2'
index:
  api_version: 2021-04-30-Preview
  connection:
    id: /subscriptions/<subscription_id>/resourceGroups/<resource_group>/providers/Microsoft.MachineLearningServices/workspaces/<workspace>/connections/<acs_connection_name>
  connection_type: workspace_connection
  endpoint: https://<acs_name>.search.windows.net
  engine: azure-sdk
  field_mapping:
    content: content
    filename: sourcefile
    metadata: meta_json_string
    title: title
    url: sourcepage
    embedding: content_vector_hugging_face
  index: azureml-rag-test-206e03b6-3880-407b-9bc4-c0a1162d6c70
  kind: acs

Create MLIndex

Examples using MLIndex remotely with AzureML and locally with langchain live here: https://github.com/Azure/azureml-examples/tree/main/sdk/python/generative-ai/rag

Consume MLIndex

from azureml.rag.mlindex import MLIndex

retriever = MLIndex(uri_to_folder_with_mlindex).as_langchain_retriever()
retriever.get_relevant_documents('What is an AzureML Compute Instance?')

Changelog

0.1.18

Add FaissAndDocStore and FileBasedDocStore which closely mirror langchains' FAISS and InMemoryDocStore without the langchain or pickle dependency. These are default not used until PromptFlow support has been added.
Pin azure-documents-search==11.4.0b6 as there's breaking changes in 11.4.0b7 and 11.4.0b8

0.1.17

Update interactions with Azure Cognitive Search to use latest azure-documents-search SDK

0.1.16

Convert api_type from Workspace Connections to lower case to appease langchains case sensitive checking.

0.1.15

Add support for custom loaders
Added logging for MLIndex.init to understand usage of MLindex

0.1.14

Add Support for CustomKeys connections
Add OpenAI support for QA Gen and Embeddings

0.1.13 (2023-07-12)

Implement single node non-PRS embed task to enable clearer logs for users.

0.1.12 (2023-06-29)

Fix casing check of ApiVersion, ApiType in infer_deployment util

0.1.11 (2023-06-28)

Update casing check for workspace connection ApiVersion, ApiType
int casting for temperature, max_tokens

0.1.10 (2023-06-26)

Update data asset registering to have adjustable output_type
Remove asset registering from generate_qa.py

0.1.9 (2023-06-22)

Add azureml.rag.data_generation module.
Fixed bug that would cause crack_and_chunk to fail for documents that contain non-utf-8 characters. Currently these characters will be ignored.
Improved heading extraction from Markdown files. When use_rcts=False Markdown files will be split on headings and each chunk with have the heading context up to the root as a prefix (e.g. # Heading 1\n## Heading 2\n# Heading 3\n{content})

0.1.8 (2023-06-21)

Add deployment inferring util for use in azureml-insider notebooks.

0.1.7 (2023-06-08)

Improved telemetry for tasks (used in RAG Pipeline Components)

0.1.6 (2023-05-31)

Fail crack_and_chunk task when no files were processed (usually because of a malformed input_glob)
Change update_acs.py to default push_embeddings=True instead of False.

0.1.5 (2023-05-19)

Add api_base back to MLIndex embeddings config for back-compat (until all clients start getting it from Workspace Connection).
Add telemetry for tasks used in pipeline components, not enabled by default for SDK usage.

0.1.4 (2023-05-17)

Fix bug where enabling rcts option on split_documents used nltk splitter instead.

0.1.3 (2023-05-12)

Support Workspace Connection based auth for Git, Azure OpenAI and Azure Cognitive Search usage.

0.1.2 (2023-05-05)

Refactored document chunking to allow insertion of custom processing logic

0.0.1 (2023-04-25)

Features Added

Introduced package
langchain Retriever for Azure Cognitive Search

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.2.36

Aug 14, 2024

0.2.35

Jul 25, 2024

0.2.34

Jun 18, 2024

0.2.33

May 31, 2024

0.2.32

May 16, 2024

0.2.31.1

May 13, 2024

0.2.31

May 7, 2024

0.2.30.2

May 1, 2024

0.2.30.1

Apr 30, 2024

0.2.30

Apr 24, 2024

0.2.29.2

Apr 16, 2024

0.2.29.1

Apr 12, 2024

0.2.29

Apr 11, 2024

0.2.28

Apr 8, 2024

0.2.27

Apr 1, 2024

0.2.26

Mar 6, 2024

0.2.25

Feb 9, 2024

0.2.24.2

Feb 1, 2024

0.2.24.1

Jan 11, 2024

0.2.24

Jan 9, 2024

0.2.23.5

Dec 30, 2023

0.2.23.4

Dec 29, 2023

0.2.23.3

Dec 14, 2023

0.2.23.2

Dec 13, 2023

0.2.23.1

Dec 7, 2023

0.2.23

Dec 6, 2023

0.2.22

Nov 22, 2023

0.2.21

Nov 21, 2023

0.2.20

Nov 17, 2023

0.2.18.1

Nov 9, 2023

0.2.18

Nov 8, 2023

0.2.17

Oct 31, 2023

0.2.15.1

Oct 25, 2023

0.2.15

Oct 24, 2023

0.2.14

Oct 19, 2023

0.2.13

Oct 18, 2023

0.2.12

Oct 17, 2023

0.2.11

Oct 17, 2023

0.2.10

Oct 10, 2023

0.2.9

Oct 3, 2023

0.2.8

Oct 2, 2023

0.2.7

Sep 29, 2023

0.2.6

Sep 28, 2023

0.2.5

Sep 27, 2023

0.2.4

Sep 25, 2023

0.2.3

Sep 22, 2023

0.2.2

Sep 13, 2023

0.2.1

Sep 7, 2023

0.1.24.2

Sep 14, 2023

0.1.24.1

Aug 31, 2023

0.1.24

Aug 31, 2023

0.1.23.2

Aug 30, 2023

0.1.23.1

Aug 30, 2023

0.1.23

Aug 28, 2023

0.1.22

Aug 26, 2023

0.1.21

Aug 25, 2023

This version

0.1.20

Aug 25, 2023

0.1.19

Aug 23, 2023

0.1.18

Aug 23, 2023

0.1.17

Aug 9, 2023

0.1.16

Aug 3, 2023

0.1.15

Jul 28, 2023

0.1.14

Jul 18, 2023

0.1.13

Jul 12, 2023

0.1.12

Jun 30, 2023

0.1.11

Jun 28, 2023

0.1.10

Jun 26, 2023

0.1.9

Jun 23, 2023

0.1.8

Jun 22, 2023

0.1.7

Jun 9, 2023

0.1.6

Jun 1, 2023

0.1.5

May 22, 2023

0.1.4

May 18, 2023

0.1.3

May 16, 2023

0.1.2

May 5, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

azureml_rag-0.1.20-py3-none-any.whl (198.7 kB view details)

Uploaded Aug 25, 2023 Python 3

File details

Details for the file azureml_rag-0.1.20-py3-none-any.whl.

File metadata

Download URL: azureml_rag-0.1.20-py3-none-any.whl
Upload date: Aug 25, 2023
Size: 198.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.9.6 requests/2.31.0 setuptools/50.3.2 requests-toolbelt/1.0.0 tqdm/4.66.1 CPython/3.8.13

File hashes

Hashes for azureml_rag-0.1.20-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bc9b947061084ef29652a7ad0ffb43b3d1800e91967134f1e2de52c326f4718b`
MD5	`4e168ef51f527c92d35245d158227780`
BLAKE2b-256	`b18fd34efde2d91a5ac37d7be918f86290d6e114af3036aa445d22b29e6f939e`