Milvus
A PHP Client for the Milvus Vector Database Rest API
Install / Use
/learn @HelgeSverre/MilvusREADME
Milvus.io PHP API Client
Milvus is an open-source vector database that is highly flexible, reliable, and blazing fast. It supports adding, deleting, updating, and near real-time search of vectors on a trillion-byte scale.
This package is an API Client for the Milvus v2.3.3 Restful API, and is built on the Saloon package.
Documentation about the Restful API is available on the Milvus website, and an OpenAPI spec is available here.
Versions
| Milvus Version | PHP Client Version | |----------------|--------------------| | v2.3.x | v0.0.x | | v2.2.x | Not supported (*) |
(*) But is mostly compatible, the only difference (that I can see) between them is the new Vector Upsert endpoint, and
new parameters (params.range_filter and params.radius) in the Vector Search endpoint.
Installation
You can install the package via composer:
composer require helgesverre/milvus
You can publish the config file with:
php artisan vendor:publish --tag="milvus-config"
This is the contents of the published config/milvus.php file:
return [
'token' => env('MILVUS_TOKEN'),
'username' => env('MILVUS_USERNAME'),
'password' => env('MILVUS_PASSWORD'),
'host' => env('MILVUS_HOST', 'localhost'),
'port' => env('MILVUS_PORT', '19530'),
];
Usage
With Laravel
For Laravel users, you can use the Milvus facade to interact with the Milvus API:
use HelgeSverre\Milvus\Facades\Milvus;
// NOTE: dbName is optional and defaults to 'default', this is only relevant if you have multiple databases.
// List all collections in the 'default' database
Milvus::collections()->list(
dbName: 'default'
);
// Create a new collection named 'documents' in the 'default' database with a specified dimension
Milvus::collections()->create(
collectionName: 'documents',
dimension: 128,
dbName: 'default',
);
// Describe the structure and properties of the 'documents' collection in the 'default' database
Milvus::collections()->describe(
collectionName: 'documents',
dbName: 'default',
);
// Drop or delete the 'documents' collection from the 'default' database
Milvus::collections()->drop(
collectionName: 'documents',
dbName: 'default',
);
// Insert a new vector into the 'documents' collection with additional fields like title and link
// Note "vector" is a reserved field name and must be used for the vector data
Milvus::vector()->insert(
collectionName: 'documents',
data: [
'vector' => [0.1, 0.2, 0.3 /* etc... */],
"title" => "Document name here",
"link" => "https://example.com/document-name-here",
]
);
// Search for similar vectors in the 'documents' collection using a provided vector
Milvus::vector()->search(
collectionName: 'documents',
vector: [0.1, 0.2, 0.3 /* etc... */],
);
// Delete a vector from the 'documents' collection using its ID
Milvus::vector()->delete(
id: '123129471497',
collectionName: 'documents'
);
// Query the 'documents' collection for specific documents using a filter condition and select specific output fields
Milvus::vector()->query(
collectionName: 'documents',
filter: "id in [443300716234671427, 443300716234671426]",
outputFields: ["id", "title", "link"],
);
// Retrieve a specific vector from the 'documents' collection using its ID
Milvus::vector()->get(
id: '123129471497',
collectionName: 'documents'
);
// Update or insert a vector in the 'documents' collection. If the ID exists, it's updated; if not, a new entry is created
Milvus::vector()->upsert(
collectionName: 'documents',
data: [
'id' => 123129471497,
'vector' => [0.1, 0.2, 0.3 /* etc... */],
"title" => "Document name here",
"link" => "https://example.com/document-name-here",
]
);
Without Laravel
If you are not using laravel, you will have to create a new instance of the Milvus class and provide a token or user/pass, the host and the port.
<?php
// use HelgeSverre\Milvus\Facades\Milvus;
use HelgeSverre\Milvus\Milvus;
$milvus = new Milvus(
token: "your-token",
host: "localhost",
port: "19530"
);
// Import the Milvus facade for easier access to Milvus functions
// NOTE: dbName is optional and defaults to 'default', this is only relevant if you have multiple databases.
// List all collections in the 'default' database
$milvus->collections()->list(
dbName: 'default'
);
// Create a new collection named 'documents' in the 'default' database with a specified dimension
$milvus->collections()->create(
collectionName: 'documents',
dimension: 128,
dbName: 'default',
);
// Describe the structure and properties of the 'documents' collection in the 'default' database
$milvus->collections()->describe(
collectionName: 'documents',
dbName: 'default',
);
// Drop or delete the 'documents' collection from the 'default' database
$milvus->collections()->drop(
collectionName: 'documents',
dbName: 'default',
);
// Insert a new vector into the 'documents' collection with additional fields like title and link
// Note "vector" is a reserved field name and must be used for the vector data
$milvus->vector()->insert(
collectionName: 'documents',
data: [
'vector' => [0.1, 0.2, 0.3 /* etc... */],
"title" => "Document name here",
"link" => "https://example.com/document-name-here",
]
);
// Search for similar vectors in the 'documents' collection using a provided vector
$milvus->vector()->search(
collectionName: 'documents',
vector: [0.1, 0.2, 0.3 /* etc... */],
);
// Delete a vector from the 'documents' collection using its ID
$milvus->vector()->delete(
id: '123129471497',
collectionName: 'documents'
);
// Query the 'documents' collection for specific documents using a filter condition and select specific output fields
$milvus->vector()->query(
collectionName: 'documents',
filter: "id in [443300716234671427, 443300716234671426]",
outputFields: ["id", "title", "link"],
);
// Retrieve a specific vector from the 'documents' collection using its ID
$milvus->vector()->get(
id: '123129471497',
collectionName: 'documents'
);
// Update or insert a vector in the 'documents' collection. If the ID exists, it's updated; if not, a new entry is created
$milvus->vector()->upsert(
collectionName: 'documents',
data: [
'id' => 123129471497,
'vector' => [0.1, 0.2, 0.3 /* etc... */],
"title" => "Document name here",
"link" => "https://example.com/document-name-here",
]
);
Using with Zilliz Cloud
If you are using the hosted version of Milvus, you will need to specify the following host and port along with your API token:
use HelgeSverre\Milvus\Milvus;
$milvus = new Milvus(
token: "db_randomstringhere:passwordhere",
host: 'https://in03-somerandomstring.api.gcp-us-west1.zillizcloud.com',
port: '443'
);
Example: Semantic Search with Milvus and OpenAI Embeddings
This example demonstrates how to perform a semantic search in Milvus using embeddings generated from OpenAI.
Prepare Your Data
First, create an array of data you wish to index. In this example, we'll use blog posts with titles, summaries, and tags.
$blogPosts = [
[
'title' => 'Exploring Laravel',
'summary' => 'A deep dive into Laravel frameworks...',
'tags' => ['PHP', 'Laravel', 'Web Development']
],
[
'title' => 'Exploring Laravel',
'summary' => 'A deep dive into Laravel frameworks, exploring its features and benefits for modern web development.',
'tags' => ['PHP', 'Laravel', 'Web Development']
],
[
'title' => 'Introduction to React',
'summary' => 'Understanding the basics of React and how it revolutionizes frontend development.',
'tags' => ['JavaScript', 'React', 'Frontend']
],
[
'title' => 'Getting Started with Vue.js',
'summary' => 'A beginner’s guide to building interactive web interfaces with Vue.js.',
'tags' => ['JavaScript', 'Vue.js', 'Frontend']
],
];
Generate Embeddings
Use OpenAI's embeddings API to convert the summaries of your blog posts into vector embeddings.
$summaries = array_column($blogPosts, 'summary');
$embeddingsResponse = OpenAI::client('sk-your-openai-api-key')
->embeddings()
->create([
'model' => 'text-embedding-ada-002',
'input' => $summaries,
]);
foreach ($embeddingsResponse->embeddings as $embedding) {
$blogPosts[$embedding->index]['vector'] = $embedding->embedding;
}
Create Milvus collection
Create a collection in Milvus to store your blog post embeddings, note that the dimension of the embeddings must match
the dimension of the embeddings generated by OpenAI (1536 if you are using the text-embedding-ada-002 model).
$milvus = new Milvus(
token: "your-token",
host: "localhost",
port: "19530"
);
$milvus->collections()->create(
collectionName: 'blog_posts',
dimension: 1536,
);
Insert into Milvus
Insert these embeddings, along with other blog post data, into your Milvus collection.
$insertResponse = $milvus->vector()->insert('blog_posts', $blogPosts);
Creating a Search Vector with OpenAI
Gen
