Elastically
🔍 JoliCode's Elastica wrapper to bootstrap Elasticsearch PHP integrations
Install / Use
/learn @jolicode/ElasticallyREADME
Opinionated Elastica based framework to bootstrap PHP and Elasticsearch implementations.
Main features:
- <abbr title="Data Transfer Object">DTO</abbr> are first-class citizen, you send PHP object as documents, and get objects back on search results, like an ODM;
- All indexes are versioned and aliased automatically;
- Mappings are done via YAML files, PHP or custom via
MappingProviderInterface; - Analysis is separated from mappings to ease reuse;
- 100% compatibility with ruflin/elastica;
- Mapping migration capabilities with ReIndex;
- Symfony HttpClient compatible transport (optional);
- Symfony JsonStreamer support for faster serialization (optional);
- Tested with Elasticsearch 7, 8 and 9;
- Symfony support (optional):
- See dedicated chapter;
- Tested with Symfony 5.4 to 7;
- Symfony Messenger Handler support (with or without spool);
[!IMPORTANT] Require PHP 8.0+ and Elasticsearch >= 8
Works with Elasticsearch 7 as well but is not officially supported by Elastica 8. Use with caution.
Version 2+ does not work with OpenSearch anymore due to restrictions added by Elastic on their client.
You can check the changelog and the upgrade documents.
Installation
composer require jolicode/elastically
Demo
[!TIP] If you are using Symfony, you can move to the Symfony chapter
Quick example of what the library do on top of Elastica:
// Your own DTO, or one generated by Jane (see below)
class Beer
{
public string $foo;
public string $bar;
}
use JoliCode\Elastically\Factory;
use JoliCode\Elastically\Model\Document;
// Factory object with Elastica options + new Elastically options in the same array
$factory = new Factory([
// Where to find the mappings
Factory::CONFIG_MAPPINGS_DIRECTORY => __DIR__.'/mappings',
// What objects to find in each index
Factory::CONFIG_INDEX_CLASS_MAPPING => [
'beers' => Beer::class,
],
]);
// Class to perform request, same as the Elastica Client
$client = $factory->buildClient();
// Class to build Indexes
$indexBuilder = $factory->buildIndexBuilder();
// Create the Index in Elasticsearch
$index = $indexBuilder->createIndex('beers');
// Set the proper aliases
$indexBuilder->markAsLive($index, 'beers');
// Class to index DTO(s) in an Index
$indexer = $factory->buildIndexer();
$dto = new Beer();
$dto->bar = 'American Pale Ale';
$dto->foo = 'Hops from Alsace, France';
// Add a document to the queue
$indexer->scheduleIndex('beers', new Document('123', $dto));
$indexer->flush();
// Set parameters on the Bulk
$indexer->setBulkRequestParams([
'pipeline' => 'covfefe',
'refresh' => 'wait_for'
]);
// Force index refresh if needed
$indexer->refresh('beers');
// Get the Document (new!)
$results = $client->getIndex('beers')->getDocument('123');
// Get the DTO (new!)
$results = $client->getIndex('beers')->getModel('123');
// Perform a search
$results = $client->getIndex('beers')->search('alsace');
// Get the Elastic Document
$results->getDocuments()[0];
// Get the Elastica compatible Result
$results->getResults()[0];
// Get the DTO 🎉 (new!)
$results->getResults()[0]->getModel();
// Create a new version of the Index "beers"
$index = $indexBuilder->createIndex('beers');
// Slow down the Refresh Interval of the new Index to speed up indexation
$indexBuilder->slowDownRefresh($index);
$indexBuilder->speedUpRefresh($index);
// Set proper aliases
$indexBuilder->markAsLive($index, 'beers');
// Clean the old indices (close the previous one and delete the older)
$indexBuilder->purgeOldIndices('beers');
// Mapping change? Just call migrate and enjoy a full reindex (use the Task API internally to avoid timeout)
$newIndex = $indexBuilder->migrate($index);
$indexBuilder->speedUpRefresh($newIndex);
$indexBuilder->markAsLive($newIndex, 'beers');
[!NOTE]
scheduleIndexis here called with"beers"index because the index was already created before. If you are creating a new index and want to index documents into it, you should pass theIndexobject directly.
mappings/beers_mapping.yaml
# Anything you want, no validation
settings:
number_of_replicas: 1
number_of_shards: 1
refresh_interval: 60s
mappings:
dynamic: false
properties:
foo:
type: text
analyzer: english
fields:
keyword:
type: keyword
Configuration
This library add custom configurations on top of Elastica's:
Factory::CONFIG_MAPPINGS_DIRECTORY (required with default configuration)
The directory Elastically is going to look for YAML.
When creating a foobar index, a foobar_mapping.yaml file is expected.
If an analyzers.yaml file is present, all the indices will get it.
Factory::CONFIG_INDEX_CLASS_MAPPING (required)
An array of index name to class FQN.
[
'indexName' => My\AwesomeDTO::class,
]
Factory::CONFIG_MAPPINGS_PROVIDER
An instance of MappingProviderInterface.
If this option is not defined, the factory will fall back to YamlProvider and will use
Factory::CONFIG_MAPPINGS_DIRECTORY option.
There are two providers available in Elastically: YamlProvider and PhpProvider.
Factory::CONFIG_SERIALIZER (optional)
A SerializerInterface compatible object that will be used on indexation.
Default to Symfony Serializer with Object Normalizer.
A faster alternative is to use Jane to generate plain PHP Normalizer, see below. Also, we recommend customization to handle things like Date.
Factory::CONFIG_DENORMALIZER (optional)
A DenormalizerInterface compatible object that will be used on search results to build your objects back.
If this option is not defined, the factory will fall back to
Factory::CONFIG_SERIALIZER option.
Factory::CONFIG_SERIALIZER_CONTEXT_BUILDER (optional)
An instance of ContextBuilderInterface that build a serializer context from a
class name.
If it is not defined, Elastically, will use a StaticContextBuilder with the
configuration from Factory::CONFIG_SERIALIZER_CONTEXT_PER_CLASS.
Factory::CONFIG_SERIALIZER_CONTEXT_PER_CLASS (optional)
Allow to specify the Serializer context for normalization and denormalization.
[
Beer::class => ['attributes' => ['title']],
];
Default to [].
Factory::CONFIG_BULK_SIZE (optional)
When running indexation of lots of documents, this setting allow you to fine-tune the number of document threshold.
Default to 100.
Using JsonStreamer for faster serialization (optional)
Elastically supports Symfony JsonStreamer for faster serialization during indexation. JsonStreamer can significantly speed up the serialization process by streaming JSON directly without building intermediate data structures.
To use JsonStreamer:
- Install the package:
composer require symfony/json-streamer
- Add the
#[JsonStreamable]attribute to your DTO classes:
use Symfony\Component\JsonStreamer\Attribute\JsonStreamable;
#[JsonStreamable]
class Beer
{
public string $foo;
public string $bar;
}
That's it! Elastically will automatically detect the JsonStreamer package and use it for any DTO that has the #[JsonStreamable] attribute. DTOs without the attribute will continue to use the standard Symfony Serializer.
[!NOTE] JsonStreamer is only used for serialization during indexation. Deserialization of search results still uses the standard Symfony Serializer denormalizer.
Factory::CONFIG_INDEX_PREFIX (optional)
Add a prefix to all indexes and aliases created via Elastically.
Default to null.
Usage in Symfony
Configuration
You'll need to add the bundle in bundles.php:
// config/bundles.php
return [
// ...
JoliCode\Elastically\Bridge\Symfony\ElasticallyBundle::class => ['all' => true],
];
Then configure the bundle:
# config/packages/elastically.yaml
elastically:
connections:
# You can create multiple clients
default:
# Any Elastica option works here
client:
hosts:
- '127.0.0.1:9200'
# Use HttpClient component
transport_config:
http_client: 'Psr\Http\Client\ClientInterface'
# Path to the mapping directory (in YAML)
mapping_directory: '%kernel.project_dir%/config/elasticsearch'
# Size of the bulk sent to Elasticsearch (default to 100)
bulk_size: 100
# Mapping between an index name and a FQCN
index_class_mapping:
my-foobar-index: App\Dto\Foobar
# Configuration for the serializer
serializer:
# Fill a static context
context_mapping:
foo: bar
# If you want to add a prefix for your index in elasticsearch (you can still call it by its base name everywhere!)
# prefix: '%kernel.
