Marv
Marv is a programmatic database migration tool with pluggable drivers for MySQL, PostgreSQL, SQLite, Microsoft SQL Server and Oracle DB.
TL;DR
Create a directory of migrations
migrations/
|- 001.create-table.sql
|- 002.create-another-table.sql
Usage
Promises
const path = require('path');
const marv = require('marv/api/promise'); // <-- Promise API
const driver = require('marv-pg-driver');
const directory = path.resolve('migrations');
const connection = {
  // Properties are passed straight to pg.Client
  host: 'postgres.example.com',
};
const migrations = await marv.scan(directory);
await marv.migrate(migrations, driver({ connection }));
// Profit :)
Callbacks
const path = require('path');
const marv = require('marv/api/callback'); // <-- Callback API
const driver = require('marv-pg-driver');
const directory = path.resolve('migrations');
const connection = {
  // Properties are passed straight to pg.Client
  host: 'postgres.example.com',
};
marv.scan(directory, (err, migrations) => {
  if (err) throw err;
  marv.migrate(migrations, driver({ connection }), (err) => {
    if (err) throw err;
    // Profit :)
  });
});
Migration Files
Migration files are just SQL scripts. Filenames must be in the form <level><separator><comment>.<extension> where:
- level must be numeric and greater than or equal to 0
- separator can be any non-numeric character
- comment can contain any characters except '.'
- extension is any file extension. See Filtering Migration Files (under Advanced Usage) for how to filter migration files.
Marv runs migrations in order. If two migration files in the same namespace share the same level, marv will report an error. Gaps in the sequence are tolerated, but marv will report an error if it detects that a migration has been run out of sequence. This has implications for your branching strategy. For example, if you work on two isolated feature branches that both require database migrations, you should start both sets of migrations from the current level, then resolve the ordering when merging back to trunk.
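For example, suppose trunk is at level 003 and two feature branches each add a migration (the filenames below are purely illustrative):
feature-a/migrations/
|- 004.create-orders-table.sql
feature-b/migrations/
|- 004.create-invoices-table.sql
When merging both branches back to trunk, renumber one of the files (e.g. to 005.create-invoices-table.sql) so that the levels remain unique and reflect the order in which the migrations should run.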
Drivers
The following drivers exist for marv.
- marv-pg-driver
- marv-mysql-driver
- marv-better-sqlite3-driver
- marv-mssql-driver
- marv-oracledb-driver
- marv-foxpro-driver
If you want to add a new driver please use the compliance tests and include at least one end-to-end test. See marv-pg-driver for an example.
Configuring Drivers
You can configure a driver by passing it options, e.g.
const options = {
  // defaults to 'migrations'
  table: 'db_migrations',
  // The connection sub-document is passed directly to the underlying database library,
  // in this case pg.Client
  connection: {
    host: 'localhost',
    port: 5432,
    database: 'postgres',
    user: 'postgres',
    password: '',
  },
};
const migrations = await marv.scan(directory);
await marv.migrate(migrations, driver(options));
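Other drivers are configured the same way. Below is a minimal sketch for marv-mysql-driver, assuming its options follow the same shape and its connection sub-document is likewise passed straight to the underlying MySQL client (check the driver's own README for the exact options):
const path = require('path');
const marv = require('marv/api/promise');
const driver = require('marv-mysql-driver');
const directory = path.resolve('migrations');
const options = {
  table: 'db_migrations', // assumed to default to 'migrations', as with marv-pg-driver
  connection: {
    // assumed to be forwarded to the underlying MySQL client
    host: 'mysql.example.com',
    port: 3306,
    database: 'app',
    user: 'app',
    password: '',
  },
};
const migrations = await marv.scan(directory);
await marv.migrate(migrations, driver(options));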
What Makes Marv Special
Before writing Marv we evaluated existing tools against the following criteria:
- Cluster safe
- Works with raw SQL
- Programmatic API so we can invoke it on application startup
- Supports multiple databases including PostgreSQL, MySQL, SQLite, MSSQL and Oracle via optional plugins
- Can be run repeatedly from integration tests
- Reports errors via events, callbacks or promise rejections rather than throwing or logging
- Follows the rule of silence
- Reasonable code hygiene
- Reasonably well tested
We evaluated several existing candidates, but disappointingly they all fell short. Marv does all these things in less than 150 lines, with around another 150 lines for a driver.
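For example, invoking Marv programmatically on application startup (one of the criteria above) can look like the following sketch, which assumes an Express app; the server, port and connection details are illustrative:
const path = require('path');
const express = require('express');
const marv = require('marv/api/promise');
const driver = require('marv-pg-driver');

async function start() {
  // Apply any outstanding migrations before accepting traffic
  const migrations = await marv.scan(path.resolve('migrations'));
  await marv.migrate(migrations, driver({ connection: { host: 'postgres.example.com' } }));

  const app = express();
  app.listen(3000);
}

start().catch((err) => {
  console.error(err);
  process.exit(1);
});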
What Marv Doesn't Do
One of the reasons Marv has a small and simple code base is that it doesn't come with a lot of unnecessary bells and whistles. It doesn't support:
- Rollbacks (we make our db changes backwards compatible so we can deploy without downtime).
- A DSL (high maintenance and restrictive)
- Conditional migrations
- A command line interface (we may implement this in future)
- Checksum validation (we may implement this in future)
Important Notes About Transactions
Marv is unlike some other migration libraries in that it deliberately doesn't run your scripts in a transaction. This is because some SQL statements cannot be run in a transaction, and others (e.g. locking in Postgres) will automatically commit the current transaction if one exists. Unfortunately this means that in rare situations, scripts may be only partially applied, e.g.
CREATE TABLE customer (
  id BIGSERIAL PRIMARY KEY,
  name TEXT
);
CREATE INDEX customer_name ON customer (
  name
);
If something goes wrong (e.g. a network outage) after CREATE TABLE but before CREATE INDEX, the table would be created without the index. Because scripts are only audited on successful completion, the script will be included in the next migration run, but now the CREATE TABLE step will fail because the table already exists. One way to work around this is by explicitly specifying a transaction...
BEGIN TRANSACTION;
CREATE TABLE customer (
  id BIGSERIAL PRIMARY KEY,
  name TEXT
);
CREATE INDEX customer_name ON customer (
  name
);
END TRANSACTION;
However there's still a gotcha. Now the script will either be applied in full or not at all, but consider what happens if the network outage occurs after the script has been applied, but before Marv inserts the audit record. Because the script hasn't been audited, Marv won't know that it completed successfully and will still include it in the next migration run. Once again it will fail on the CREATE TABLE step. A better workaround is to make your scripts idempotent, e.g.
CREATE TABLE IF NOT EXISTS customer (
  id BIGSERIAL PRIMARY KEY,
  name TEXT
);
CREATE INDEX IF NOT EXISTS customer_name ON customer (
  name
);
Unfortunately not all statements and SQL dialects have an equivalent of IF NOT EXISTS. If you're especially unlucky and something goes wrong while applying a non-atomic / non-idempotent script you will have some manual clean up to do. This may involve applying the missing steps and inserting the audit record manually. The exact syntax will vary from driver to driver but should be similar to...
$ cat migrations/002.create-customer-table.sql | md5
82b392f3594050ecefd768bfe258843b
INSERT INTO migrations (level, comment, "timestamp", checksum) VALUES (2, 'create customer table', now(), '82b392f3594050ecefd768bfe258843b');
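Before repairing anything by hand it is worth checking what Marv has already recorded. Assuming the default table name of migrations (and the column names shown above, which may vary by driver), a query such as the following shows the audited levels:
SELECT level, comment, "timestamp", checksum
FROM migrations
ORDER BY level;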
Advanced Usage
Filtering Migration Files
If you would like to exclude files from your migrations directory, you can specify a filter, e.g.
migrations/
|- 001.create-table.sql
|- 002.create-another-table.sql
const migrations = await marv.scan(directory, { filter: /\.sql$/ });
Namespacing
All migration scripts are namespaced. If a namespace is not provided explicitly, they're assigned to the 'default' namespace. Namespaces can be used to isolate migrations when multiple applications maintain (a subset of) tables in the same database.
A namespace can be passed as an option to the scan method, in which case all migrations returned by that scan will be assigned to it. Alternatively the namespace can be set in a .marvrc file, in which case all the migrations in that folder will be assigned to it.
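For example, two applications that maintain tables in the same database could each scan with their own namespace so that their migration histories do not collide. A minimal sketch (the namespace values are illustrative):
// In the blog application
const blogMigrations = await marv.scan(path.resolve('migrations'), { namespace: 'blog' });
await marv.migrate(blogMigrations, driver({ connection }));
// In the shop application
const shopMigrations = await marv.scan(path.resolve('migrations'), { namespace: 'shop' });
await marv.migrate(shopMigrations, driver({ connection }));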
.marvrc
You can configure marv by placing a .marvrc file in your migrations folder
migrations/
|- .marvrc
|- 001.create-table.sql
|- 002.create-another-table.sql
{
  "filter": "\\.sql$",
  "directives": {
    "audit": "false"
  },
  "namespace": "blogs"
}
const migrations = await marv.scan(directory, { namespace: 'custom' });
Directives
Directives allow you to customise the behaviour of migrations. You can specify directives in three ways...
- Programmatically via marv.scan:
  const migrations = await marv.scan(directory, { filter: /\.sql$/, directives: { audit: false } });
- Via .marvrc:
  { "filter": "\\.sql$", "directives": { "audit": "false" } }
- Using a specially formed comment in a migration file
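The only directive shown in this README is audit. Assuming that setting audit to false tells Marv not to record the migration in the audit table, so that it is re-applied on every run (useful for repeatable scripts such as views or stored procedures; check the Marv documentation to confirm), a programmatic sketch looks like:
const migrations = await marv.scan(directory, {
  directives: { audit: false }, // assumed: skip auditing so these migrations re-run every time
});
await marv.migrate(migrations, driver({ connection }));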