Fiats

A deep learning library for use in high-performance computing applications in modern Fortran

Generate Convert Improve

Install / Use

/learn @BerkeleyLab/Fiats

About this skill

Quality Score

0/100

README

___________.__        __          
\_   _____/|__|____ _/  |_  ______
 |    __)  |  \__  \\   __\/  ___/
 |     \   |  |/ __ \|  |  \___ \ 
 \___  /   |__(____  /__| /____  >
     \/            \/          \/

Fiats: Functional inference and training for surrogates

Alternatively, Fortran inference and training for science.

Overview | Getting Started | Documentation | Publications

Overview

Fiats supports research on the training and deployment of neural-network surrogate models for computational science. Fiats also provides a platform for exploring and advancing the native parallel programming features of Fortran 2023 in the context of deep learning. The design of Fiats centers around functional programming patterns that facilitate concurrency, including loop-level parallelism via the do concurrent construct and Single-Program, Multiple Data (SMPD) parallelism via "multi-image" (e.g., multithreaded or multiprocess) execution. Towards these ends,

Most Fiats procedures are pure and thus satisfy a language requirement for invocation inside do concurrent,
The network training procedure uses do concurrent to expose automatic parallelization opportunities to compilers, and
Exploiting multi-image execution to speedup training is under investigation.

To broaden support for the native parallel features, the Fiats contributors also write compiler tests, bug reports, and patches; develop a parallel runtime library ([Caffeine]); participate in the language standardization process; and provide example inference and training code for exercising and evaluating compilers' automatic parallelization capabilities on processors and accelerators, including Graphics Processing Units (GPUs).

Available optimizers:

Stochastic gradient descent and
Adam (recommended).

Supported network types:

Feed-forward networks and
Residual networks (for inference only).

Supported activation functions:

Sigmoid,
RELU,
GELU,
Swish, and
Step (for inference only).

Please submit a pull request or an issue to add or request other optimizers, network types, or activation functions.

Getting Started

Examples and demonstration applications

The [example] subdirectory contains demonstrations of several relatively simple use cases. We recommend reviewing the examples to see how to handle basic tasks such as configuring a network training run or reading a neural network and using it to perform inference.

The [demo] subdirectory contains demonstration applications that depend on Fiats but build separately due to requiring additional prerequisites such as [NetCDF] and [HDF5]. The demonstration applications

Train a cloud microphysics model surrogate for the Intermediate Complexity Atmospheric Research ([ICAR]) package,
Perform inference using a pretrained model for aerosol dynamics in the Energy Exascale Earth System ([E3SM]) package, and
Calculate ICAR cloud microphysics tensor component statistics that provide useful insights for training-data reduction.

Building and Testing

Because this repository supports programming language research, the code exercises new language features in novel ways. We recommend using any compiler's latest release or even building open-source compilers from source. The [handy-dandy] repository contains scripts capturing steps for building the [LLVM] compiler suite. The remainder of this section contains commands for building Fiats with a recent Fortran compiler and the Fortran Package Manager ([fpm]).

Dependencies

Fortran Package Manager (fpm)
- Julienne (automatically downloaded by fpm)
- Assert (automatically downloaded by fpm)
A supported Fortran compiler:
- LLVM (flang-new) version 19 or higher
- NAG (nagfor) version 7.2 Build 7235 or higher
- Intel (ifx) version 2025.2 or higher

Supported Compilers

LLVM (`flang-new`)

Testing Fiats with LLVM flang-new version 20 or later with the following command should report that all tests pass:

fpm test --compiler flang-new --flag "-cpp -O3"

With LLVM 19, add -mmlir -allow-assumed-rank to the --flag argument. With LLVM 21, it should be possible to automatically parallelize batch inferences on central processing units (CPUs). For example, run concurrent-inferences in parallel as follows:

fpm run \
  --example concurrent-inferences \
  --compiler flang-new \
  --flag "-cpp -O3 -fopenmp -fdo-concurrent-to-openmp=host" \
  -- --network model.json

where model.json must be a neural network in the [JSON] format used by Fiats and the companion [nexport] package. Rouson et al. (2025) demonstrated that this approach achieves performance on par with using OpenMP compiler directives. Work is under way to automatically parallelize training on CPUs and to offload inference and training to GPUs.

NAG (`nagfor`)

With nagfor 7.2 Build 7235 or later, the following command builds Fiats and reports that all tests pass:

fpm test --compiler nagfor --flag "-fpp -O3"

Partially Supported Compilers

Fiats release 0.14.0 and earlier support the use of the GNU and Intel Fortran compilers. We are corresponding with these compilers' developers about addressing the issues preventing building newer Fiats releases.

GNU (`gfortran`)

Compiler bugs related to parameterized derived types currently prevent gfortran from building Fiats versions 0.15.0 or later. Test and build earlier versions of Fiats build with the following command:

fpm test --compiler gfortran --profile release

Intel (`ifx`)

Testing with ifx versions higher than 2025.1 with the following command should report all tests passing except one:

fpm test --compiler ifx --flag "-fpp -coarray" --profile release

The failing test converts a neural network with varying-width hidden layers to and from JSON. The reason for this failure is under investigation. Please submit an issue if you would like to use ifx and require hidden layers of varying width.

Experimental: Automatic offloading of `do concurrent` to GPUs

This capability is under development with the goal to facilitate automatic GPU offloading via the following command:

fpm test --compiler ifx --profile release --flag "-fpp -fopenmp-target-do-concurrent -qopenmp -fopenmp-targets=spir64 -O3 -coarray"

Under Development

We are corresponding with the developers of the compiler(s) below about addressing the compiler issues preventing building Fiats.

HPE Cray Compiler Environment (CCE) (`crayftn.sh`)

Building with the CCE ftn compiler wrapper requires an additional trivial wrapper. For example, create a file crayftn.sh with the following contents and place this file's location in your PATH:

#!/bin/bash

ftn "$@"

Then execute

fpm test --compiler crayftn.sh

Configuring a training run

Fiats imports hyperparameters and network configurations to and from JSON files. To see the expected file format, run the [print-training-configuration] example as follows:

% fpm run --example print-training-configuration --compiler gfortran

which should produce output like the following:

Project is up to date
{
    "hyperparameters": {
        "mini-batches" : 10,
        "learning rate" : 1.5,
        "optimizer" : "adam"
    }
,
    "network configuration": {
        "skip connections" : false,
        "nodes per layer" : [2,72,2],
        "activation function" : "sigmoid"
    }
,
    "tensor names": {
        "inputs"  : ["pressure","temperature"],
        "outputs" : ["saturated mixing ratio"]
    },  
     "training data file names": {
         "path" : "fiats-training-data",
         "inputs prefix"  : "training_input-image-",
         "outputs prefix" : "training_output-image-",
         "infixes" : ["000001", "000002", "000003", "000004", "000005", "000006", "000007", "000008", "000009", "000010"]
     }   
}

The Fiats JSON file format is fragile: splitting or combining lines breaks the file reader. It should be ok, however, to add or removed white space or to reordered whole objects such as placing the "network configuration" object above the "hyperparameters" object. A future release will leverage the [rojff] JSON interface to allow for more flexible file formatting.

Training a neural network

Running the following command will train a neural network to learn the saturated mixing ratio function that is one component of the ICAR SB04 cloud microphysics model (see the [saturated_mixing_ratio_m] module for an implementation of the involved function):

 fpm run --example learn-saturated-mixing-ratio --compiler gfortran --profile release -- --output-file sat-mix-rat.json

The following is representative output after 3000 epochs:

 Initializing a new network
         Epoch | Cost Function| System_Clock | Nodes per Layer
         1000    0.79896E-04     4.8890      2,4,72,2,1
         2000    0.61259E-04     9.8345      2,4,72,2,1
         3000    0.45270E-04     14.864      2,4,72,2,1

The example program halts execution after reaching a cost-function threshold (which requires millions of epochs) or a maximum number of iterations or if the program detects a file named stop in the source-tree root directory. Before halting, the program will print a table of expected and predicted saturated mixing ratio values across a range of input pressures and temperatures, wherein two the inputs have each been mapped to the unit interval [0,1]. The program also writes the neural network initial condition to initial-network.json and the final (trained) network to the file specified in the above command: sat-mix-rat.json.

Performing inference

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

groundhog

399

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

last30days-skill

18.8k

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

sec-edgar-agentkit

AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.

BerkeleyLab

View profile

View on GitHub

GitHub Stars71

CategoryEducation

Updated7d ago

Forks28

BerkeleyLab/fiats

Languages

Fortran

Security Score

85/100

Audited on Mar 30, 2026

No findings