Open In Colab

Quickstart#

This notebook demonstrates how to use AFL-agent to analyze measurement data and identify different phases. We’ll create a simple pipeline that:

  1. Calculates derivatives of measurement data using Savitzky-Golay filtering

  2. Computes similarity between measurements

  3. Uses spectral clustering to group similar measurements into phases

We’ll work with synthetic data that simulates two different types of signals: a flat background and a power-law decay, both with added noise.
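Purely as an illustration of these two signal types (this is not the actual generator behind the packaged example dataset, whose parameters differ), such signals could be produced along these lines:

import numpy as np

# Illustrative sketch only: the exponent, noise level, and grid below are
# arbitrary choices, not the parameters used to build example_dataset1.
rng = np.random.default_rng(0)
x = np.logspace(-3, 0, 150)                                        # measurement grid

flat_background = 1.0 + 0.05 * rng.normal(size=x.shape)            # type 1: flat + noise
power_law_decay = x**-2.0 * (1 + 0.05 * rng.normal(size=x.shape))  # type 2: power-law decay + noise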

Google Colab Setup#

Only uncomment and run the next cell if you are running this notebook in Google Colab or if you don’t already have the AFL-agent package installed.

[2]:
# !pip install git+https://github.com/usnistgov/AFL-agent.git

Define Pipeline#

[1]:
from AFL.double_agent import *

with Pipeline() as clustering_pipeline:
    SavgolFilter(
        input_variable='measurement',
        output_variable='derivative',
        dim='x',
        derivative=1,
    )

    Similarity(
        input_variable='derivative',
        output_variable='similarity',
        sample_dim='sample',
        params={'metric': 'laplacian', 'gamma': 1e-4},
    )

    SpectralClustering(
        input_variable='similarity',
        output_variable='labels',
        dim='sample',
        params={'n_phases': 2},
    )

The pipeline above consists of three operations:

  1. SavgolFilter: Applies Savitzky-Golay filtering to calculate derivatives of the measurement data along the x-dimension. This helps identify changes in the signal shape.

  2. Similarity: Computes pairwise similarity between measurements using their derivatives. It uses a Laplacian kernel with gamma=1e-4 to quantify how similar each measurement is to every other measurement.

  3. SpectralClustering: Groups measurements into 2 phases based on their similarity scores. Measurements with high similarity will be grouped into the same phase. (A conceptual sketch of these steps follows below.)
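To make these steps concrete, here is a minimal sketch of what they compute on a plain (n_samples, n_points) array. It uses scipy and scikit-learn as stand-ins and is not AFL-agent’s internal implementation; the Savitzky-Golay window and polynomial order here are arbitrary choices for illustration.

import numpy as np
from scipy.signal import savgol_filter
from sklearn.metrics.pairwise import laplacian_kernel
from sklearn.cluster import SpectralClustering as SKSpectralClustering

def sketch_cluster(measurements, n_phases=2, gamma=1e-4):
    """Conceptual stand-in for the SavgolFilter + Similarity + SpectralClustering steps."""
    # 1) smoothed first derivative of every measurement (window/order arbitrary here)
    derivatives = savgol_filter(measurements, window_length=11, polyorder=2,
                                deriv=1, axis=-1)
    # 2) pairwise Laplacian-kernel similarity: exp(-gamma * ||d_i - d_j||_1)
    similarity = laplacian_kernel(derivatives, gamma=gamma)
    # 3) spectral clustering on the precomputed affinity matrix
    return SKSpectralClustering(n_clusters=n_phases,
                                affinity='precomputed').fit_predict(similarity)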

Load Input Data#

To test this pipeline, we need some data to analyze. We’ll use a synthetic dataset containing measurements from a two-phase system. Each measurement represents a signal collected from a sample with different compositions of components A and B. For details on how this dataset was created, see the Building xarray Datasets tutorial.

[1]:
from AFL.double_agent.data import example_dataset1

# Load the example dataset
ds = example_dataset1()
ds
[1]:
<xarray.Dataset> Size: 164kB
Dimensions:              (sample: 100, component: 2, x: 150, grid: 2500)
Coordinates:
  * component            (component) <U1 8B 'A' 'B'
  * x                    (x) float64 1kB 0.001 0.001047 0.001097 ... 0.9547 1.0
Dimensions without coordinates: sample, grid
Data variables:
    composition          (sample, component) float64 2kB ...
    ground_truth_labels  (sample) int64 800B ...
    measurement          (sample, x) float64 120kB ...
    composition_grid     (grid, component) float64 40kB ...

We can plot the measurement data variable to see the two classes of measurements that we’re going to try to separate.

[11]:
ds.measurement.plot.line(x='x',xscale='log',yscale='log',add_legend=False);
../_images/tutorials_quickstart_11_0.png

Execute the Pipeline#

Now we’ll execute our clustering pipeline on the dataset. The pipeline will process the measurements, calculate derivatives, compute similarity between samples, and finally assign each measurement to one of the two phases.

[17]:
ds_result = clustering_pipeline.calculate(ds)
ds_result
[17]:
<xarray.Dataset> Size: 184kB
Dimensions:      (sample: 50, x: 150, log_x: 250, sample_i: 50, sample_j: 50)
Coordinates:
  * x            (x) float64 1kB 0.001 0.001047 0.001097 ... 0.9114 0.9547 1.0
  * log_x        (log_x) float64 2kB -3.0 -2.988 -2.976 ... -0.0241 -0.01205 0.0
Dimensions without coordinates: sample, sample_i, sample_j
Data variables:
    measurement  (sample, x) float64 60kB 1.667e+06 1.417e+06 ... 1.736 2.463
    derivative   (sample, log_x) float64 100kB -3.262 -3.317 ... -0.2221 -0.2423
    similarity   (sample_i, sample_j) float64 20kB 1.0 0.9496 ... 0.9958 1.0
    labels       (sample) int64 400B 0 1 1 1 1 1 0 0 0 1 ... 0 0 0 1 1 0 1 1 1 1

In the resulting xarray.Dataset, we see a data variable called labels. This is the pipeline’s attempt to group the data into two classes: Phase 0 or Phase 1.

Let’s plot the data in two subplots to visualize which measurements the pipeline assigned to each class.

[ ]:
import matplotlib.pyplot as plt

fig, axes = plt.subplots(1, 2, figsize=(8, 3))
for label, sub_ds in ds_result.groupby('labels'):
    sub_ds.measurement.plot.line(x='x', xscale='log', yscale='log', add_legend=False, ax=axes[label])
    axes[label].set_title(f'Phase {label}')

fig.tight_layout()

../_images/tutorials_quickstart_16_0.png

We can see that the pipeline does an excellent job at separating the data!
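For a quantitative check rather than a visual one, the input dataset carries a ground_truth_labels variable. As a sketch (assuming ds_result was computed from the same ds loaded above), a permutation-invariant score such as the adjusted Rand index can compare the cluster assignments to the ground truth, since the cluster indices 0 and 1 are arbitrary:

from sklearn.metrics import adjusted_rand_score

# Sketch: compare the pipeline's labels against the ground-truth phase labels.
# Cluster indices are arbitrary, so use a permutation-invariant score.
score = adjusted_rand_score(ds.ground_truth_labels.values, ds_result.labels.values)
print(f"Adjusted Rand index: {score:.3f}")  # 1.0 indicates a perfect match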

Conclusion#

In this quickstart tutorial, we demonstrated how to use the clustering pipeline to automatically classify different phases in a dataset. We:

  1. Started with a dataset containing multiple measurements with two distinct patterns

  2. Applied the clustering pipeline to analyze and classify the data

  3. Successfully separated the measurements into two distinct phases

The pipeline was able to automatically detect and group similar measurements, making it a powerful tool for analyzing phase transitions and other classification tasks in scientific data.