Basic Walk-Through (Python API)¶

This notebook fits a bespoke force field via the Python API. It mirrors the CLI walk-through but builds WorkflowSettings programmatically. Use the Python API when you want to:

Sweep settings or batch-fit many molecules.
Inject runtime objects (e.g. an ASE calculator) that can't round-trip through YAML.
Integrate presto into a larger pipeline.

For the CLI version of the same fit, see basic-walk-through-cli.ipynb.

1. Build a `WorkflowSettings`¶

In [ ]:

Copied!





from presto.settings import ParamSettings, WorkflowSettings
from presto.workflow import get_bespoke_force_field

settings = WorkflowSettings(
    param_settings=ParamSettings(
        molecule_input_type="smiles",
        molecules="CCO",
    ),
    device_type="cuda",
    n_iterations=2,
)
settings
from presto.settings import ParamSettings, WorkflowSettings
from presto.workflow import get_bespoke_force_field

settings = WorkflowSettings(
    param_settings=ParamSettings(
        molecule_input_type="smiles",
        molecules="CCO",
    ),
    device_type="cuda",
    n_iterations=2,
)
settings

Validation runs at construction time — invalid configurations raise InvalidSettingsError immediately. See the API reference for the full settings documentation.

2. Round-trip via YAML¶

WorkflowSettings.to_yaml writes a settings file you can later replay with presto train-from-yaml.

In [2]:

Copied!





settings.to_yaml("workflow_settings.yaml")

# Reload, with a one-off override
from presto.settings import WorkflowSettings

reloaded = WorkflowSettings.from_yaml(
    "workflow_settings.yaml",
    overwrite={"training_settings": {"n_epochs": 200}},
)
reloaded.training_settings.n_epochs
settings.to_yaml("workflow_settings.yaml")

# Reload, with a one-off override
from presto.settings import WorkflowSettings

reloaded = WorkflowSettings.from_yaml(
    "workflow_settings.yaml",
    overwrite={"training_settings": {"n_epochs": 200}},
)
reloaded.training_settings.n_epochs

Out[2]:

3. Run the fit¶

get_bespoke_force_field returns the final fitted openff.toolkit.ForceField and writes the same outputs presto train would (see Output directory layout).

In [ ]:

Copied!

bespoke_ff = get_bespoke_force_field(settings)
bespoke_ff
bespoke_ff = get_bespoke_force_field(settings)
bespoke_ff

4. Use the bespoke force field¶

The returned object is a standard SMIRNOFF ForceField. Use it like any other OpenFF force field:

In [4]:

Copied!

from openff.toolkit import Molecule

mol = Molecule.from_smiles("CCO")
system = bespoke_ff.create_interchange(mol.to_topology()).to_openmm()
system
from openff.toolkit import Molecule

mol = Molecule.from_smiles("CCO")
system = bespoke_ff.create_interchange(mol.to_topology()).to_openmm()
system

INFO:openff.interchange.smirnoff._nonbonded:Charge section NAGLCharges, using NAGL model 
openff-gnn-am1bcc-1.0.0.pt, applied to topology atom index 0

INFO:openff.interchange.smirnoff._nonbonded:Charge section NAGLCharges, using NAGL model 
openff-gnn-am1bcc-1.0.0.pt, applied to topology atom index 1

INFO:openff.interchange.smirnoff._nonbonded:Charge section NAGLCharges, using NAGL model 
openff-gnn-am1bcc-1.0.0.pt, applied to topology atom index 2

INFO:openff.interchange.smirnoff._nonbonded:Charge section NAGLCharges, using NAGL model 
openff-gnn-am1bcc-1.0.0.pt, applied to topology atom index 3

INFO:openff.interchange.smirnoff._nonbonded:Charge section NAGLCharges, using NAGL model 
openff-gnn-am1bcc-1.0.0.pt, applied to topology atom index 4

INFO:openff.interchange.smirnoff._nonbonded:Charge section NAGLCharges, using NAGL model 
openff-gnn-am1bcc-1.0.0.pt, applied to topology atom index 5

INFO:openff.interchange.smirnoff._nonbonded:Charge section NAGLCharges, using NAGL model 
openff-gnn-am1bcc-1.0.0.pt, applied to topology atom index 6

INFO:openff.interchange.smirnoff._nonbonded:Charge section NAGLCharges, using NAGL model 
openff-gnn-am1bcc-1.0.0.pt, applied to topology atom index 7

INFO:openff.interchange.smirnoff._nonbonded:Charge section NAGLCharges, using NAGL model 
openff-gnn-am1bcc-1.0.0.pt, applied to topology atom index 8

Out[4]:

<openmm.openmm.System; proxy of <Swig Object of type 'OpenMM::System *' at 0x77c9efe56d30> >

5. Inspect outputs¶

The plots and diagnostic files live under <output_dir>/plots/. For what each plot means, see Inspect outputs and plots.

In [5]:

Copied!

from IPython.display import Image

Image("plots/loss.png")
from IPython.display import Image

Image("plots/loss.png")

Out[5]:

No description has been provided for this image

6. Bring your own MLP via ASE¶

If your reference MLP isn't directly supported by OpenMM-ML, wrap it in an ASE calculator and pass it through MLPSettings(ml_potential="ase", ml_system_kwargs={"calculator": calc}). The example below uses ASE's built-in EMT calculator for portability — replace it with your real MLP. See Use an ASE calculator for the full pattern.

In [ ]:

Copied!





from ase.calculators.emt import EMT

from presto.settings import (
    MLMDSamplingSettings,
    MLPSettings,
    MSMSettings,
    ParamSettings,
    WorkflowSettings,
)

ase_calculator = EMT()  # replace with your real MLP

# The MLP is used in three places — MSM, training sampling, and testing
# sampling — so all three must be updated.
ase_mlp = MLPSettings(
    ml_potential="ase",
    ml_system_kwargs={
        "calculator": ase_calculator,
        # Charge must be passed explicitly for ASE-backed MLPs.
        "info": {"charge": 0},
    },
)

ase_settings = WorkflowSettings(
    param_settings=ParamSettings(
        molecule_input_type="smiles",
        molecules="CCO",
        msm_settings=MSMSettings(mlp_settings=ase_mlp),
    ),
    device_type="cuda",
    training_sampling_settings=MLMDSamplingSettings(mlp_settings=ase_mlp),
    testing_sampling_settings=MLMDSamplingSettings(mlp_settings=ase_mlp),
)
from ase.calculators.emt import EMT

from presto.settings import (
    MLMDSamplingSettings,
    MLPSettings,
    MSMSettings,
    ParamSettings,
    WorkflowSettings,
)

ase_calculator = EMT()  # replace with your real MLP

# The MLP is used in three places — MSM, training sampling, and testing
# sampling — so all three must be updated.
ase_mlp = MLPSettings(
    ml_potential="ase",
    ml_system_kwargs={
        "calculator": ase_calculator,
        # Charge must be passed explicitly for ASE-backed MLPs.
        "info": {"charge": 0},
    },
)

ase_settings = WorkflowSettings(
    param_settings=ParamSettings(
        molecule_input_type="smiles",
        molecules="CCO",
        msm_settings=MSMSettings(mlp_settings=ase_mlp),
    ),
    device_type="cuda",
    training_sampling_settings=MLMDSamplingSettings(mlp_settings=ase_mlp),
    testing_sampling_settings=MLMDSamplingSettings(mlp_settings=ase_mlp),
)

7. Reload an ASE settings YAML¶

Runtime objects (the ASE calculator) cannot round-trip through YAML, so to_yaml replaces them with a placeholder. Reload with from_yaml(..., overwrite=...) to inject the live calculator before validation:

In [ ]:

Copied!





ase_settings.to_yaml("ase_workflow_settings.yaml")

ase_kwargs = {"ml_system_kwargs": {"calculator": ase_calculator}}

loaded = WorkflowSettings.from_yaml(
    "ase_workflow_settings.yaml",
    overwrite={
        "param_settings": {
            "msm_settings": {"mlp_settings": ase_kwargs}
        },
        "training_sampling_settings": {"mlp_settings": ase_kwargs},
        "testing_sampling_settings": {"mlp_settings": ase_kwargs},
    },
)
ase_settings.to_yaml("ase_workflow_settings.yaml")

ase_kwargs = {"ml_system_kwargs": {"calculator": ase_calculator}}

loaded = WorkflowSettings.from_yaml(
    "ase_workflow_settings.yaml",
    overwrite={
        "param_settings": {
            "msm_settings": {"mlp_settings": ase_kwargs}
        },
        "training_sampling_settings": {"mlp_settings": ase_kwargs},
        "testing_sampling_settings": {"mlp_settings": ase_kwargs},
    },
)

Next steps¶

How-to → Run from Python — the prose reference for the patterns shown here.
How-to → Fit a congeneric series — share parameters across related molecules.
Concepts → MLPs in presto — pick the right reference MLP.

Basic Walk-Through (Python API)¶

1. Build a WorkflowSettings¶

2. Round-trip via YAML¶

3. Run the fit¶

4. Use the bespoke force field¶

5. Inspect outputs¶

6. Bring your own MLP via ASE¶

7. Reload an ASE settings YAML¶

Next steps¶

1. Build a `WorkflowSettings`¶