Skip to main content
When you add a recognized scientific file, DataErai reads it and extracts structured metadata — the settings and parameters recorded by the instrument. That metadata becomes searchable, so you can later find an asset by what’s inside it instead of remembering where you put it.

Supported formats

DataErai currently extracts metadata from these formats:
FormatExtensionTypical source
HDF5.h5, .hdf5General scientific data and model files
Panalytical XRDML.xrdmlX-ray diffraction (XRD)
Gatan DM4.dm4Electron microscopy
WaveMetrics IBW.ibwIgor Pro waveforms
Other file types upload normally; they just don’t have extracted metadata. Support for more formats is added over time.

Review before you upload

When you add a recognized file in the upload panel, DataErai shows a metadata preview before the upload starts:
  • See the extracted fields, grouped to match the instrument (for example, XRD shows X-ray source, Geometry, and Scan).
  • Edit any value inline, or switch to the raw JSON view.
  • Use the Attach toggle to decide whether the extracted metadata is saved with the asset.
DataErai also classifies the data type of the file (for example, xrd or tem-session) to help with organization and search.
Very large files (over about 100 MB) skip the in-dialog preview, but their metadata is still extracted automatically after the upload finishes.

Re-run extraction later

If a file uploaded before extraction was available, or you want to refresh it, open the asset and choose Extract metadata to run it again.

Validation hints

For some formats, the preview flags likely problems so you can catch a bad file early — for example, an XRD scan with an unusually low tube voltage, zero current, or a missing anode material.

It’s searchable

Extracted metadata is available to search alongside the metadata you add yourself, so you can filter assets by instrument settings and other captured fields.

Next steps

Upload in the browser

Add files and review their extracted metadata.

Search & discover

Find assets by metadata, tags, and more.