Manifest List

An Iceberg metadata file containing details about all the manifest files in a single table snapshot.

Manifest List is a metadata file in the Apache Iceberg format that represents a single snapshot of a table. It lists all the manifest files that make up that snapshot, providing a unified view of the table’s state.

How it Works

When a write transaction commits, Iceberg generates a new snapshot mapped to a manifest list.

  • Snapshot Metadata: Contains the snapshot ID, parent snapshot ID, creation timestamp, and the summary of changes (e.g., records added).
  • Manifest Tracking: Lists the file paths of all active manifest files associated with the snapshot.
  • Partition Summary: Stores the range of partition values contained in each manifest file. Query engines scan these summaries to skip entire manifest files during query execution.

Lakehouse & Agentic Relevance

In a data lakehouse, manifest lists enable transaction isolation and time travel capabilities. Because each manifest list represents a complete snapshot, query engines can access previous table states. If an autonomous agent needs to query historical data or run transaction-consistent audits, it leverages these snapshot states. Dremio queries the manifest list to identify the active files for any given snapshot, executing queries without locking the table.