Website Content Archiving and Retention Policy
Purpose
This policy establishes the principles and procedures for archiving and retaining website content to ensure long-term accessibility, authenticity and integrity. It supports compliance with relevant Australian legislation and digital-preservation standards, including the Archives Act 1983, Privacy Act 1988 and the NSLA Digital Preservation Principles.
Scope
This policy applies to all web content published on the Sydney Memorial website including:
- Static and dynamic web pages
- Images, videos and multimedia assets
- Downloadable documents and datasets
- Metadata, logs and associated records
It covers both publicly accessible and restricted (internal) web content.
Definitions
| Term | Definition |
|---|---|
| Permanent Content | Material of enduring value, such as reports, policies or publications |
| Transitory Content | Temporary or time-sensitive material such as event notices or announcements |
| Archival Copy | A stable, preserved version of web content retained for long-term reference |
| Dynamic Content | Web content generated through interactive or database-driven systems |
| Personal Content | Information about an identifiable individual as defined under the Privacy Act 1988 |
| WARC | Web ARChive file format used for storing complete web snapshots. |
Objectives
- Preserve digital assets of long-term cultural, historical or administrative value.
- Ensure compliance with Australian legal and regulatory obligations.
- Enable reliable public access via recognised digital archives (e.g. Pandora and Wayback Machine).
- Ensure recoverability of web assets in the event of data loss or technical failure.
- Support long-term sustainability through format migration and technology monitoring.
Policy Statements
Content Retention
| Content Type | Retention Period | Responsible Role | Disposal / Migration Action |
|---|---|---|---|
| Policies, Reports, Publications | Permanent | Archivist / Web Team | Archive indefinitely |
| Honour Roll / Crew Details | Permanent | Archivist / Web Team | Archive securely |
| Family Submitted Stories, Biographies, Photos and News Articles | Permanent | Web Team | Archive to WARC |
| Event Notices / Upcoming Commemorative Events | 6-12 months post-relevance | Web Team | Delete upon expiry |
| Multimedia Assets (Wreck Photos / Crew Portraits / Expedition Videos / Galleries) | Permanent or per Preferred Formats | Web Team / Archivist | Convert or migrate to sustainable format; Archive securely |
| Metadata & Logs | Permanent | Archivist / Web Team | Archive securely |
Archiving Procedures
- The website shall be archived bi-annually using approved tools to capture a full and verifiable snapshot.
- Each archive must include HTML, multimedia and downloadable files stored in WARC format.
- Archive versions shall be timestamped and version controlled.
- Archival procedures must comply with the Archives Act 1983 and organisational digital-preservation plans.
- Archived copies shall be stored securely in both on-site and off-site repositories to support disaster recovery.
Metadata
Each archived item must include descriptive, administrative and technical metadata based on Dublin Core and where applicable, PREMIS preservation metadata.
Metadata must record:
- Title, creator, subject, description, date of capture
- File format, checksum and technical environment
- Source URL and version identifier
- Rights, access level and retention status
Persistent identifiers (e.g. DOI or Handle) should be applied for long-term reference.
Roles and Responsibilities
| Roles | Responsibility |
|---|---|
| Web Team | Prepare data exports, ensure web-content versioning and quality. |
| Archivist / Record Manager | Manage metadata, initiate archival actions, ensure compliance with retention schedules. |
| IT / Hosting Provider | Maintain storage infrastructure, backup integrity and disaster-recovery procedures. |
| Records Governance Committee | Approve policy changes and oversee compliance reviews. |
| Digital Preservation Lead | Oversee technology watch, risk assessments and format – migration planning. |
Format Migration and Integrity
- The Archivist shall periodically review file formats for obsolescence risk.
- When a format is no longer supported, files must be migrated to a new, sustainable format without loss of fidelity.
- Fixity checks (e.g. checksums or hashes) must be performed annually to verify file integrity.
- All migrations and verifications must be logged in the preservation register.
- A preservation risk register shall be maintained and reviewed annually.
Security and Access Control
- Access to archived materials shall be restricted to authorised personnel.
- Sensitive or confidential web content shall be encrypted in storage and transmission.
- Access permissions and audit logs must be reviewed annually.
- Public releases must be approved by the Archivist or Governance Committee.
- All archived web materials that contain personal information must be handled in accordance with the organisation's Website Privacy Policy and the Privacy Act 1988 (Cth).
- Any personal data captured incidentally during web archiving (such as IP addresses or form submissions) must be processed and retained under the same principles of lawful, fair, and transparent use outlined in the Website Privacy Policy.
- Access restrictions must be reviewed every 5 years to determine whether materials can be released publicly, especially in cases involving deceased individuals.
- Robots.txt configuration must allow authorised web archiving crawlers unless restricted for privacy or legal reasons.
Review and Audit
This policy shall be reviewed annually or whenever legislative or operational changes occur.
A digital audit must be conducted every 12 months to verify:
- Completeness of archived materials
- Metadata accuracy
- File integrity and accessibility
- Conformance with retention and privacy requirements
The Records Governance Committee must document and endorse audit outcomes. Audit processes must include verification of WARC file readability and database export completeness.
Compliance and References
This policy is informed by the following standards and frameworks:
Website Privacy Policy – Governs the collection, handling, and disclosure of personal information collected via the website. All archiving processes must comply with its provisions.
- ISO 14721:2012 – Open Archival Information System (OAIS) Reference Model
- ISO 16363:2012 – Audit and Certification of Trustworthy Digital Repositories
- Archives Act 1983 (Cth)
- Privacy Act 1988 (Cth)
- NSLA Digital Preservation Principles (2018)
- Australian Government Web Archive Guidelines (AGWA)
- W3C Web Preservation Best Practices
Preferred Formats
| Content | Format | Rationale |
|---|---|---|
| Text | PDF/A, XML | Long-term readability |
| Images | TIFF, PNG | Lossless, archival grade |
| Video | MP4 (H.264), MXF | Broad and archival support |
| Audio | FLAC, WAV | High-fidelity, non-lossy |
| Web | WARC, HTML | Web preservation |
| Metadata | XML, JSON-LD | Interoperable and machine-readable |