Challenge
Moveworks ingests and surfaces archived Confluence spaces/pages because archived items remain reachable via deep links. This leads to stale content appearing in bot results and requires manual removal of spaces from ingestion whenever they are archived.
Product Idea
Enable an option to automatically exclude archived Confluence spaces and pages from ingestion.
Key points:
- Use Confluence’s existing
status = "archived"metadata. - Ignore spaces/pages marked as archived.
- Optional toggle in the connector: “Exclude archived content.”
- Prevents stale results and removes manual cleanup.
Proposed Enhancement
Implement logic in the Confluence connector to:
- Detect
status = "archived"on spaces using the Confluence REST API (e.g.,/api/v2/spaces/{id}). - Exclude pages belonging to archived spaces from ingestion.
- Optionally detect archived pages (those with the
/pages/archivedURL pattern or other metadata) and exclude them individually. - Provide a toggle in the Moveworks Integrations UI:
- “Exclude archived spaces”
- “Exclude archived pages”
- Ensure Moveworks search results return only current knowledge matching what users see in Confluence.
Note: I’ve already raised a support ticket and the team have confirmed this is currently not possible. Only workaround is to adjust Space Keys URL to add “Archive” to them then apply a DSL filter which is not ideal. e.g.
NOT (("archive" IN article.article_url.$LOWERCASE()))