Crucial Flaw in Fool Archive Utility

WatchingTheHerd · August 1, 2024, 5:17pm

In the prior incarnation of the discussion boards, each page was a standalone HTML page rendered by the server. Activity on a board could be sorted by author, so an industrious person could write a script that generated a list of recent activity under their name, parsed that list for each message_id, then fetched each post in turn and saved it locally as an archive.

The new discussion board uses a single-page app design where the list of post activity isn’t rendered statically in the outer page but fetched dynamically within the page. This makes older “screen scrape” approaches useless.

The new board engine includes an archive utility that lets a user request that all of their activity be extracted, zipped and made available as a download. The process works very quickly but has a crucial flaw. The exported post content is saved in a CSV-formatted flat file. That CSV format is incompatible with the HTML metadata within the post content, which makes the resulting CSV text file useless for saving post content without a lot of extra work.

Is there any way this archive utility could just create individual flat files with the raw HTML of each post with a filename scheme based on message_id that ensures each file is distinct?
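
To make the request concrete, here is a rough Python sketch of the kind of output I mean: one HTML file per post, named after its message_id. It is only a sketch under assumptions. The column names "message_id" and "content" are hypothetical (the real export may use different headers), and it only works if the export quotes its fields the standard CSV way, which may not hold for the actual file.

```python
import csv
import io
from pathlib import Path


def split_export(csv_text, out_dir, id_col="message_id", body_col="content"):
    """Write one HTML file per CSV row, named after that row's message id.

    id_col and body_col are assumed column names, not the real export headers.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    # newline="" leaves line breaks inside quoted fields untouched.
    reader = csv.DictReader(io.StringIO(csv_text, newline=""))
    written = []
    for row in reader:
        # Keep only letters and digits so the id is safe as a filename.
        safe_id = "".join(ch for ch in row[id_col] if ch.isalnum())
        if not safe_id:
            continue  # skip rows without a usable id
        path = out / f"{safe_id}.html"
        # Write bytes so the post's HTML is stored exactly as exported.
        path.write_bytes(row[body_col].encode("utf-8"))
        written.append(path.name)
    return written
```

Calling split_export(open("export.csv", encoding="utf-8", newline="").read(), "archive") would fill an "archive" folder with files like 101.html (the file name "export.csv" is also just a placeholder). If two rows ever reduced to the same id, the later one would overwrite the earlier one, so a real tool would want a check for that.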

WTH
