reviews.llvm.org became a read-only archive
2023-12-30 13:0:0
Author: maskray.me(查看原文)
阅读量:9
收藏
For approximately 10 years, reviews.llvm.org functioned as the code
view site for the LLVM project, utilizing a Phabricator instance. This
website hosted numerous invaluable code review discussions. However,
following LLVM's transition
to GitHub pull requests, there arises a necessity for a read-only
archive of the existing Phabricator instance.
The intent is to eliminate a SQL engine. Phabicator operates on a complex
database scheme. To minimize time investment, the most feasible
approach seems to involve downloading the static HTML pages and
employing a lightweight scraping process.
Raphaël Gomès developed phab-archive
to serve a read-only archive for Mercurial's Phabricator instance. I have modified
the code to suit reviews.llvm.org.
At this juncture, the only requirement is someone with domain access
to redirect reviews.llvm.org to the archive website. Then we can obtain a
HTTPS certificate.
Data
The file hierarchy is quite straightforward.
archive/unprocessed/diffs contains raw HTML pages while
templates/diffs contains scraped HTML pages alongside patch
files.