Software Heritage Co-Founder & Former Debian Leader | Stefano Zacchiroli
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “Software Heritage Co-Founder & Former Debian Leader | Stefano Zacchiroli” inside PodZeus.
In this episode of Tech Over Tea, host Brody Robertson interviews Stefano Zacchiroli, co-founder of Software Heritage and former Debian Project Leader. The conversation dives deep into the mission and mechanics of Software Heritage, a non-profit initiative dedicated to collecting, preserving, and sharing all publicly available software source code for future generations. Stefano explains that the archive goes beyond simple storage by preserving full version control histories—Git, Subversion, and more—creating a global, immutable record of software evolution. He emphasizes the cultural, historical, and scientific value of source code, citing examples like Apollo mission code and Quake’s inverse square root algorithm. The discussion also covers the technical and ethical challenges of archiving, including AI scraping, platform centralization risks, and the importance of integrity through the standardized SWHID identifier. Stefano highlights the project’s growth from a two-person research effort to a 20+ person team with diverse funding from public institutions and major tech companies like IBM, Microsoft, and AWS. He also touches on the broader implications of digital sovereignty, open science, and the need for research infrastructure to analyze the archive at scale. The episode concludes with a call to action for developers and enthusiasts to contribute through donations, code contributions, advocacy, or requesting specific repositories be archived. Key takeaways include: (1) Software Heritage preserves not just code, but the full development history, enabling future reconstruction and verification; (2) The SWHID standard provides cryptographic integrity guarantees, critical for scientific reproducibility and security; (3) Centralized platforms like GitHub are fragile—archiving is essential to prevent digital obsolescence; (4) The archive enables large-scale research on open source diversity, language evolution, and community dynamics; (5) Anyone can help by donating, contributing code, promoting the project, or requesting specific repositories be archived. The overall tone is optimistic and mission-driven, underscoring the importance of collective stewardship of digital heritage.
Software Heritage preserves the full history of software development, not just code, enabling future reconstruction and verification.
The SWHID identifier provides cryptographic integrity, ensuring archived software hasn't been altered over time.
Centralized platforms like GitHub are fragile—archiving is essential to prevent digital obsolescence and loss of historical context.
The archive enables large-scale research on open source diversity, language evolution, and community dynamics.
Anyone can help by donating, contributing code, advocating, or requesting specific repositories be archived.
Introduction to Stefano Zacchiroli and Software Heritage
Brody introduces Stefano Zacchiroli, co-founder of Software Heritage and former Debian Project Leader, and sets the stage for a deep dive into the mission and impact of the organization.
What Is Software Heritage and Why It Matters
“There is value in software source code. There is cultural value. There is technical value. There is a lot of effort that went into creating that software, and we don't want that value to be lost for future generations.”
Preserving Development History and the SWHID Standard
“The idea is that when you archive something in Software Heritage, you have received a different granularity... and later on 10 years from now you will retrieve the same thing... and verify that they match if they don't match something has been modified.”
The Scale, Infrastructure, and Sustainability of the Archive
Stefano details the archive’s current size (2 petabytes), infrastructure (on-premise servers, cloud mirrors, and independent mirrors), and funding model, emphasizing its non-profit, diversified, and sustainable approach.
Why Archive Software? Historical, Scientific, and Practical Value
“If we archive scientific papers because it's useful for the historical record and for the scientific record, then we should archive those pieces of source code.”
“We are not just archiving software—we are archiving the collective intelligence of humanity’s digital evolution.”
“Digital objects last forever or five years, whichever occurs first.”
“The idea is that when you archive something in Software Heritage, you have received a different granularity... and later on 10 years from now you will retrieve the same thing... and verify that they match if they don't match something has been modified.”
Host
Guest
Stefano Zacchiroli
person
Software Heritage
organization
GitHub
organization
Git
other
Debian
organization
SWHID
other
Internet Archive
organization
INRIA
organization
Roberto Di Cosmo
person
UNESCO
organization
Budgie Desktop Is In For A Qt Future | Joshua Strobl
Tech Over Tea • 2h 2m • 4/7/2026
Creator Of Bottles, VanillaOS And Singularity Desktop | Mirko Brombin
Tech Over Tea • 1h 4m • 4/7/2026
We Must Protect Your Software Freedom | Jurgen Gaeremyn
Tech Over Tea • 1h 50m • 4/7/2026
This Upcoming Metroidvania Looks Incredible | Lantern Jump
Tech Over Tea • 1h 19m • 4/10/2026
This Father & Son Team Made A Game | Sundae Buoy
Tech Over Tea • 2h 1m • 4/17/2026
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “Software Heritage Co-Founder & Former Debian Leader | Stefano Zacchiroli” inside PodZeus.
Start discovering podcast insights today
Start with a 7-day trial and explore a growing catalog of popular podcasts. No credit card required.
No credit card required • 7-day trial • Cancel anytime
