Readium Logo

Protecting Web Publications against content copy

Editors:

Version: V1.0

Release date: 2020-12-15

The concept of reading books - i.e. long-form publications - in a Web browser emerged around 2010. It rapidly got some traction and the organization of the Books in Browsers event, between 2011 and 2016, is a good illustration of the raise and fall of the concept.

The idea of moving ebooks to the Web was reactivated in 2017 by W3C members of the Publishing Working Group (Charter). EDRLab was member of this Working Group and promoted there the use of the Readium Web Publication Manifest for representing Web Publications. But the result of this work, the Web Publications specification, wasn’t adopted by content providers and was therefore published as a W3C Note, not a Recommandation.

In many ways, it was the lack of content protection technologies for browsers that slowed down books in browsers. Without specific protection measures, it is really easy for users to copy or download copyrighted publications, convert them in any format they like (including EPUB and PDF) and share them widely. And this is something most publishers really fear.

The wisdom of the Web is “if you don’t want it to be shared, don’t share it on the Web”. No protection technology can fully protect Web content against copy. Web browsers are “transparent” when it come to textual and image content: Web content and code can be debugged in inspector tools and no secret can stay hidden for long. Only video (and therefore audio) content may be protected by a DRM, using Encrypted Media Extensions and specialized Content Decryption Modules. With the exception of video, no protection technique of Web content can be called a “DRM”.

That said, Amazon, Overdrive and a few others have developed browser based interfaces for commercial ebooks for years now; some were abandoned, some are still alive. These content providers have used different techniques to make the content harder to copy. Some approaches are fairly outdated at this point, but still appear to be acceptable to most publishers. Other more modern technologies, most of it open source, are far more robust.

The goal of this work is to provide guidelines to developers of Web Readers who need to achieve some level of protection for copyright protected Web Publications consumed by Web applications.

However it cannot be a definitive guide, as:

Note also that care has been taken to avoid heavy processing which would drastically slow down the preparation and rendering of Web content.

This work is therefore split into three parts:

Publishers who think that the solutions presented here provide an insufficient level of protection for their content should consider distributing their publications to LCP compliant mobile / desktop applications and e-ink devices only.

Acknowledgments

This work would not have been possible without the financial support of the Korea Copyright Commission, for the third consecutive year.

The following people were involved in this project: