Books3 corpus would like you to know that all the data in it is from copyrighted books. It has reportedly been widely used in closed-source AI LLMs. “Rules for thee, not for me” shit. They’ll break copyright and then copyright what they made from it.