I do not intend what I am about to say as a defense of either Reddit or the LLM companies, but given that Reddit posts and discussions are already on the public Internet, shouldn't we be assuming that most everything posted there has already been hoovered up by these models? What am I missing here?

