ML Data Fair Use
... is still a grey zone and and a form of Western exploitation and molding the whole world to their image / system / etc:
By offering Copilot as an alternative interface to a large body of open-source code, Microsoft is doing more than severing the legal relationship between open-source authors and users. Arguably, Microsoft is creating a new walled garden that will inhibit programmers from discovering traditional open-source communities. Or at the very least, remove any incentive to do so. Over time, this process will starve these communities. User attention and engagement will be shifted into the walled garden of Copilot and away from the open-source projects themselves—away from their source repos, their issue trackers, their mailing lists, their discussion boards. This shift in energy will be a painful, permanent loss to open source.
githubcopilotinvestigation.com…
Unless you have some corporate / deep pocket sponsors and / or you are doing something truly unique, there is LESS AND LESS incentive to publish FOSS.
I like the story of libraries like nmslib ... where they get all the accolades possible, but no visible financial benefits. The system just does not function this way.