It is difficult to get a man to understand something, when his
salary depends on his not understanding it!—Upton Sinclair,
I, Candidate for Governor: And How I Got
Licked
It’s not like ChatGPT is the only LLM. GPT is pretty broad and general. Remember that MS has Co-Pilot which is literally entirely built on GitHub’s codebase knowledge. Different data sets will produce different kinds of useful predictors.
And no SO means worse LLMs. Chatgpt relies on scrapes of SO, reddit, forums and GitHub discussion pages.
It’s not like ChatGPT is the only LLM. GPT is pretty broad and general. Remember that MS has Co-Pilot which is literally entirely built on GitHub’s codebase knowledge. Different data sets will produce different kinds of useful predictors.
And the scrapping stage already happened. A new one will be useful only after enough human-made content is added, or it a major change in tech happen.