Web crawlers and precision data sets
OpenAI released details about a crawling bot it is using to collect data from the web for its training set. But the best model might not just be build on the biggest training set.
Author’s note: Wednesday’s issue will be coming out on Thursday this week.
In addition, Tuesday’s issue next week will be coming out on Thursday. This is due to a trip I’ll be taking to New York for the next two weeks. If you’re based in New York, let’s hang out! You can find my email …