proto-n t1_j30xu2t wrote on January 5, 2023 at 8:05 AM
Reply to [D] Simple Questions Thread by AutoModerator
Is there anywhere one can download the training dataset of GPT-2 (or an equivalent)? Or do you have to crawl it yourself for legal reasons?
Nvm, found it after an hour of searching: Common Crawl, OpenWebText2, and The Pile.