Great question. In practice, I spend a week crafting a 'good' weak dataset. The result is a modest performance gain, and the model becomes a lot more unpredictable (spans off by a token or so).
The correct answer nobody wants to hear is: "I should have spent a week labelling data"
Forget Snorkel and all that crap. It's harder to make good labelling functions than it is to label data, IMO
Empty-Painter-3868 t1_irtdww2 wrote
Reply to [D] What are your thoughts about weak supervision? by ratatouille_artist
Great question. In practice, I spend a week crafting a 'good' weak dataset. The result is a modest performance gain, and the model becomes a lot more unpredictable (spans off by a token or so).
The correct answer nobody wants to hear is: "I should have spent a week labelling data"
Forget Snorkel and all that crap. It's harder to make good labelling functions than it is to label data, IMO