armchair-progamer

armchair-progamer t1_j4ovjjm wrote

> digital watermark

Wouldn't it be easier to store the model outputs or a perceptual hash, and then provide a way to determine if some text is similar to prior ChatGPT output? I assumed they were already doing something like this to collect usage data as they scrape new content.

ChatGPT already has a unique writing style, I'm not sure how you could add anything to the text which couldn't be trivially removed and do better

1