Michael Ten @lemmy.world to Technology@lemmy.worldEnglish · 1 年前OpenAI transcribed over a million hours of YouTube videos to train GPT-4www.theverge.comexternal-linkmessage-square43fedilinkarrow-up1167arrow-down112cross-posted to: technology@lemmy.ml
arrow-up1155arrow-down1external-linkOpenAI transcribed over a million hours of YouTube videos to train GPT-4www.theverge.comMichael Ten @lemmy.world to Technology@lemmy.worldEnglish · 1 年前message-square43fedilinkcross-posted to: technology@lemmy.ml
minus-squareKnock_Knock_Lemmy_In@lemmy.worldlinkfedilinkEnglisharrow-up2arrow-down3·1 年前https://blog.gdeltproject.org/do-llms-truly-create-or-merely-arrange-just-how-much-of-an-llms-writing-is-original/
minus-squareBreakDecks@lemmy.mllinkfedilinkEnglisharrow-up1·1 年前 The differences between human and machine-generated text overlap support the image of LLMs as more “arrangers” than “creators” of text. So plagiarism…
minus-squareKnock_Knock_Lemmy_In@lemmy.worldlinkfedilinkEnglisharrow-up1·1 年前It only plagiarises you if you write something similar to lots of other people. Write something original and, even if it is in their training dataset, LLMs are highly unlikely to reproduce it.
https://blog.gdeltproject.org/do-llms-truly-create-or-merely-arrange-just-how-much-of-an-llms-writing-is-original/
So plagiarism…
It only plagiarises you if you write something similar to lots of other people.
Write something original and, even if it is in their training dataset, LLMs are highly unlikely to reproduce it.