Five@beehaw.org to Technology@beehaw.orgEnglish · 11 months agoChatGPT broke the Turing test — the race is on for new ways to assess AIwww.nature.comexternal-linkmessage-square218fedilinkarrow-up1198arrow-down10
arrow-up1198arrow-down1external-linkChatGPT broke the Turing test — the race is on for new ways to assess AIwww.nature.comFive@beehaw.org to Technology@beehaw.orgEnglish · 11 months agomessage-square218fedilink
minus-squareDroggl@lemmy.sdf.orglinkfedilinkarrow-up2·11 months agoI dont remember the numbers but iirc it was covered by one of the validation datasets and GPT 4 did quite well on it
minus-squareMaestro@kbin.sociallinkfedilinkarrow-up2·edit-211 months agoYeah, but did it do well on the specific examples from the Winograd paper? Because ChatGPT probably just learned those since they are well known and oft repeatef. Or does it do well on brand new sentences made according to the Winograd scheme?
I dont remember the numbers but iirc it was covered by one of the validation datasets and GPT 4 did quite well on it
Yeah, but did it do well on the specific examples from the Winograd paper? Because ChatGPT probably just learned those since they are well known and oft repeatef. Or does it do well on brand new sentences made according to the Winograd scheme?