southpaw's avatar

southpaw

@nycsouthpaw.bsky.social

People will really yell at you if you suggest AI models should pay for a license to all the works they ingest and cite their sources in their answers. I think it’s in part bc they believe in the magic of a machine intelligence. But we see more and more undeniable examples of straight up copying.

31 replies 436 reposts 1328 likes


DBCII's avatar DBCII @dbcii.bsky.social
[ View ]

They’d certainly be higher quality if they were investing paid content instead of free content.

1 replies 0 reposts 1 likes


Bjorn Fritz's avatar Bjorn Fritz @bjorn-fritz.bsky.social
[ View ]

Theres no such thing as magic, or creative machines. Creativity is based on the lived experience of being born, living and dying in a natural world – which machines will lack for a long time to come.

0 replies 0 reposts 0 likes


MU-RDDOQ 's avatar MU-RDDOQ @aaroncollom.bsky.social
[ View ]

the citation thing is weird because that functionality makes AI useful? Verifiable? If you resist citation you are defending an ability to bullshit?

2 replies 0 reposts 19 likes


Pat Scaramuzza's avatar Pat Scaramuzza @genocideman.bsky.social
[ View ]

It's called Overfitting. The more specific your question, the more likely it'll find a single reference that answers your question, and it'll parrot that reference verbatim. Normally a LLM will blend together all it knows about a topic, but if it only knows one thing that's what you'll get.

0 replies 0 reposts 1 likes


nottooloud's avatar nottooloud @nottooloud.bsky.social
[ View ]

Not of it is A.I., of course. It's machine learning, which is, by definition, copying. "When I see this, put that next to it." Of course they should pay the creator of "that".

0 replies 0 reposts 0 likes


TommyBenBergman's avatar TommyBenBergman @tbapple.bsky.social
[ View ]

Expecting AI models to cite sources would be similar to requiring Social Media companies to moderate posts or requiring bars to ID their patrons.

0 replies 0 reposts 6 likes


Zach Allen-Kelly's avatar Zach Allen-Kelly @zachwallen.bsky.social
[ View ]

It was clear this was stolen from something because coconut is too funny of a closer for it to have come up with on it's own

0 replies 0 reposts 2 likes


John Price's avatar John Price @calmthoughts.bsky.social
[ View ]

Yes. No tech, but i believe that.

0 replies 0 reposts 1 likes


slrellison.bsky.social's avatar slrellison.bsky.social @slrellison.bsky.social
[ View ]

Do we charge human authors a license to all the works they ingest? And require them to cite their sources? Not, I think, for fiction, and the latter mainly for verification, not credit. Flagrant plagiarism is dealt with when it happens; not in case it might.

0 replies 0 reposts 0 likes


Neal Curtis's avatar Neal Curtis @nihilcurtis.bsky.social
[ View ]

📌

0 replies 0 reposts 0 likes


Samantha Ferreira Wants to Talk About Anime History's avatar Samantha Ferreira Wants to Talk About Anime History @sam-animeherald.bsky.social
[ View ]

At this point, my immediate reaction to LLM engines scraping my publication's stuff is "fuck you, pay me." I never gave consent, and I refuse to grant that to simpering tech bros who sneer that they're "going to put us out of business"

0 replies 1 reposts 14 likes


southpaw's avatar southpaw @nycsouthpaw.bsky.social
[ View ]

To google’s credit, when I went to replicate this, it does cite the 2019 Quora post—though obviously it doesn’t detect the joke.

3 replies 4 reposts 104 likes


Lyndon Hood's avatar Lyndon Hood @lyndonhood.bsky.social
[ View ]

I feel like actually producing true and accurate references is still hard in the extreme sense. I mean they can fully produce copies of things but from it's point of view it's not referencing any particular thing. (And you may have seen what happens when you ask for references *in* the output )

2 replies 0 reposts 2 likes


Kanna Banana 🏳️‍⚧️🇧🇷's avatar Kanna Banana 🏳️‍⚧️🇧🇷 @kanna-banana.bsky.social
[ View ]

i knew it was too funny to not be made by a real person

0 replies 0 reposts 2 likes


Sean Eric Fagan's avatar Sean Eric Fagan @kithrup.bsky.social
[ View ]

As far as I can tell, google didn't copy there, it is just a link to quora. This could be a presentation issue -- I did the experiment in Safari on macOS, and it certainly did show the link.

0 replies 0 reposts 1 likes


jacob's avatar jacob @happinessdata.bsky.social
[ View ]

When I run it, it at least does give its (horrible) sourcing

0 replies 0 reposts 2 likes


LibrarianWriterGeek's avatar LibrarianWriterGeek @sctadsen.bsky.social
[ View ]

Imagine holding AI to more or less the same standard we hold college undergrads to...

0 replies 0 reposts 16 likes


C. W. House's avatar C. W. House @cwhouse.bsky.social
[ View ]

This has been my experience with AI-generated source code. The AI-generated solution is usually a direct copy of the top Stack Exchange hit. (When it isn't, it's often unusable and will hallucinate things like nonexistent packages or else it mixes up languages.)

0 replies 0 reposts 1 likes


Lester Sabotage's avatar Lester Sabotage @lesabot.bsky.social
[ View ]

Considering every model is the combination of all of its inputs any "citation" would just be the list of everything that the model was trained on

0 replies 0 reposts 0 likes


Amelia McNamara's avatar Amelia McNamara @ameliamn.bsky.social
[ View ]

It's been a while since I've done a true spit-take, thanks for this.

0 replies 0 reposts 1 likes


Ali Fleih's avatar Ali Fleih @alifleih.bsky.social
[ View ]

Easy fix: Exclude all Quora results.

0 replies 0 reposts 1 likes


Murder Hornet's avatar Murder Hornet @knife-gator.bsky.social
[ View ]

AI is just plagiarism with extra steps.

0 replies 0 reposts 0 likes


Godspeed You! Woke Moralists's avatar Godspeed You! Woke Moralists @dashwallkick.bsky.social
[ View ]

The fact that this is how it operates and this is literally all it can do, despite the obvious total loss of context, should be a warning to the people buying in on it. And yet!

0 replies 0 reposts 8 likes


❀°。der Siebenschläfer  *.゚✿ ⋆'s avatar ❀°。der Siebenschläfer *.゚✿ ⋆ @sababausa.bsky.social
[ View ]

OpenAI argues in their copyright defense that getting it to spit out rote copies of training data requires the user to “hack” the platform in violation of the terms, but it happens a lot and it’s trivially easy to induce

bsky.app/profile/saba...

1 replies 0 reposts 4 likes


Dave's avatar Dave @davebrowne.bsky.social
[ View ]

I'm positive lawyers are working on how they can get AI classified as a person so all the scraping can be called education so they don't have to pay anyone shit.

1 replies 0 reposts 3 likes


Jonny Lobo's avatar Jonny Lobo @jonnylobo.bsky.social
[ View ]

It's because aside from the labor-saving push from the top, most enthusiasm for AI comes from a fundamental disdain for expertise. Citing sources, providing compensation, such gestures of considering the people who actually produce things ruins the experience of mindless consumption for them.

0 replies 0 reposts 6 likes


R. Wm. "Ruedii" 's avatar R. Wm. "Ruedii" @rwruedii.bsky.social
[ View ]

AI lacks a sense of humor and can't detect satire. This could be a MAJOR problem.

0 replies 0 reposts 0 likes


iquanyin 's avatar iquanyin @iquanyin.bsky.social
[ View ]

🤣

0 replies 0 reposts 0 likes


Wattle of Bits 🏳️‍🌈's avatar Wattle of Bits 🏳️‍🌈 @wattle.bsky.social
[ View ]

It's not intelligent! It's Autocomplete!

1 replies 0 reposts 1 likes


Mark's avatar Mark @marksimploding.bsky.social
[ View ]

I just thought it was because they were the ones stealing.

0 replies 0 reposts 0 likes


picklefactory's avatar picklefactory @picklefactory.org
[ View ]

my understanding has been that fair use ultimately permits this. though feel free to argue with Masnick for the next 48 hours about it, I think I might actually learn something compared to the usual interlocutors telling him that §230 makes Facebook a publisher

1 replies 0 reposts 4 likes