David Mimno

@dmimno.bsky.social

571 followers 581 following 195 posts

He teaches information science at Cornell


David Mimno @dmimno.bsky.social

Sorry, but the actual experience of VHS video stores was wandering around for half an hour failing to agree on anything and then randomly grabbing something from New Releases. Pretty close to Netflix honestly

0 replies 0 reposts 9 likes


Reposted by David Mimno

Kate Starbird @katestarbird.bsky.social

Whether Biden drops out or continues, I expect a primary target of foreign info operations this year to be the legitimacy of the Democratic candidate and the legitimacy of the courts in relation to the GOP candidate. Expect them to target audiences on both sides and exploit organic criticism.

4 replies 68 reposts 191 likes


David Mimno @dmimno.bsky.social

It is a national disgrace that a dinky island with 20% of our population has like 50% more representatives in its lower house

0 replies 0 reposts 4 likes


David Mimno @dmimno.bsky.social

masonry: not sure what you mean, is there an example you like?
y-axis: tried it, most topics just look very flat
average: view as proportion might answer this better?
membership: this is mixed-membership token-level LDA. 65k docs takes ~20 mins on one CPU.

0 replies 0 reposts 0 likes


David Mimno @dmimno.bsky.social

Llama3 title: "Biomedical Poetry"

0 replies 0 reposts 2 likes


David Mimno @dmimno.bsky.social

But can your neural network give you topics like THIS??? "drug poetry protein molecular chemical poems adverse discovery biological reactions drugs chemistry reaction literature molecules molecule poem properties interaction proteins"

2 replies 0 reposts 2 likes


David Mimno @dmimno.bsky.social

No one is above the law, but some people are more not above the law than others

0 replies 0 reposts 5 likes


David Mimno @dmimno.bsky.social

So if someone heckles the State of the Union, an official duty specifically enumerated in the constitution, can the president shoot them?

2 replies 0 reposts 1 likes


David Mimno @dmimno.bsky.social

Alaric the Goth was probably like “hey you know that band The Cure? They’re pretty chill” and then FOR ALL TIME…

0 replies 0 reposts 1 likes


David Mimno @dmimno.bsky.social

Is this about the new budget model?

0 replies 0 reposts 3 likes


Reposted by David Mimno

Melanie Walsh @mellymeldubs.bsky.social

We release our code and 1.4k public domain poems with form tags, as well as metadata about their presence in popular pretraining datasets and memorization by GPT-4:

github.com/maria-antoni...

1 replies 3 reposts 5 likes


Reposted by David Mimno

Melanie Walsh @mellymeldubs.bsky.social

Poetry is weirdly prominent in LLM conversations. But what do models really "know" about poetry?

We tested how well LLMs can recognize 20+ poetic forms in English & probed major pretraining datasets to see which poems might be memorized.

New preprint: arxiv.org/abs/2406.18906

1 replies 17 reposts 39 likes


David Mimno @dmimno.bsky.social

I decided against adding 🙄🤷‍♂️, but you can imagine it

0 replies 0 reposts 2 likes


David Mimno @dmimno.bsky.social

I'm looking for a good example of LLM fine-tuning + preference learning that's compelling to students but small enough to run as an in-class demo. Ideas?

1 replies 4 reposts 6 likes


David Mimno @dmimno.bsky.social

That would be one of the prompts I'd like to see variants of! I'm about to do some experiments myself (realized I was missing a big chunk of arxiv cs.CL, rerunning models now)

1 replies 1 reposts 1 likes


David Mimno @dmimno.bsky.social

In practice it seems to be BERTopic (i.e., hard clustering on doc embeddings + post-hoc keywords). I think it's mostly popular because the package is well written. I'm most excited about the Stanford CHI paper version that seems to let you have a dialog of "like that, but with more..."

2 replies 1 reposts 3 likes
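
A rough sketch of the "hard clustering on doc embeddings + post-hoc keywords" recipe mentioned in the post above. Everything here is an assumption for illustration: normalised word counts stand in for neural document embeddings, a plain k-means stands in for whatever clustering a real package uses, raw within-cluster word frequency stands in for its keyword weighting, and the four toy documents and the function names (`embed`, `hard_cluster`, `cluster_keywords`) are hypothetical. It is not the BERTopic API.

```python
import math
from collections import Counter

def embed(doc, vocab):
    """Toy stand-in for a document embedding: L2-normalised word counts."""
    counts = Counter(doc.split())
    vec = [float(counts[w]) for w in vocab]
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def hard_cluster(vectors, k, iters=10):
    """Plain k-means: each document gets exactly one cluster label.
    Centroids are initialised from the first k vectors (an assumption
    made only to keep this sketch deterministic)."""
    centroids = [list(v) for v in vectors[:k]]
    labels = [0] * len(vectors)
    for _ in range(iters):
        # Assignment step: nearest centroid wins (hard membership).
        labels = [min(range(k), key=lambda c: sq_dist(v, centroids[c]))
                  for v in vectors]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, l in zip(vectors, labels) if l == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def cluster_keywords(docs, labels, k, top_n=3):
    """Post-hoc keywords: the most frequent words among each cluster's
    documents, with ties broken alphabetically."""
    keywords = []
    for c in range(k):
        counts = Counter(w for d, l in zip(docs, labels) if l == c
                         for w in d.split())
        ranked = sorted(counts, key=lambda w: (-counts[w], w))
        keywords.append(ranked[:top_n])
    return keywords

# Hypothetical toy corpus, not data from the thread.
docs = [
    "protein drug molecule reaction",
    "poem poetry verse stanza",
    "drug protein chemical molecule",
    "poetry poem sonnet verse",
]
vocab = sorted({w for d in docs for w in d.split()})
vectors = [embed(d, vocab) for d in docs]
labels = hard_cluster(vectors, k=2)
keywords = cluster_keywords(docs, labels, k=2)
print(labels, keywords)
```

Note the contrast with the mixed-membership LDA described earlier in the feed: here each document lands in exactly one cluster, and the keywords are computed only after clustering is finished.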


David Mimno @dmimno.bsky.social

DM for a shared doc

0 replies 0 reposts 2 likes


David Mimno @dmimno.bsky.social

I’m really impressed by llama3 in ollama for Humanities applications. Thinking of adding a prompt collection to aiforhumanists.com, would people contribute?

6 replies 8 reposts 25 likes


David Mimno @dmimno.bsky.social

I appreciate that Google is trying to attribute suggested code from Colab to sources, but the threshold is set waaaay too low. Possible opportunities for malicious license trolls.

0 replies 0 reposts 3 likes


David Mimno @dmimno.bsky.social

Laure and I found that there are a lot of terms that are author-specific beyond named entities. I wonder if there’s a similar boost from swapping synonyms there too? (Cf. “authorless topic models”)

1 replies 0 reposts 4 likes


David Mimno @dmimno.bsky.social

After 4mins, pretty unimpressive. Seems like a good model but not for this.

0 replies 0 reposts 2 likes


David Mimno @dmimno.bsky.social

Got to the point of spinning progress thingy! Gemini in colab was quite helpful in debugging an image format error

1 replies 0 reposts 2 likes


David Mimno @dmimno.bsky.social

I want to try the new Florence models OCR but stuck yak-shaving on JBIG2 file formats

1 replies 0 reposts 4 likes


David Mimno @dmimno.bsky.social

What did little tiny ants do before they invented sidewalks?

0 replies 2 reposts 4 likes


David Mimno @dmimno.bsky.social

I know you’re working on a write up, but any info on how you’re prompting it would be great!

1 replies 0 reposts 6 likes


David Mimno @dmimno.bsky.social

UMass administration building

0 replies 2 reposts 11 likes


David Mimno @dmimno.bsky.social

Was debating whether to use the p word

1 replies 0 reposts 3 likes


David Mimno @dmimno.bsky.social

It’s great that ACL ARR has a section for Comp Soc Sci and Cultural Analytics, but all the keywords seem to be CSS. What are the key NLP problems for CA? Narrative? Character? Dealing with complex documents? Language change?

1 replies 2 reposts 7 likes


David Mimno @dmimno.bsky.social

If mosquitoes were like “hey can I have a minuscule amount of your blood so I can have my babies?” I’d be like “sure! Here you go” but they have to be dicks about it

0 replies 1 reposts 6 likes


David Mimno @dmimno.bsky.social

Just saw Hit Man. Seemed totally plausible except he’s supposed to be a professor of Psychology AND Philosophy? Like wtf

2 replies 2 reposts 7 likes


David Mimno @dmimno.bsky.social

This is chapter 80 of De rebus gestis Ricardi Primi. In chapter 81 there’s a tour of other English cities and how bad they are, e.g. Bath is “ad portas inferi”

0 replies 1 reposts 2 likes


Reposted by David Mimno

David Mimno @dmimno.bsky.social

Galaxy brain: Big language models represent your docs in the context of a massive background collection. That may or may not be what you want.

0 replies 2 reposts 3 likes


Reposted by David Mimno

David Mimno @dmimno.bsky.social

So AI is going to destroy us by blindly optimizing objective functions despite devastating practical or human cost? Don't we already have this, and it's called private equity?

1 replies 0 reposts 8 likes


David Mimno @dmimno.bsky.social

I can guarantee you that fucksmith did not see one penny of that $60m

0 replies 0 reposts 2 likes


David Mimno @dmimno.bsky.social

Waikiki could be one of the world’s great biking cities but they somehow manage to treat everything but cars as an afterthought AND create a miserable experience for drivers

2 replies 1 reposts 3 likes


David Mimno @dmimno.bsky.social

Also, Master of the Senate

0 replies 0 reposts 2 likes


David Mimno @dmimno.bsky.social

Tacitus, “Agricola”

0 replies 0 reposts 1 likes


Reposted by David Mimno

David Mimno @dmimno.bsky.social

It’s been pretty clear for a while they use punctuation tokens this way

0 replies 0 reposts 1 likes


David Mimno @dmimno.bsky.social

Yes, this photo was taken at Ithaca Beer Company

0 replies 0 reposts 2 likes


David Mimno @dmimno.bsky.social

In the totality, Chimney Bluffs NY

0 replies 0 reposts 5 likes