David Mimno

@dmimno.bsky.social

571 followers 581 following 195 posts

He teaches information science at Cornell

David Mimno @dmimno.bsky.social
[ View ] Jul 07, 2024 at 22:00 UTC

Sorry, but the actual experience of VHS video stores was wandering around for half an hour failing to agree on anything and then randomly grabbing something from New Releases. Pretty close to Netflix honestly

0 replies 0 reposts 9 likes

Reposted by David Mimno

Kate Starbird @katestarbird.bsky.social
[ View ] Jul 06, 2024 at 19:55 UTC

Whether Biden drops out or continues, I expect a primary target of foreign info operations this year to be the legitimacy of the Democratic candidate and the legimitacy of the courts in relation to the the GOP candidate. Expect them to target audiences on both sides and exploit organic criticism.

4 replies 68 reposts 191 likes

David Mimno @dmimno.bsky.social
[ View ] Jul 04, 2024 at 21:28 UTC

It is a national disgrace that a dinky island with 20% of our population has like 50% more representatives in its lower house

0 replies 0 reposts 4 likes

David Mimno @dmimno.bsky.social
[ View ] Jul 04, 2024 at 13:23 UTC

sickos.jpg

0 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] Jul 03, 2024 at 14:39 UTC

masonry: not sure what you mean, is there an example you like? y-axis: tried it, most topics just look very flat average: view as proportion might answer this better? membership: this is mixed-membership token-level LDA. 65k docs takes ~20 mins on one CPU.

0 replies 0 reposts 0 likes

David Mimno @dmimno.bsky.social
[ View ] Jul 02, 2024 at 16:15 UTC

Llama3 title: "Biomedical Poetry"

0 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] Jul 02, 2024 at 16:14 UTC

Updated topic model of arxiv NLP, now with Llama3-generated topic titles! mimno.infosci.cornell.edu/arxivcl/

llama up, chatgpt down

1 replies 3 reposts 13 likes

David Mimno @dmimno.bsky.social
[ View ] Jul 02, 2024 at 15:28 UTC

But can your neural network give you topics like THIS??? "drug poetry protein molecular chemical poems adverse discovery biological reactions drugs chemistry reaction literature molecules molecule poem properties interaction proteins"

2 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] Jul 02, 2024 at 13:06 UTC

No one is above the law, but some people are more not above the law than others

0 replies 0 reposts 5 likes

David Mimno @dmimno.bsky.social
[ View ] Jul 02, 2024 at 12:50 UTC

So if someone heckles the State of the Union, an official duty specifically enumerated in the constitution, can the president shoot them?

2 replies 0 reposts 1 likes

David Mimno @dmimno.bsky.social
[ View ] Jul 02, 2024 at 01:14 UTC

Alaric the Goth was probably like “hey you know that band The Cure? They’re pretty chill” and then FOR ALL TIME…

0 replies 0 reposts 1 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 29, 2024 at 17:47 UTC

Is this about the new budget model?

0 replies 0 reposts 3 likes

Reposted by David Mimno

Melanie Walsh @mellymeldubs.bsky.social
[ View ] Jun 28, 2024 at 23:27 UTC

We release our code and 1.4k public domain poems with form tags, as well as metadata about their presence in popular pretraining datasets and memorization by GPT-4:

github.com/maria-antoni...

1 replies 3 reposts 5 likes

Reposted by David Mimno

Melanie Walsh @mellymeldubs.bsky.social
[ View ] Jun 28, 2024 at 23:20 UTC

Poetry is weirdly prominent in LLM conversations. But what do models really "know" about poetry?

We tested how well LLMs can recognize 20+ poetic forms in English & probed major pretraining datasets to see which poems might be memorized.

New preprint: arxiv.org/abs/2406.18906

1 replies 17 reposts 39 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 28, 2024 at 14:29 UTC

Yes! Hahaha yes!

0 replies 1 reposts 5 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 28, 2024 at 14:25 UTC

I decided against adding adding 🙄🤷‍♂️, but you can imagine it

0 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 27, 2024 at 19:14 UTC

Llama3 examples here: mimno.infosci.cornell.edu/arxivcl/
"Be concise. Provide just a single short text title in doublequotes for the topic associated with the following words:"

0 replies 1 reposts 3 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 27, 2024 at 17:42 UTC

I'm looking for a good example of LLM fine-tuning + preference learning that's compelling to students but small enough to run as an in-class demo. Ideas?

1 replies 4 reposts 6 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 27, 2024 at 17:37 UTC

That would be one of the prompts I'd like to see variants of! I'm about to do some experiments myself (realized I was missing a big chunk of arxiv cs.CL, rerunning models now)

1 replies 1 reposts 1 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 27, 2024 at 17:26 UTC

In practice it seems to be berttopic (ie hard clustering on doc embeddings + post-hoc keywords). I think it's mostly popular because the package is well written. I'm most excited about the Stanford CHI paper version that seems to let you have a dialog of "like that, but with more..."

2 replies 1 reposts 3 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 27, 2024 at 14:37 UTC

DM for a shared doc

0 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 27, 2024 at 13:07 UTC

I’m really impressed by llama3 in ollama for Humanities applications. Thinking of adding a prompt collection to aiforhumanists.com, would people contribute?

6 replies 8 reposts 25 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 25, 2024 at 19:23 UTC

I appreciate that Google is trying to attribute suggested code from Colab to sources, but the threshold is set waaaay too low. Possible opportunities for malicious license trolls.

0 replies 0 reposts 3 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 22, 2024 at 21:23 UTC

Laure and I found that there’s a lot of terms that are author specific beyond named entities, I wonder if there’s a similar boost from swapping synonyms there too? (Cf “authorless topic models”)

1 replies 0 reposts 4 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 21, 2024 at 19:40 UTC

After 4mins, pretty unimpressive. Seems like a good model but not for this.

0 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 21, 2024 at 19:29 UTC

Got to the point of spinning progress thingy! Gemini in colab was quite helpful in debugging an image format error

1 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 21, 2024 at 19:05 UTC

I want to try the new Florence models OCR but stuck yak-shaving on JBIG2 file formats

1 replies 0 reposts 4 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 21, 2024 at 12:27 UTC

What did little tiny ants do before they invented sidewalks?

0 replies 2 reposts 4 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 20, 2024 at 14:02 UTC

I know you’re working on a write up, but any info on how you’re prompting it would be great!

1 replies 0 reposts 6 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 20, 2024 at 11:37 UTC

UMass administration building

0 replies 2 reposts 11 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 17, 2024 at 16:23 UTC

Was debating whether to use the p word

1 replies 0 reposts 3 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 17, 2024 at 12:41 UTC

It’s great that ACL ARR has a section for Comp Soc Sci and Cultural Analytics, but all the keywords seem to be CSS. What are the key NLP problems for CA? Narrative? Character? Dealing with complex documents? Language change?

1 replies 2 reposts 7 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 17, 2024 at 12:07 UTC

If mosquitoes were like “hey can I have a minuscule amount of your blood so I can have my babies?” I’d be like “sure! Here you go” but they have to be dicks about it

0 replies 1 reposts 6 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 12, 2024 at 02:34 UTC

Just saw Hitman. Seemed totally plausible except he’s supposed to be a professor of Psychology AND Philosophy? Like wtf

2 replies 2 reposts 7 likes

David Mimno @dmimno.bsky.social
[ View ] Jun 09, 2024 at 02:53 UTC

This is chapter 80 of de rebus gestis ricardi primi, in 81 there’s a tour of other English cities and how bad they are, eg Bath is “ad portas inferi”

0 replies 1 reposts 2 likes

Reposted by David Mimno

Meredith Martin @mmvty.bsky.social
[ View ] May 29, 2024 at 12:43 UTC

A ton of folks at the CDH & incredible graduate students in Princeton English have worked for SO LONG on this project -- please read & share! modernismmodernity.org/forums/world...
Especially excited about @zoeleblanc.bsky.social and @suttonkoeser.bsky.social's article.

0 replies 18 reposts 29 likes

David Mimno @dmimno.bsky.social
[ View ] May 30, 2024 at 14:06 UTC

Galaxy brain: Big language models represent your docs in the context of a massive background collection. That may or may not be what you want.

0 replies 2 reposts 3 likes

Reposted by David Mimno

Meredith Martin @mmvty.bsky.social
[ View ] May 29, 2024 at 12:40 UTC

c19datacollective.com/data/

DATASET ALERT. First dataset accepted at 19thC Data Collective. Have some 19thC Data? Submit! Congrats @micahbateman.bsky.social ! (Please post on other place, I'm not allowed back there until my book is in)

0 replies 17 reposts 20 likes

David Mimno @dmimno.bsky.social
[ View ] May 28, 2024 at 00:40 UTC

So AI is going to destroy us by blindly optimizing objective functions despite devastating practical or human cost? Don't we already have this, and it's called private equity?

1 replies 0 reposts 8 likes

David Mimno @dmimno.bsky.social
[ View ] May 24, 2024 at 11:47 UTC

I can guarantee you that fucksmith did not see one penny of that $60m

0 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] May 15, 2024 at 18:55 UTC

Waikiki could be one of the world’s great biking cities but they somehow manage to treat everything but cars as an afterthought AND create a miserable experience for drivers

2 replies 1 reposts 3 likes

David Mimno @dmimno.bsky.social
[ View ] May 07, 2024 at 00:01 UTC

The font is Art Nouveau creativemarket.com/lizkohlerbro...

By Liz Kohler Brown www.lizkohlerbrown.com

1 replies 0 reposts 3 likes

David Mimno @dmimno.bsky.social
[ View ] Apr 18, 2024 at 16:39 UTC

Also, Master of the Senate

0 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] Apr 18, 2024 at 16:37 UTC

Tacitus, “Agricola”

0 replies 0 reposts 1 likes

David Mimno @dmimno.bsky.social
[ View ] Apr 16, 2024 at 19:59 UTC

👀

0 replies 0 reposts 3 likes

David Mimno @dmimno.bsky.social
[ View ] Apr 15, 2024 at 20:42 UTC

So meta

0 replies 0 reposts 0 likes

Reposted by David Mimno

Federico Pianzola @fpianz.bsky.social
[ View ] Apr 13, 2024 at 06:53 UTC

PhD position Computational Approaches to Narrative in Argumentation

www.rug.nl/about-ug/wor...

#nlproc #nlp #computationalhumanities
@tedunderwood.me @andrewpiper.bsky.social @mellymeldubs.bsky.social @mariaa.bsky.social @dmimno.bsky.social @dbamman.bsky.social @lucy3.bsky.social

0 replies 8 reposts 7 likes

David Mimno @dmimno.bsky.social
[ View ] Apr 11, 2024 at 16:37 UTC

It’s been pretty clear for a while they use punctuation tokens this way

0 replies 0 reposts 1 likes

David Mimno @dmimno.bsky.social
[ View ] Apr 09, 2024 at 00:48 UTC

Yes, this photo was taken at Ithaca Beer Company

0 replies 0 reposts 2 likes

David Mimno @dmimno.bsky.social
[ View ] Apr 09, 2024 at 00:36 UTC

In the totality, Chimney Bluffs NY

0 replies 0 reposts 5 likes