Reposted by Maria Antoniak
Poetry is weirdly prominent in LLM conversations. But what do models really "know" about poetry?
We tested how well LLMs can recognize 20+ poetic forms in English & probed major pretraining datasets to see which poems might be memorized.
New preprint: arxiv.org/abs/2406.18906
1 replies
17 reposts
39 likes
I’m aware, hence the question!
1 replies
0 reposts
1 likes
Well, in this case they are.
0 replies
0 reposts
2 likes
"LLMs are just next-word predictors."
What are the current best references to respond to this, either supporting or critiquing?
6 replies
2 reposts
6 likes
Ok but my current greatest organizational innovation is creating a "Today" section in my main Slack workspace, giving it an emoji that indicates how scrambled I am, and then each day moving channels in and out of this section.
1 replies
0 reposts
5 likes
Research is complicated, sometimes you need space to do something unexpected, and you’re balancing this against 10 other priorities like TAing. Freedom to manage one’s time is a wonderful luxury and is one reason I chose to do a PhD. Clear communication and organization are still important though!
1 replies
0 reposts
4 likes
One more thought that doesn’t seem to be shared: if my advisor had hired a PM and the PM had started tagging me in kanban tickets, I would have done everything to claw my way out of being managed and surveilled. It can be a good system, but it’s one I intentionally left behind in industry.
1 replies
0 reposts
4 likes
Yeah this is key. My Slack channels always have lots of bookmarks, and my Google docs usually have a “Quick Links” section where I link to everything (Overleaf, other Google docs, data, etc.).
Notion is way better at all of this, of course, but everyone uses Google and no one uses Notion, so.
1 replies
0 reposts
6 likes
This might be a feature of bigger collaborations, as keeping a large group up to date requires more managing than when it’s just student + advisor.
1 replies
0 reposts
3 likes
On the practical side, when I’m that person, I’ve learned to make and maintain a Slack channel, a meeting notes doc, and a project-specific to-do list.
1 replies
0 reposts
4 likes
In my opinion, that’s the role of the “first author” in NLP, the person who is leading the charge and making sure everything that needs to happen happens. That person is usually the PhD student.
1 replies
0 reposts
3 likes
Working on this with @mellymeldubs.bsky.social and Anna Preus has been so much fun. Poetry, data, models, evaluation, memorization, and more! 📜🤖
1 replies
1 reposts
5 likes
Reposted by Maria Antoniak
Love to spend a decade learning how to meticulously quantify human suffering, then get a job doing it, only to find out that, to quote Vonnegut, "the power of this weapon turns out to be that of a custard pie dropped from a stepladder six feet high."
0 replies
2 reposts
12 likes
Two sci-fi novels that destroy this perspective, that I read recently and loved:
“I Who Have Never Known Men” by Jacqueline Harpman
www.transitbooks.org/books/iwhoha...
“The Wall” by Marlen Haushofer
bookshop.org/p/books/the-...
1 replies
1 reposts
4 likes
Interdisciplinary researchers, please take our survey!
▶️ forms.gle/XULo3xTbkm9n...
Just 6 questions, should take just 2-3 minutes, and we would be so grateful for your input 🙏
Calling digital humanists, computational social scientists, and anyone doing research in multiple communities!
0 replies
14 reposts
10 likes
“The widespread integration of Gemini with Google search and GPT-4.0 with Bing search has led to a shift in the trend of self-diagnosis… [comparing] GPT-4.0 and the Gemini model on the task of self-diagnosis… [we find] accuracies of 63.07% and 6.01%, respectively.”