Ben Schmidt

@bschmidt.bsky.social

It's a general phenomenon though -- we hosted an event last night with Sasha Rush from Cornell, who was saying that adding text makes image models better, code models better, etc., but adding other modalities tends not to make text models better. Maybe language is just a good way to describe things.



Thomas Dickerson @elfprince13.mumak.app

Neat! Most of our models are doing sensor fusion, so we are multimodal along an orthogonal axis.
