Ben Schmidt

@bschmidt.bsky.social

It's a general phenomenon though -- we hosted an event last night with Sasha Rush from Cornell, who was saying that adding text makes image models better, code models better, etc.; but adding other modalities tends not to make text models better. Maybe language is just a good way to describe things.

Jun 06, 2024 at 18:07 UTC

1 replies 0 reposts 2 likes

Thomas Dickerson @elfprince13.mumak.app
Jun 06, 2024 at 18:10 UTC

Neat! Most of our models are doing sensor fusion, so we are multimodal along an orthogonal axis.

0 replies 0 reposts 1 likes