Articles | WordDisk

Understanding the visual knowledge of language models

LLMs trained primarily on text can generate complex visual concepts as code and refine them through self-correction. Researchers used the resulting illustrations to train an image-free computer vision system to recognize real photos.

Alex Shipps | MIT CSAIL • mit
June 17, 2024 • ~6 min

A smarter way to streamline drug discovery

The SPARROW algorithm automatically identifies the best molecules to test as potential new medicines, given the vast number of factors affecting each choice.

Adam Zewe | MIT News • mit
June 17, 2024 • ~8 min

Technique improves the reasoning capabilities of large language models

Combining natural language and programming, the method enables LLMs to solve numerical, analytical, and language-based tasks transparently.

Adam Zewe | MIT News • mit
June 14, 2024 • ~7 min

Researchers use large language models to help robots navigate

The method uses language-based inputs instead of costly visual data to direct a robot through a multistep navigation task.

Adam Zewe | MIT News • mit
June 12, 2024 • ~7 min

Making climate models relevant for local decision-makers

A new downscaling method leverages machine learning to speed up climate model simulations at finer resolutions, making them usable at the local level.

Paige Colley | EAPS • mit
June 11, 2024 • ~6 min

New algorithm discovers language just by watching videos

DenseAV, developed at MIT, learns to parse and understand the meaning of language just by watching videos of people talking, with potential applications in multimedia search, language learning, and robotics.

Rachel Gordon | MIT CSAIL • mit
June 11, 2024 • ~9 min

New computer vision method helps speed up screening of electronic materials

The technique characterizes a material’s electronic properties 85 times faster than conventional methods.

Jennifer Chu | MIT News • mit
June 11, 2024 • ~8 min

Mouth-based touchpad enables people living with paralysis to interact with computers

The startup Augmental allows users to operate phones and other devices using tongue, mouth, and head gestures.

Zach Winn | MIT News • mit
June 5, 2024 • ~8 min

A technique for more effective multipurpose robots

With generative AI models, researchers combined robotics data from different sources to help robots learn better.

Adam Zewe | MIT News • mit
June 3, 2024 • ~7 min

Looking for a specific action in a video? This AI-based method can find it for you

A new approach could streamline virtual training processes or aid clinicians in reviewing diagnostic videos.

Adam Zewe | MIT News • mit
May 29, 2024 • ~7 min