In defense of screen time
My daughter is constantly creating: her passions include making art, writing fiction, coding interactive games, and composing music. Yet, I regularly see news articles and media pundits suggesting that my…
LLM-based chatbots’ capabilities have been advancing every month. These improvements are mostly measured by benchmarks like MMLU, HumanEval, and MATH (e.g. sonnet 3.5, gpt-4o). However, as these measures get more…
Introduction Imagine yourself a decade ago, jumping directly into the present shock of conversing naturally with an encyclopedic AI that crafts images, writes code, and debates philosophy. Won’t this technology…
The AI revolution drove frenzied investment in both private and public companies and captured the public’s imagination in 2023. Transformational consumer products like ChatGPT are powered by Large Language Models…
A brief overview and discussion on gender bias in AI
Is Attention all you need? Mamba, a novel AI model based on State Space Models (SSMs), emerges as a formidable alternative to the widely used Transformer models, addressing their inefficiency…
Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges?
'Vec2text' can serve as a solution for accurately reverting embeddings back into text, thus highlighting the urgent need for revisiting security protocols around embedded data.
Have you ever trained a model you thought was good, but then it failed miserably when applied to real world data? If so, you’re in good company.
On the pivotal role that Deep Learning has played as a key enabler for advancing single-cell sequencing technologies.
On fish counting – a complex sociotechnical problem in a field that is going through the process of digital transformation.
Jeremy Howard (founding CEO, previously co-founder of Kaggle and fast.ai) and Eric Ries (founding director, previously creator of Lean Startup and the Long-Term Stock Exchange) today launched Answer.AI, a…
In this article, we will talk about classical computation: the kind of computation typically found in an undergraduate Computer Science course on Algorithms and Data Structures [1]. Think shortest…
This essay first appeared in Reboot. Credulous, breathless coverage of “AI existential risk” (abbreviated “x-risk”) has reached the mainstream. Who could have foreseen that the smallcaps onomatopoeia “ꜰᴏᴏᴍ” —…
Summary: recently while fine-tuning a large language model (LLM) on multiple-choice science exam questions, we observed some highly unusual training loss curves. In particular, it appeared the model was able…
Friends with no previous interest in AI ethics have begun asking me questions in the wake of the release of ChatGPT4, Bard, and Bing Chat. This new generation of large…
Proposals for stringent AI model licensing and surveillance will likely be ineffective or counterproductive, concentrating power in unsustainable ways, and potentially rolling back the societal gains of the Enlightenment. The…
This article is the result of a collaboration between philosopher Seth Lazar, AI impacts researcher Arvind Narayanan, and fast.ai’s Jeremy Howard. At fast.ai we believe that planning for…
I remember the first time I used the v1.0 of Visual Basic. Back then, it was a program for DOS. Before it, writing programs was extremely complex and I’d never…
Today we’re releasing our new course, From Deep Learning Foundations to Stable Diffusion, which is part 2 of Practical Deep Learning for Coders.
“The limits of my language mean the limits of my world.” — Ludwig Wittgenstein
Understanding the building blocks and design choices of graph neural networks.
What components are needed for building learning algorithms that leverage the structure and properties of graphs?