Close Menu
AI Week
  • Breaking
  • Insight
  • Ethics & Society
  • Innovation
  • Education and Training
  • Spotlight
Trending

UN experts warn against market-driven AI development amid global concerns

September 20, 2024

IBM launches free AI training programme with skill credential in just 10 hours

September 20, 2024

GamesBeat Next 2023: Emerging leaders in video game industry to convene in San Francisco

September 20, 2024
Facebook X (Twitter) Instagram
Newsletter
  • Privacy
  • Terms
  • Contact
Facebook X (Twitter) Instagram YouTube
AI Week
Noah AI Newsletter
  • Breaking
  • Insight
  • Ethics & Society
  • Innovation
  • Education and Training
  • Spotlight
AI Week
  • Breaking
  • Insight
  • Ethics & Society
  • Innovation
  • Education and Training
  • Spotlight
Home»Spotlight»Exploring the Potential of Small Language Models in AI Startups Without GPU Dependency
Spotlight

Exploring the Potential of Small Language Models in AI Startups Without GPU Dependency

Ivan MassowBy Ivan MassowJune 25, 20240 ViewsNo Comments2 Mins Read
Share
Facebook Twitter LinkedIn WhatsApp Email

Yejin Choi, Senior Research Director at Allen Institute of AI, discusses the viability of Small Language Models (SLMs) as a cost-effective alternative to Large Language Models (LLMs) in AI startups, presenting innovative approaches at the Databricks’ Data + AI Summit.

Building AI startups is often hindered by high computational costs, primarily due to the expense of training large language models (LLMs). However, Yejin Choi, the Senior Research Director at Allen Institute of AI, explored the potential of small language models (SLMs) at the Databricks’ Data + AI Summit. She discussed ways to develop SLMs without using GPUs.

One “mission impossible” Choi outlined centered on summarizing sentences without employing reinforcement learning from human feedback (RLHF), large-scale pre-training, and supervised datasets. Another challenge was summarizing documents under similar constraints. Choi’s team also worked on revitalizing older statistical n-gram models to remain relevant against neural language models.

Choi emphasized that SLMs do not necessitate the significant compute power that LLMs require, thereby eliminating the need for GPUs. She illustrated this by referencing Infini.gram, an engine created by researchers at Washington University and the Allen Institute for AI, which runs on standard CPU compute power.

Choi argued that AI models are only as good as the data they are trained on, suggesting that synthetically generated data could be crucial in the future. She pointed to examples like Meta AI’s Segment Anything Model and Microsoft’s ‘Textbooks Are All You Need,’ which demonstrate that high-quality synthesized data can enable SLMs to compete with larger models effectively.

Choi’s overall contention was that with the right synthesized data and a focus on abstraction rather than scale, SLMs can be a viable and less resource-intensive alternative to LLMs.

Share. Facebook Twitter LinkedIn Telegram WhatsApp Email Copy Link
Ivan Massow
  • X (Twitter)

Ivan Massow Senior Editor at AI WEEK, Ivan, a life long entrepreneur, has worked at Cambridge University's Judge Business School and the Whittle Lab, nurturing talent and transforming innovative technologies into successful ventures.

Related News

UN experts warn against market-driven AI development amid global concerns

September 20, 2024

IBM launches free AI training programme with skill credential in just 10 hours

September 20, 2024

GamesBeat Next 2023: Emerging leaders in video game industry to convene in San Francisco

September 20, 2024

Alibaba Cloud unveils cutting-edge modular datacentre technology at annual Apsara conference

September 20, 2024

Dentistry.One unveils innovative SmileScan AI tool for oral health monitoring

September 20, 2024

Inbolt secures €15 million in Series A round to propel expansion and technological advancements

September 20, 2024
Add A Comment
Leave A Reply Cancel Reply

Top Articles

IBM launches free AI training programme with skill credential in just 10 hours

September 20, 2024

GamesBeat Next 2023: Emerging leaders in video game industry to convene in San Francisco

September 20, 2024

Alibaba Cloud unveils cutting-edge modular datacentre technology at annual Apsara conference

September 20, 2024

Subscribe to Updates

Get the latest AI news and updates directly to your inbox.

Advertisement
Demo
AI Week
Facebook X (Twitter) Instagram YouTube
  • Privacy Policy
  • Terms of use
  • Press Release
  • Advertise
  • Contact
© 2025 AI Week. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.