Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?
DataQuest|October 2024
Compressing AI models has been both an adventure and a formidable next-inflection-point in the many curves of AI innovation. Can we look for options better than, and beyond, erstwhile approaches like pruning and SLMs?
Pratima H
Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?

What happens when you compress Al's heavier parts with a new centrifugal blade? You save so much storage, memory, GPU stacks and compute gas-tanks, of course. Besides boosting speed, cutting inference latency and expanding compatibility for small devices and edge-networks. But how does this 'squeeze' affect accuracy, error compensation and application-ease? And what about all the other grinders that are attacking the same problem? Like SLMs, GPTQ and QuIP? Recently, Yandex Research, IST Austria, NeuralMagic, and KAUST announced that they have developed what they call 'two innovative compression methods for large language models'. It was also claimed that, when combined, these methods allow for a reduction in model size by up to 8 times while preserving response quality by 95 per cent. Compressed models like Llama 2 13B can run on I GPU instead of 4- they added. So how does it all work and does it address the issues we mentioned earlier?

We compress it all in this interview with Artem Babenko, Head of Yandex Research. He oversees scientific research at Yandex and the company's engagement with the international scientific community. He also supervises a team of approximately 30 researchers engaged in various areas of computer science. According to Artem, his main achievements are his scientific contributions in three key areas: neural networks for image search, high-dimensional vector compression, and fast search across massive databases containing billions of records. Who better than he to explain the ambition, ingredients and final taste of compression? Let's press those buttons.

Can you explain additive quantization and PV-tuning in simpler terms- for a layman? Is it similar to model pruning?

This story is from the October 2024 edition of DataQuest.

Start your 7-day Magzter GOLD free trial to access thousands of curated premium stories, and 9,000+ magazines and newspapers.

This story is from the October 2024 edition of DataQuest.

Start your 7-day Magzter GOLD free trial to access thousands of curated premium stories, and 9,000+ magazines and newspapers.

MORE STORIES FROM DATAQUESTView All
Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?
DataQuest

Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?

Compressing AI models has been both an adventure and a formidable next-inflection-point in the many curves of AI innovation. Can we look for options better than, and beyond, erstwhile approaches like pruning and SLMs?

time-read
7 mins  |
October 2024
Indian Research: Far from a bouquet of ORCIDS yet. Why?
DataQuest

Indian Research: Far from a bouquet of ORCIDS yet. Why?

Indian brains are universally, and globally, applauded for their 'horse sense' - whether we look at our laser-sharp unicorns and wide-bracket start-ups or pride in the unshakeable global confidence placed for many decades in our IT industry. And yet, when it comes to top-drawer academic research, we are still considered by many as either a 'one horse town' or a 'one-trick pony'. Why this ever-widening gap against our global counterparts?

time-read
6 mins  |
October 2024
Accountability in the Age of AI
DataQuest

Accountability in the Age of AI

In this interview, Aayush Ghosh Choudhury, Co-Founder & CEO OF Scrut Automation shares insights on why clear accountability is essential when working with AI, how AI is transforming GRC practices, and the key risks organizations need to navigate. We also explore the advancements expected in the coming years and how organizations can strike a balance between AI adoption and compliance, ensuring responsible and ethical use of these powerful technologies.

time-read
5 mins  |
October 2024
Live On the Edge
DataQuest

Live On the Edge

How can companies leverage the combination of edge computing & AI to foster sustainability?

time-read
6 mins  |
October 2024
Technology and Fashion - The Vanilla Girl, without being Cheugy
DataQuest

Technology and Fashion - The Vanilla Girl, without being Cheugy

Moving the needle in fashion- now that can go either way- between being too soft to being too flashy. Especially when it's about AI's copyright issues, personalisation, VR in design, Counterfeits, Quiet Fashion, Image recognition and carbon impact. Can an elastic CRM-fabric and data's new look help in threading the technology needle?

time-read
5 mins  |
October 2024
AI does not facilitate financial crime, it helps to fight it.
DataQuest

AI does not facilitate financial crime, it helps to fight it.

Why and where do AI models, SLMs, synthetic data and recommendation engines work as angels in areas when the devil is always in the details?

time-read
7 mins  |
October 2024
Multi-million and multi-year deals are back-both for first-timers and mature outsourcers
DataQuest

Multi-million and multi-year deals are back-both for first-timers and mature outsourcers

With IT budgets tightening and global uncertainties rising, multi-million, multi-year outsourcing deals are making a comeback.

time-read
2 mins  |
October 2024
The Big Five in a Technology Safari
DataQuest

The Big Five in a Technology Safari

Mobile banking, Blockchain, AI, IT Modernisation and? Wait, is Physical Banking the 5th one? Africa has, for long and quite-deservedly, been brave about exploring the wild forests of banking innovations and on-ground solutions that fit the region's limitations and untapped opportunities to the T. How do you make sure you stay gutsy, relevant and on the right track in such a region? How do you make IT your navigator when you want to be the top pan-African bank? Johnson Idesoh, Group Chief Information and Technology Officer at Absa Group takes us around and gives a peek on what customers here are actually hunting for.

time-read
6 mins  |
October 2024
India's Quantum Bet - the Dark Horse Who Eats Dark Chocolate
DataQuest

India's Quantum Bet - the Dark Horse Who Eats Dark Chocolate

Or the Dark Matter? It's about the Quantum Race's Winning Gap. India could be the unexpected winner in quantum research, innovation and markets as long as we look beyond the QuBit game and fix real issues that still slow down the last lap. The action is happening, albeit invisible, till we see it shine in full glory.

time-read
10 mins  |
October 2024
Quantum Leap or Quantum Leap of Faith? Indian Industries Dive into Infinite Possibilities!
DataQuest

Quantum Leap or Quantum Leap of Faith? Indian Industries Dive into Infinite Possibilities!

India is on the brink of a quantum leap, as industries from finance to healthcare begin to explore the transformative power of quantum computing. With the capability to perform calculations at speeds unimaginable with classical computers, quantum technology is not just a futuristic concept-it's becoming a practical tool for solving today's most pressing challenges and unlocking new opportunities for growth.

time-read
6 mins  |
October 2024