Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?
DataQuest|October 2024
Compressing AI models has been both an adventure and a formidable next-inflection-point in the many curves of AI innovation. Can we look for options better than, and beyond, erstwhile approaches like pruning and SLMs?
Pratima H

What happens when you compress AI's heavier parts with a new centrifugal blade? You save so much storage, memory, GPU stacks and compute gas-tanks, of course. Besides boosting speed, cutting inference latency and expanding compatibility for small devices and edge-networks. But how does this 'squeeze' affect accuracy, error compensation and application-ease? And what about all the other grinders that are attacking the same problem? Like SLMs, GPTQ and QuIP? Recently, Yandex Research, IST Austria, NeuralMagic, and KAUST announced that they have developed what they call 'two innovative compression methods for large language models'. It was also claimed that, when combined, these methods allow for a reduction in model size by up to 8 times while preserving 95 per cent of response quality. Compressed models like Llama 2 13B can run on 1 GPU instead of 4, they added. So how does it all work and does it address the issues we mentioned earlier?
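To put the 'up to 8 times' claim in rough numbers, here is a back-of-the-envelope sketch in Python. It covers only the storage taken by the weights and rests on assumptions that are not from the announcement: a nominal 13 billion parameters, 16-bit weights before compression, and no allowance for activations, caches or format overhead. The function name is ours, purely for illustration.

```python
def weight_footprint_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in gigabytes (10**9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


PARAMS_13B = 13e9     # assumption: nominal parameter count of a "13B" model
BASELINE_BITS = 16    # assumption: 16-bit weights before compression
REDUCTION = 8         # "up to 8 times", as claimed in the announcement

baseline_gb = weight_footprint_gb(PARAMS_13B, BASELINE_BITS)
compressed_gb = baseline_gb / REDUCTION
implied_bits = BASELINE_BITS / REDUCTION  # average bits left per weight

print(f"baseline weights:   {baseline_gb:.2f} GB")
print(f"compressed weights: {compressed_gb:.2f} GB")
print(f"implied bits per weight: {implied_bits:.0f}")
```

Under these assumptions the weights shrink from about 26 GB to a little over 3 GB, which works out to roughly 2 bits per weight on average. Whether that is enough to fit on a single GPU depends on the card and on everything else that must sit in memory alongside the weights.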

We compress it all in this interview with Artem Babenko, Head of Yandex Research. He oversees scientific research at Yandex and the company's engagement with the international scientific community. He also supervises a team of approximately 30 researchers engaged in various areas of computer science. According to Artem, his main achievements are his scientific contributions in three key areas: neural networks for image search, high-dimensional vector compression, and fast search across massive databases containing billions of records. Who better than he to explain the ambition, ingredients and final taste of compression? Let's press those buttons.

Can you explain additive quantization and PV-tuning in simpler terms, for a layman? Is it similar to model pruning?
