The excitement surrounding large language models (LLMs) is rapidly increasing, with industries widely exploring diverse use cases. As a transformative technology, LLMs are being closely monitored for their potential to revolutionize and optimize everything from customer service to complex data analysis to advance health care. Bill Gates recently wrote a blog on how agents will be the next big thing in software. He further claimed that in the next 5 years, anyone who’s online will be able to have a personal assistant powered by artificial intelligence.
While the industries & user community are still embracing the euphoria of Large Language Models (LLMs), the Hi-Tech industry has already started to work on evolution of Large Multimodal Models (LMM) - a step towards extending the ‘emergent’ abilities of LLMs beyond text-only input/output models.
▾ Large Multimodal Models
We human beings are blessed with multiple sensory & cognitive capabilities and our intelligence is a collective intelligence derived from multiple sources. As we grow, we learn to use one or more of these ‘Modes of interactions’ to interact with the world around us. The future of AI will likely follow the same realm and will work on integrating multiple data modalities at input and/or output into AI models, leading to the development of LMMs. The input or output modes of interest could be text/language, images, video, audio, sensors data, actuator data, etc. Till recently, the focus was on unimodal models which could process only one data mode (such as text or speech or image) at a time.
By combining these different types of data, LMMs can achieve a more holistic understanding of the world, enabling them to perform complex tasks. For instance, an LMM could analyze a video, recognize objects, understand spoken language, and generate descriptive text all in one seamless iteration.
This story is from the september 2024 edition of PCQuest.
Start your 7-day Magzter GOLD free trial to access thousands of curated premium stories, and 9,000+ magazines and newspapers.
Already a subscriber ? Sign In
This story is from the september 2024 edition of PCQuest.
Start your 7-day Magzter GOLD free trial to access thousands of curated premium stories, and 9,000+ magazines and newspapers.
Already a subscriber? Sign In
ASUS ExpertBook P5
The ASUS ExpertBook P5 aims to deliver more than just a typical business laptop experience. Designed for modern professionals, it boasts powerful hardware, AI-driven tools, and robust security features.
Early Warning Systems (EWS): The recipe to combat fraud and delinquency
An EWS isn't just a compliance tool; it's a financial guardian that transforms chaos into clarity. By weaving unstructured data, automation, and adaptive analytics, it empowers lenders to outpace fraud, foresee risks, and revolutionize credit management with precision
Empowering businesses with data privacy compliance: Key takeaways from PCQuest's DPDPA workshop
From chaos to clarity-PCQuest's DPDPA Workshop explored how businesses can master data privacy laws, turn compliance into opportunity, and build unshakable trust in a data-driven world.
Brightening Lives: Assistive Technology for art and entertainment
Assistive Technology is transforming art and entertainment into playgrounds of inclusion-cinemas that narrate, museums that adapt, and platforms that empower. It's more than access; it's a revolution of creativity, breaking barriers for a connected, empathetic world
Gaming saw dramatic advancements in hardware, software & AI
The 2024 gaming revolution fuses cutting-edge hardware with Al, delivering immersive worlds, lifelike NPCs, and dynamic gameplay. With next-gen consoles, blazing GPUs, VR/AR, and personalized experiences, gaming evolves into interactive ecosystems, redefining entertainment and innovation for players worldwide
The tools of tomorrow tackling the challenges of today
Drones are revolutionizing farming, replacing hard labor with precision tools powered by Al. From spraying to scouting, they work smarter, not harder. With innovation tackling challenges like language barriers and training, drones are redefining how fields are managed and harvested
In race between hackers and cybersecurity, quantum is key
Hackers wield Al, encryption quakes under quantum power, but quantum cryptography flips the script. With physics as its ally and India as a trailblazer, it crafts unbreakable, ever-changing keys. The future of data isn't just safe-it's quantum-proof brilliance
The tech that's changing the game across the board
Machine learning is rewriting the rules, from crafting infinite gaming worlds to saving lives with real-time health data. It's not just tech-it's transformation. As challenges arise, innovation keeps pushing the boundaries of what's possible every day
2024's biggest technology trends: What's changing and why it matters
2024's tech isn't just evolving-it's reinventing the rules. AI creates, hardware accelerates, and green innovations heal the planet. From smarter machines to faster connections, it's a year of bold leaps, where innovation doesn't just support life-it redefines it
Exploring the development of 3D and spatial sound in consumer tech
Spatial audio is the art of painting sound in 3D-turning every note, whisper, or explosion into an immersive journey. It blurs the line between the real and digital, making listeners not just hear but feel sound as if they're living inside it