![Vector Databases You Can Choose From](https://cdn.magzter.com/Linux For You/1688017500/articles/uLPlyE9QK1688034483302/VECTOR-DATABASES-YOU-CAN-CHOOSE-FROM.jpg)
In natural language processing (NLP), an embedding is a representation of text in the form of vectors. The goal of an embedding is to capture the semantic meaning of words or documents in a way that can be understood by a machine learning model.
A vector database (or an embedding database) in NLP is a specialised database designed to efficiently store, retrieve, and perform operations on high-dimensional vector data (such as the embeddings mentioned above). Vector databases are optimised to perform nearest neighbour search operations efficiently, which is a common requirement in NLP applications. They provide a way of organising and searching through large amounts of embedding data, which can be beneficial in various tasks like information retrieval, document similarity, clustering, and others.
As an example, let’s say you’ve embedded a large number of documents using a Doc2Vec model. Now, given a new document, you want to find the most similar documents in your database. To do this, you would:
1. First, embed the new document into the same high-dimensional space.
2. Next, search the vector database for the vectors closest to the new document’s vector. This is the nearest neighbour search.
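The two steps above can be sketched as a brute-force search in plain Python. This is only an illustration under stated assumptions: the three-dimensional vectors stand in for real Doc2Vec embeddings (which typically have hundreds of dimensions), the document IDs and the helper names `cosine_similarity` and `nearest_neighbours` are hypothetical, and cosine similarity is assumed as the closeness measure.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest_neighbours(query, database, k=2):
    """Return the k (doc_id, similarity) pairs most similar to the query."""
    scored = [
        (doc_id, cosine_similarity(query, vector))
        for doc_id, vector in database.items()
    ]
    # Highest similarity first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical, hand-made embeddings standing in for a stored collection.
database = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
    "doc_d": [0.0, 0.0, 1.0],
}

# Step 1: the new document, already embedded into the same space.
new_document_vector = [1.0, 0.05, 0.0]

# Step 2: find the stored vectors closest to it.
results = nearest_neighbours(new_document_vector, database, k=2)
for doc_id, score in results:
    print(doc_id, round(score, 4))
```

Note that this sketch compares the query against every stored vector, so its cost grows linearly with the size of the collection.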
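Due to the high-dimensional nature of the data, this search can be computationally intensive. However, vector databases use specialised indexing and querying algorithms (like k-d trees, ball trees, or hashing techniques) to speed up these operations, in some cases returning approximate rather than exact nearest neighbours in exchange for speed. Widely used open source tools for this kind of search include FAISS, a similarity search library developed by Facebook AI, and Annoy, an approximate nearest neighbour library developed by Spotify.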
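To give a feel for how a hashing technique can narrow the search, here is a minimal sketch of random hyperplane hashing, one form of locality-sensitive hashing. It is an assumption chosen for illustration, not a description of how FAISS or Annoy work internally, and the function names (`make_hyperplanes`, `signature`, `build_index`, `candidates`) and sample vectors are hypothetical.

```python
import random
from collections import defaultdict


def make_hyperplanes(num_planes, dim, seed=0):
    """Draw random hyperplanes through the origin, stored as normal vectors."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_planes)]


def signature(vector, hyperplanes):
    """One bit per hyperplane: which side of the plane the vector lies on."""
    return tuple(
        1 if sum(v * h for v, h in zip(vector, plane)) >= 0 else 0
        for plane in hyperplanes
    )


def build_index(database, hyperplanes):
    """Group document IDs into buckets keyed by their bit signature."""
    buckets = defaultdict(list)
    for doc_id, vector in database.items():
        buckets[signature(vector, hyperplanes)].append(doc_id)
    return buckets


def candidates(query, buckets, hyperplanes):
    """Return only the documents that share the query's bucket."""
    return list(buckets.get(signature(query, hyperplanes), []))


# Hypothetical embeddings: two point the same way, one the opposite way.
database = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_scaled": [2.0, 0.0, 0.0],
    "doc_opposite": [-1.0, 0.0, 0.0],
}

hyperplanes = make_hyperplanes(num_planes=8, dim=3, seed=42)
buckets = build_index(database, hyperplanes)
shortlist = candidates([3.0, 0.0, 0.0], buckets, hyperplanes)
print(sorted(shortlist))
```

Vectors pointing in similar directions tend to fall on the same side of most hyperplanes, so they tend to share a bucket, and the query only needs to be compared against that shortlist. The price is that a true neighbour can land in a different bucket and be missed, which is why practical systems usually combine several hash tables and then re-rank the shortlist by exact distance.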
Open source vector databases
This story is from the July 2023 edition of Open Source For You.