Vector Databases You can choose From
Open Source For You|July 2023
Vector databases offer efficiency and scalability, and are transforming the way we harness the potential of embedding data in the digital age. There are quite a few open source vector databases that come with their own benefits. We take a quick look at them.
Mohit Raj
Vector Databases You can choose From

In natural language processing (NLP), an embedding is a representation of text in the form of vectors. The goal of an embedding is to capture the semantic meaning of words or documents in a way that can be understood by a machine learning model.

A vector database (or an embedding database) in NLP is a specialised database designed to efficiently store, retrieve, and perform operations on high-dimensional vector data (such as the embeddings mentioned above). Vector databases are optimised to perform nearest neighbour search operations efficiently, which is a common requirement in NLP applications. They provide a way of organising and searching through large amounts of embedding data, which can be beneficial in various tasks like information retrieval, document similarity, clustering, and others.

As an example, let’s say you’ve embedded a large number of documents using a Doc2Vec model. Now, given a new document, you want to find the most similar documents in your database. To do this, you would:

1. First, embed the new document into the same high-dimensional space.

2. Next, search the vector database for the vectors closest to the new document’s vector. This is the nearest neighbour search.

Due to the high-dimensional nature of the data, this search can be computationally intensive. However, vector databases use specialised indexing and querying algorithms (like k-d trees, ball trees, or hashing techniques) to speed up these operations. Examples of such databases include FAISS developed by Facebook AI and Annoy developed by Spotify.

Open source vector databases 

هذه القصة مأخوذة من طبعة July 2023 من Open Source For You.

ابدأ النسخة التجريبية المجانية من Magzter GOLD لمدة 7 أيام للوصول إلى آلاف القصص المتميزة المنسقة وأكثر من 9,000 مجلة وصحيفة.

هذه القصة مأخوذة من طبعة July 2023 من Open Source For You.

ابدأ النسخة التجريبية المجانية من Magzter GOLD لمدة 7 أيام للوصول إلى آلاف القصص المتميزة المنسقة وأكثر من 9,000 مجلة وصحيفة.

المزيد من القصص من OPEN SOURCE FOR YOU مشاهدة الكل
Red Hat unveils Red Hat OpenShift Virtualization Engine
Open Source For You

Red Hat unveils Red Hat OpenShift Virtualization Engine

Red Hat OpenShift Virtualization Engine is a new edition of Red Hat OpenShift that offers a dedicated solution for organisations to leverage the virtualisation capabilities already available within Red Hat OpenShift.

time-read
1 min  |
February 2025
Spring AI: A Door to GenAI Heaven for Java Developers
Open Source For You

Spring AI: A Door to GenAI Heaven for Java Developers

Let's explore the Spring AI framework and its advantages, and look at how it is helping Java developers adopt AI.

time-read
6 mins  |
February 2025
Significant security vulnerabilities drive the release of Rsync 3.4
Open Source For You

Significant security vulnerabilities drive the release of Rsync 3.4

Rsync, the widely used utility for incremental file transfers and synchronisation, has released version 3.4. This update isn't packed with exciting new features but is instead critical due to several newly disclosed security vulnerabilities.

time-read
1 min  |
February 2025
NVIDIA puts Grace Blackwell at every AI developer's fingertips
Open Source For You

NVIDIA puts Grace Blackwell at every AI developer's fingertips

NVIDIA has introduced NVIDIA Project DIGITS, a groundbreaking personal AI supercomputer designed to empower AI researchers, data scientists, and students NVIDIA® NVIDIA GRACE BLACKWELL with the immense capabilities of the NVIDIA Grace Blackwell platform.

time-read
1 min  |
February 2025
Top Tools for DevOps, Cybersecurity, and Cloud Management in 2025
Open Source For You

Top Tools for DevOps, Cybersecurity, and Cloud Management in 2025

In 2025, organisations will continue to rely on open source tools to retain a competitive edge. We look at why the best tools for DevOps, cybersecurity and cloud management will remain relevant and how best to integrate them into your organisation.

time-read
9 mins  |
February 2025
CREW: Open source platform to improve human-AI interaction
Open Source For You

CREW: Open source platform to improve human-AI interaction

As human-AI collaboration deepens, critical questions arise: How should humans and AI complement one another? What kind of feedback enhances AI training? How can trust in AI be optimised to balance collaboration without over-reliance? Researchers at Duke University are addressing these challenges through CREW-an innovative platform designed to advance human-AI teaming.

time-read
1 min  |
February 2025
Red Hat completes the acquisition of Neural Magic
Open Source For You

Red Hat completes the acquisition of Neural Magic

Red Hat, Inc., has announced the completion of its acquisition of Neural Magic, a trailblazer in software and algorithms that accelerate generative AI (GenAI) inference workloads.

time-read
1 min  |
February 2025
The Do's and Don'ts for Software Architects
Open Source For You

The Do's and Don'ts for Software Architects

Here's a list of best practices for software architects as well as the common mistakes they should try not to fall prey to.

time-read
4 mins  |
February 2025
openSUSE's Tumbleweed introduces Wayland support for the LXQt desktop environment
Open Source For You

openSUSE's Tumbleweed introduces Wayland support for the LXQt desktop environment

The openSUSE Project has announced that its Tumbleweed rolling release distribution now includes Wayland support for users of the LXQt desktop environment.

time-read
1 min  |
February 2025
A Guide for Software Architects: Common Mistakes and Best Practices
Open Source For You

A Guide for Software Architects: Common Mistakes and Best Practices

Software architects play an invaluable role in the digital transformation of an organisation. To make a mark, they must imbibe certain qualities and avoid common errors.

time-read
6 mins  |
February 2025