Gå ubegrenset med Magzter GOLD

Gå ubegrenset med Magzter GOLD

Få ubegrenset tilgang til over 9000 magasiner, aviser og premiumhistorier for bare

$149.99
 
$74.99/År

Prøve GULL - Gratis

Vector Databases You can choose From

Open Source For You

|

July 2023

Vector databases offer efficiency and scalability, and are transforming the way we harness the potential of embedding data in the digital age. There are quite a few open source vector databases that come with their own benefits. We take a quick look at them.

- Mohit Raj

Vector Databases You can choose From

In natural language processing (NLP), an embedding is a representation of text in the form of vectors. The goal of an embedding is to capture the semantic meaning of words or documents in a way that can be understood by a machine learning model.

A vector database (or an embedding database) in NLP is a specialised database designed to efficiently store, retrieve, and perform operations on high-dimensional vector data (such as the embeddings mentioned above). Vector databases are optimised to perform nearest neighbour search operations efficiently, which is a common requirement in NLP applications. They provide a way of organising and searching through large amounts of embedding data, which can be beneficial in various tasks like information retrieval, document similarity, clustering, and others.

As an example, let’s say you’ve embedded a large number of documents using a Doc2Vec model. Now, given a new document, you want to find the most similar documents in your database. To do this, you would:

1. First, embed the new document into the same high-dimensional space.

2. Next, search the vector database for the vectors closest to the new document’s vector. This is the nearest neighbour search.

Due to the high-dimensional nature of the data, this search can be computationally intensive. However, vector databases use specialised indexing and querying algorithms (like k-d trees, ball trees, or hashing techniques) to speed up these operations. Examples of such databases include FAISS developed by Facebook AI and Annoy developed by Spotify.

Open source vector databases 

Weaviate:

FLERE HISTORIER FRA Open Source For You

Open Source For You

Open Source For You

Ukraine builds sovereign AI using Google's Gemma

Ukraine has launched a landmark national AI initiative, building a fully sovereign large language model (LLM) using Google's open source Gemma framework.

time to read

1 min

January 2026

Open Source For You

Open Source For You

Kubernetes vs Docker Swarm: Choosing the Right Orchestration Tool

This overview of the differences between Kubernetes and Docker Swarm will help DevOps developers determine the right container orchestration tool for their project.

time to read

5 mins

January 2026

Open Source For You

Open Source For You

Quantum Programming: Speaking the Language of Qubits

Quantum software tools are evolving and will soon make quantum computing easily accessible. With tools like Qiskit and Cirq, anyone can begin exploring the quantum world, experiment with algorithms, and contribute to a rapidly evolving field.

time to read

7 mins

January 2026

Open Source For You

Open Source For You

Building a Real-Time Grocery Price Comparison System

This real-time grocery price comparison system has been designed for leading Indian e-commerce platforms. Built on microservices architecture, the system leverages FastAPI for backend services and Selenium for dynamic web scraping to deliver accurate, up-to-date pricing data.

time to read

7 mins

January 2026

Open Source For You

Open Source For You

The Role of Generative AI in the AWS Database Migration Service

The integration of generative AI in the AWS Database Migration Service enhances schema conversion, making data migrations more accurate, speedy, and efficient.

time to read

7 mins

January 2026

Open Source For You

Open Source For You

Proxmox unveils open source Datacenter Manager

Proxmox has officially launched the first full and stable version of its Datacenter Manager, positioning itself as a robust open source alternative for private cloud deployments.

time to read

1 min

January 2026

Open Source For You

Open Source For You

Linux Foundation lauds Japan's OSS boom but warns of governance and security gaps

Linux Foundation Research has released its latest report, ‘The State of Open Source Japan 2025: Accelerating Business Value through Strategic Open Source Engagement’.

time to read

1 min

January 2026

Open Source For You

Open Source For You

Improving Microservices Performance with Django

Django, along with other open source tools like Redis, PostgreSQL, Celery and NGINX, helps address the challenges of request overhead and latency in microservices architecture.

time to read

4 mins

January 2026

Open Source For You

Open Source For You

Manim: Creating Dynamic Visual Animations

Learn about how Manim, a Python library, converts code into captivating animations, and why it has become the preferred choice for educators, developers, and content creators worldwide.

time to read

4 mins

January 2026

Open Source For You

Open Source For You

openSUSE empowers Linux developers with Intel NPU access

The openSUSE project has begun distributing packaging for the Intel Neural Processing Unit (NPU) driver, enabling small-scale AI development on Linux.

time to read

1 min

January 2026

Translate

Share

-
+

Change font size