Tech Stack

Built on industry-standard open source technology. This list changes frequently.

Fully extensible

Our RAG ingestion, indexing, and execution infrastructure can be plugged into a variety of inference and embedding models and endpoints.

Tech stack:

LangChain

The largest community building the future of LLM apps

LangChain’s flexible abstractions and AI-first toolkit make it the #1 choice for developers when building with GenAI.

Next.js

The React Framework for the Web

Used by some of the world's largest companies, Next.js enables you to create high-quality web applications with the power of React components.

Vercel AI SDK

An open source library for building AI-powered user interfaces.

The Vercel AI SDK is an open-source library designed to help developers build conversational streaming user interfaces in JavaScript and TypeScript.

Supabase / Postgres

Supabase is an open source Firebase alternative. Start your project with a Postgres database, Authentication, instant APIs, Edge Functions, Realtime subscriptions, Storage, and Vector embeddings.

Mistral / Mixtral models

Mixtral 8x7B

A leading open model at the time of writing. A sparse Mixture-of-Experts (SMoE) model with eight experts. It uses about 12.9B active parameters per token out of about 46.7B total.

Mistral 7B

Mistral AI's first model. A 7B transformer model that is fast to deploy and easy to customise. Small, yet powerful for a variety of use cases.

Microsoft/WizardLM-2-8x22B

PgVector

Open-source vector similarity search for Postgres. Store your vectors with the rest of your data.
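As a rough illustration of keeping vectors next to the rest of a row, the sketch below builds the SQL for a table with an embedding column and a nearest-neighbour query. The table name `documents`, the column names, and the helper functions are hypothetical and are not taken from this project's schema. The `vector` column type and the `<=>` cosine-distance operator are standard PgVector features.

```typescript
// Hypothetical table name, used for illustration only.
const TABLE = "documents";

// Serialise a numeric vector into PgVector's text literal form, e.g. "[0.1,0.2,0.3]".
function toVectorLiteral(values: number[]): string {
  if (values.length === 0 || values.some((v) => !Number.isFinite(v))) {
    throw new Error("vector must be non-empty and contain only finite numbers");
  }
  return `[${values.join(",")}]`;
}

// DDL for a table that stores the embedding alongside ordinary columns.
function createTableSql(dimensions: number): string {
  if (!Number.isInteger(dimensions) || dimensions <= 0) {
    throw new Error("dimensions must be a positive integer");
  }
  return [
    "CREATE EXTENSION IF NOT EXISTS vector;",
    `CREATE TABLE IF NOT EXISTS ${TABLE} (`,
    "  id bigserial PRIMARY KEY,",
    "  content text NOT NULL,",
    `  embedding vector(${dimensions})`,
    ");",
  ].join("\n");
}

// Parameterised nearest-neighbour query.
// $1 is the query vector literal and $2 is the maximum number of rows.
// "<=>" is cosine distance, so smaller values mean more similar rows.
function nearestNeighboursSql(): string {
  return [
    `SELECT id, content, embedding <=> $1::vector AS distance`,
    `FROM ${TABLE}`,
    `ORDER BY embedding <=> $1::vector`,
    `LIMIT $2;`,
  ].join("\n");
}
```

The query text and the output of `toVectorLiteral` would then be passed to whichever Postgres client the application already uses. Keeping the vector as a bound parameter rather than interpolating it into the SQL avoids building queries from user input.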

Transformers.js

Transformers.js is designed to be functionally equivalent to Hugging Face’s transformers python library, meaning you can run the same pretrained models using a very similar API.

nomic-embed-text

Billed as the first open source, open data, open training code text embedding model that is fully reproducible and auditable. It has an 8192-token context length and is reported to outperform OpenAI Ada-002 and text-embedding-3-small on both short and long context tasks.

Unstructured.io

Unstructured effortlessly extracts and transforms complex data for use with every major vector database and LLM framework.

Playwright

Playwright Test was created specifically to accommodate the needs of end-to-end testing. Playwright supports all modern rendering engines, including Chromium, WebKit, and Firefox.

Auth with DID

Decentralized identifiers (DIDs) are a type of globally unique identifier that enables an entity to be identified in a manner that is verifiable, persistent (as long as the DID controller desires), and does not require the use of a centralized registry.
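To make the identifier format concrete, here is a minimal sketch that splits a DID into its method and method-specific identifier, following the `did:<method>:<method-specific-id>` syntax from the W3C DID Core specification. The `ParsedDid` type and the `parseDid` helper are hypothetical names for illustration. The sketch only checks syntax: it does not resolve the DID or verify anything about its controller, and it does not describe how authentication is wired up in this project.

```typescript
// The two parts of a DID such as "did:example:123456789abcdefghi".
interface ParsedDid {
  method: string;
  methodSpecificId: string;
}

// Pattern based on the DID Core syntax:
//   method name: one or more lowercase letters or digits
//   method-specific id: colon-separated segments of letters, digits, ".", "-", "_",
//   or percent-encoded bytes; the final segment must not be empty.
const ID_CHAR = "(?:[A-Za-z0-9._-]|%[0-9A-Fa-f]{2})";
const DID_PATTERN = new RegExp(
  `^did:([a-z0-9]+):((?:${ID_CHAR}*:)*${ID_CHAR}+)$`
);

// Returns the parsed parts, or null when the input is not a syntactically valid DID.
function parseDid(input: string): ParsedDid | null {
  const match = DID_PATTERN.exec(input);
  if (match === null) {
    return null;
  }
  const method = match[1];
  const methodSpecificId = match[2];
  if (method === undefined || methodSpecificId === undefined) {
    return null;
  }
  return { method, methodSpecificId };
}
```

For example, `parseDid("did:web:example.com:user:alice")` yields the method `web` and the identifier `example.com:user:alice`, while a string without the `did:` prefix yields `null`.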

Kafka

Apache Kafka is an open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications.

Kubernetes

Kubernetes (K8s) is an open source platform for automating the deployment, scaling, and management of containerized applications.

Helm

Helm is the package manager for Kubernetes.

Can support:

custom models, including OpenAI embeddings and inference endpoints
any kind of auth
custom Python endpoints
custom pipelines, agents and tools
other vector stores
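One way to picture this kind of extensibility is a small provider contract with a registry, so that a hosted API, a local model, or a custom endpoint can be swapped in by name. The sketch below is an assumption about how such a plug-in point could look, not this project's actual interface. `EmbeddingsProvider`, `ProviderRegistry`, and `CharCodeEmbeddings` are hypothetical names, and the toy provider does not call a real model.

```typescript
// Hypothetical contract that any embeddings backend would implement.
interface EmbeddingsProvider {
  readonly name: string;
  readonly dimensions: number;
  embed(texts: string[]): Promise<number[][]>;
}

// Toy provider that only demonstrates the shape of the contract.
// It sums character codes into a fixed number of slots; it is not a real embedding model.
class CharCodeEmbeddings implements EmbeddingsProvider {
  readonly name = "char-code-demo";
  readonly dimensions = 4;

  async embed(texts: string[]): Promise<number[][]> {
    return texts.map((text) => {
      const vector = new Array<number>(this.dimensions).fill(0);
      for (let i = 0; i < text.length; i++) {
        const slot = i % this.dimensions;
        vector[slot] = (vector[slot] ?? 0) + text.charCodeAt(i);
      }
      return vector;
    });
  }
}

// Registry that lets callers pick a provider by name at runtime.
class ProviderRegistry {
  private readonly providers = new Map<string, EmbeddingsProvider>();

  register(provider: EmbeddingsProvider): void {
    if (this.providers.has(provider.name)) {
      throw new Error(`provider already registered: ${provider.name}`);
    }
    this.providers.set(provider.name, provider);
  }

  resolve(name: string): EmbeddingsProvider {
    const provider = this.providers.get(name);
    if (provider === undefined) {
      throw new Error(`unknown embeddings provider: ${name}`);
    }
    return provider;
  }
}
```

A real provider would wrap an HTTP call or a local model behind the same `embed` method. The same pattern could be repeated for inference endpoints and vector stores, with one contract per kind of component.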

Technicals:

HNSW vector indexes using PgVector 0.5
50 min indexing time for 1 million rows
APIs
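The figures above can be sketched in code. The first helper builds a `CREATE INDEX` statement for an HNSW index, and the second works out the average throughput implied by 1 million rows in 50 minutes. Table, column, and index names are hypothetical. The cosine operator class is an assumption about the distance metric in use, and the `m` and `ef_construction` values shown are PgVector's documented defaults rather than this project's settings.

```typescript
// Accept only plain lowercase SQL identifiers so the names are safe to interpolate.
function assertIdentifier(name: string): void {
  if (!/^[a-z_][a-z0-9_]*$/.test(name)) {
    throw new Error(`invalid SQL identifier: ${name}`);
  }
}

// Build a CREATE INDEX statement for an HNSW index on a vector column.
// m = 16 and ef_construction = 64 are PgVector's defaults (assumed here, not project settings).
function hnswIndexSql(
  table: string,
  column: string,
  m: number = 16,
  efConstruction: number = 64
): string {
  assertIdentifier(table);
  assertIdentifier(column);
  return (
    `CREATE INDEX ${table}_${column}_hnsw_idx ON ${table} ` +
    `USING hnsw (${column} vector_cosine_ops) ` +
    `WITH (m = ${m}, ef_construction = ${efConstruction});`
  );
}

// Average throughput implied by a row count and a duration in minutes.
function rowsPerSecond(rows: number, minutes: number): number {
  if (minutes <= 0) {
    throw new Error("minutes must be positive");
  }
  return rows / (minutes * 60);
}

// 1,000,000 rows in 50 minutes is 1,000,000 / 3,000 seconds, about 333 rows per second.
const impliedThroughput = Math.round(rowsPerSecond(1000000, 50));
```

The throughput figure is an average only. It says nothing about which stages (ingestion, embedding, or index build) the 50 minutes covers.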

