LlamaIndex
Corporate Round in 2025
LlamaIndex is a versatile framework that connects custom data sources with large language models. It offers advanced data connectors, secure cloud infrastructure, and customizable indexing to enable businesses to create scalable AI agents for document research, workflow automation, and insights generation.
Fennel AI
Acquisition in 2025
Whether it is recommending videos to watch on TikTok, things to buy on Amazon, or jobs to apply for on LinkedIn, recommendation engines are the drivers of the modern digital economy. However, the technology to power these recommendations has so far been available only to a select few big tech companies.
We, at Fennel AI, are an ex-Facebook/Google team that’s on a mission to enable all the companies in the world to harness this technology and build delightful products for their customers.
Gable is a business-to-business data infrastructure software-as-a-service company that provides a platform aimed at enhancing collaboration through the writing and execution of data contracts. This platform facilitates effective communication between data providers and consumers, promoting improved data quality on a larger scale. Additionally, Gable offers a management platform focused on enhancing data visibility and governance. By employing artificial intelligence, Gable's platform scans code to identify data creation points, monitor its movement, and assess its effects prior to deployment. This comprehensive approach helps organizations prevent disruptions, fosters collaboration, and ensures compliance with relevant standards.
Blade Bridge
Acquisition in 2025
Blade Bridge offers a suite of tools that accelerate data projects for system integrators (SIs) and product vendors.
Twelve Labs
Series A in 2024
Twelve Labs offers a platform that empowers businesses and developers to analyze and interpret video content through multiple modalities like visual and auditory data. This enables the creation of intelligent video applications across industries such as technology, media, entertainment, and security.
Koantek
Venture Round in 2024
Koantek is an IT consulting firm specializing in data-driven solutions. It assists businesses in maximizing their data and AI investments by providing advisory and implementation services in data strategy, migration, engineering, advanced analytics, and cloud infrastructure automation. By simplifying data complexities, Koantek helps clients accelerate business growth.
SuperAnnotate
Series B in 2024
SuperAnnotate is a platform for building high-quality training datasets for Generative AI, large language models, computer vision, and natural language processing. It provides advanced annotation and data management tools, automation features, data curation, orchestration, and an integrated workforce marketplace to help machine learning teams organize, annotate, and scale image, video, text, and lidar data. The platform supports collaboration, quality management, and pipelines to train and deploy accurate models, aiming to accelerate ML development and improve dataset integrity.
Galileo.ai is a technology company specializing in AI application management. It offers a suite of services including development, testing, monitoring, and security for AI applications. The core product is an AI platform designed to enhance machine learning processes by automatically identifying errors and data gaps, thereby improving efficiencies, reducing costs, and mitigating biases across various industries such as healthcare, finance, and insurance.
Braintrust Data
Series A in 2024
Braintrust offers an AI stack to simplify the process from evaluations to data management, ensuring seamless integration into any business. Braintrust simplifies the evaluation process by facilitating easy scoring, logging, and visualization of outputs. Users can investigate failures, monitor performance trends, and address queries like identifying regressions or assessing new models.
The platform offers a Prompt Playground, allowing users to compare multiple prompts, benchmarks, and input/output pairs across runs, enabling both ephemeral tinkering and experiment evaluation on large datasets.
In Continuous Integration, Braintrust seamlessly integrates into workflows, enabling progress tracking on the main branch and automatic comparison of new experiments with live versions before deployment.
With a focus on Datasets, the platform enables the effortless capture of rated examples from staging and production, incorporating them into versioned "golden" datasets stored in the cloud. This ensures the evolution of datasets without jeopardizing evaluations dependent on them.
Voyage AI
Series A in 2024
Voyage AI specializes in creating state-of-the-art embedding models and rerankers, designed to enhance the quality and efficiency of unstructured data search and retrieval, particularly in retrieval-augmented generation (RAG) systems. Led by top-tier researchers, the company's offerings outperform competitors in terms of accuracy, speed, and cost-effectiveness, and provide flexible deployment options. Voyage AI caters to specific domains such as code, finance, and law, delivering tailored, high-precision models. Additionally, it offers customized solutions and bespoke models to address unique business needs, enabling clients to improve their AI-driven processes and data analysis capabilities.
Cube Dev provides an open-source analytical API platform that enables internal business intelligence and customer-facing analytics through a semantic layer. The semantic layer unifies business logic, centralizes governance and security, and optimizes query performance, while supporting integration with any data endpoint. Cube Cloud offers a managed infrastructure with query inspection and tracing, monitoring, and pre-aggregation management. The platform includes visualization-agnostic tools for building user interfaces and analytical APIs, and supports modern data stores to handle large data volumes with built-in security. Cube Dev serves organizations seeking scalable, governed analytics applications.
XponentL Data
Seed Round in 2024
XponentL Data is a technology company that specializes in data and artificial intelligence (AI) platforms. Its core business is to streamline the process of data-driven decision-making by automating data collection, preparation, and analysis. The company's platform bridges the gap between data producers and consumers, delivering data products that offer insights, fuel AI, and capture value. This enables businesses to extract knowledge from their data and make more informed decisions, ultimately increasing productivity and fostering creativity.
Fireworks AI
Series A in 2024
Fireworks AI develops a generative artificial intelligence platform that enables rapid product iteration while minimizing operational costs. Its platform focuses on running, fine-tuning, and sharing large language models to solve product challenges efficiently.
Lilac AI
Acquisition in 2024
Lilac AI provides data science tools for improving the quality of data for generative AI applications and large language models (LLMs).
Unstructured
Series B in 2024
Unstructured is a developer of an open-source data transformation platform that simplifies the process of converting raw data, such as PDFs and Microsoft Office documents, into formats compatible with language models. The platform supports over 25 file types, including PDF, DOC, and PPTX, and offers connectors to various systems like SharePoint, S3, and Databricks. By facilitating effortless data extraction and integration into AI workflows, Unstructured enhances the accessibility of human-generated information, ensuring that it is readily available for generative AI systems. Its modular architecture allows users to incorporate any third-party model, making it a versatile solution for preprocessing natural language data for machine learning applications.
Mistral AI
Venture Round in 2024
Mistral AI specialises in developing advanced artificial intelligence solutions. It focuses on creating state-of-the-art AI models for natural language processing and complex problem-solving, aiming to enhance business efficiency and decision-making.
Adaptive ML
Series A in 2024
Adaptive ML is a technology company that specializes in developing a Large Language Model (LLM) platform. This platform enables businesses to train and deploy language models, with a unique feature of incorporating user feedback to enhance model performance. By leveraging company data, user interactions, and feedback, Adaptive ML's platform generates AI models that continuously learn and improve, helping businesses achieve superior performance without requiring expertise in complex techniques like reinforcement learning.
Entrada
Seed Round in 2024
Entrada recognizes the transformative impact of data on business agility and innovation.
Founded to empower Databricks users to unlock their data's full potential, Entrada offers expert services in modernizing data platforms to drive business objectives and monetization of data-centric services. As a trusted partner, Entrada is dedicated to ensuring businesses can fully harness their data for great decision-making and excellent customer experiences.
Glean develops AI-based search engine software that connects enterprise data and generates answers through a tool integrated into any company's existing workflows. Its platform, Workplace Search, connects to internal data sources, enabling employees to find information and receive personalized answers using advanced search techniques, retrieval augmented generation, and large language models.
Einblick
Acquisition in 2024
Einblick is a visual data computing platform that enhances organizations' ability to analyze data, forecast outcomes, and make informed decisions. The platform uniquely integrates the computational capabilities of traditional data science notebooks with modern canvas-based collaboration tools, creating a user-friendly graphical environment for building and deploying models. This touch-enabled interface allows data scientists to efficiently construct high-performance models and present them interactively to decision-makers. Einblick's clientele includes prominent organizations such as a major German luxury car brand, DARPA, and a significant internet service provider, highlighting its diverse application across various industries.
Anomalo is a developer of an artificial intelligence data validation tool that enables users to continuously inspect and validate data entering their warehouses. The company’s solution automatically detects and explains issues in enterprise data, facilitating seamless integration with data warehouses. By employing automated machine learning technology, Anomalo’s tool allows organizations to validate and document their data with minimal configuration, eliminating the need for users to write any code. This innovation helps companies maintain data integrity and improve the reliability of their data-driven decisions.
Mistral AI
Series A in 2023
Mistral AI specialises in developing advanced artificial intelligence solutions. It focuses on creating state-of-the-art AI models for natural language processing and complex problem-solving, aiming to enhance business efficiency and decision-making.
Arcion
Acquisition in 2023
Arcion is a cloud-native data mobility platform that specializes in high-performance, real-time data pipelines. The company offers an autonomous migration and cloud-neutral database replication solution, allowing businesses to efficiently migrate database updates to streaming data pipelines. This capability provides a consistent, real-time view of customer data and business intelligence across various applications and business units. By streamlining data migration processes, Arcion helps organizations reduce migration and licensing costs while enhancing the productivity of their engineering teams.
Prophecy.io
Series B in 2023
Prophecy.io offers a data transformation copilot that assists users in developing, deploying, and monitoring data pipelines across cloud platforms. The platform integrates AI and a visual interface to enhance productivity for various data users, enabling them to manage complex data workflows with ease.
Cleanlab specializes in enhancing the reliability of artificial intelligence systems by focusing on data-centric approaches. Its platform improves dataset quality, identifies low-quality outputs, determines root causes, enhances response accuracy, and applies guardrails to enable safe, accurate, and scalable AI deployment.
DigPath
Non Equity Assistance in 2023
We are an innovative AI-driven digital pathology company, dedicated to shaping the future of diagnostics and research, with a mission to empower accurate diagnoses and elevate patient care globally.
Neon is a cloud-native, fully managed Postgres as a service. By separating storage from computing, Neon offers autoscaling, branching, and bottomless storage to give developers a simple, reliable, and powerful experience. Neon aims to provide a highly performant and cost-effective database infrastructure by leveraging cloud-native technologies and innovative architectural features.
Hightouch
Series B in 2023
Hightouch operates a customer data platform that synchronizes data between various marketing and operational tools. It connects businesses' existing data warehouses with applications like CRM systems, email platforms, and advertising networks for personalized marketing campaigns and enhanced customer engagement.
MosaicML
Acquisition in 2023
MosaicML is a company focused on creating an efficient infrastructure for training large language models and improving the overall efficiency of neural networks. It develops software and artificial intelligence training algorithms that enhance the training process by utilizing algorithmic techniques such as sparsity and network pruning. These innovations allow users to effectively and securely train large-scale AI models on their proprietary data, while also optimizing for speed, quality, and cost. MosaicML aims to streamline the machine learning model recomposition process, making it easier for organizations to harness the power of AI in their operations.
Snowplow
Venture Round in 2023
Snowplow is an enterprise-grade event analytics platform that specializes in behavioral data management. It empowers data teams by providing tools to track, contextualize, validate, and model customer interactions on websites and applications. The platform integrates web analytics with various third-party data sources, allowing businesses to gain comprehensive insights into customer behavior. Snowplow's solutions facilitate customer journey analytics, marketing attribution, product analytics, and paywall optimization, addressing complex data challenges and enhancing overall data-driven decision-making for organizations.
Lovelytics
Venture Round in 2023
Lovelytics is a data, AI, and analytics consulting company that specializes in transforming data into actionable insights for leading organizations. The firm offers a range of services, including data advisory, enterprise data environment design and implementation, data science and machine learning, data visualization, and training. By partnering with clients, Lovelytics focuses on enhancing self-sufficiency and hands-on enablement, ultimately driving business outcomes and creating sustainable value. Through its expertise, the company aims to help clients optimize and modernize their data ecosystems, ensuring they can better understand and leverage their data for strategic decision-making.
Catalyst Software
Venture Round in 2023
Catalyst Software Corporation, headquartered in New York, develops an intuitive customer success platform designed to enhance customer experience and reduce churn for businesses. As a Software-as-a-Service (SaaS) provider, it offers a comprehensive suite of features including analytics, workflow automation, product usage tracking, and a task manager that consolidates various communication tools into a single interface. The platform allows users to log customer interactions automatically, create 360° profiles, and manage campaigns and account segmentation effectively. Additionally, it integrates with other SaaS applications to provide a unified dashboard that facilitates data-driven decision-making around customer success. Catalyst aims to empower teams to identify expansion opportunities and drive recurring revenue growth by aligning strategic actions with customer objectives. Founded in 2016, the company is now part of Totango.
Immuta
Venture Round in 2023
Immuta, Inc. is a data security company that specializes in providing organizations with a platform for managing data privacy, security, and access control. The Immuta Data Security Platform enables users to discover and classify sensitive data, implement access control policies, and monitor data usage without the need for coding. This automated data governance solution supports self-service access to data while ensuring compliance with various regulations. Immuta is utilized by a diverse range of industries, including finance, healthcare, government, and manufacturing, to facilitate cloud migration and secure collaboration. Founded in 2014 and headquartered in College Park, Maryland, with additional offices in Boston, Massachusetts and Columbus, Ohio, Immuta has established itself as a trusted partner for Fortune 500 companies and government agencies worldwide.
Okera Inc. is a data management company that operates an Active Data Access Platform designed to streamline data provisioning, access, governance, and auditing. Founded in 2016 and based in San Francisco, California, with an additional office in Seattle, Okera focuses on enhancing data security, privacy, compliance, and sensitive data management. The platform enables organizations to automatically discover and audit data lakes, create no-code access policies through a visual policy engine, and enforce fine-grained access controls across hybrid and multi-cloud environments, including AWS and Azure. By implementing comprehensive data access controls, Okera empowers data teams to confidently harness the potential of their data for innovation and growth while navigating the complexities of evolving data privacy regulations.
Perplexity
Series A in 2023
Perplexity is an AI-driven search engine platform that combines large language models with traditional search engines. It uses natural language processing (NLP) and generative AI to provide conversational responses, bridging the gap between conventional search engines and interactive AI assistance.
Matillion
Venture Round in 2022
Matillion Ltd. specializes in cloud data integration software solutions that empower companies to effectively utilize their data. The company offers a range of products, including Matillion ETL for Amazon Redshift and Matillion ETL for Snowflake, both of which facilitate the extraction, loading, and transformation of structured and semi-structured data in cloud environments. Additionally, Matillion Data Loader serves as a SaaS-based tool that loads data from source systems into cloud data warehouses, enhancing data accessibility for informed decision-making. Matillion also provides a business intelligence solution for self-service reporting and analytics, alongside Matillion Exchange, a marketplace for users to share and download integration jobs. With a client base that includes Fortune 500 companies and mid-sized tech enterprises, Matillion operates from its headquarters in Manchester, United Kingdom, and has offices in New York, Denver, and Seattle. The company, established in 2010, is dedicated to accelerating data readiness and maximizing the impact of data across various industries.
DataJoy
Acquisition in 2022
DataJoy is a developer of a revenue intelligence platform that integrates data across various organizational functions, including marketing, sales, product, and finance. The platform utilizes machine learning algorithms to analyze this unified data, providing insights that help companies understand and enhance their revenue performance. By tracking key performance indicators and detecting anomalies, DataJoy enables organizations to make informed projections and optimize their strategies for growth. Ultimately, the company aims to assist businesses in building a repeatable, profitable, and predictable revenue model.
Tecton offers an enterprise-grade feature store, empowering businesses to harness machine learning effectively. Its platform addresses unique ML data requirements, enabling teams to build, serve, and scale features swiftly and reliably.
Cortex Labs
Acquisition in 2022
Cortex Labs is a developer of a serverless computing platform that supports machine learning engineering teams by providing cloud-native model serving infrastructure. The platform is designed to facilitate the deployment of large-scale machine learning applications, including computer vision and natural language processing. It enables users to build and integrate APIs into any application, manage both real-time and batch inference workloads, and create streamlined, reproducible workflows. By doing so, Cortex Labs empowers engineering teams to efficiently ship machine learning applications into production.
Hex Technologies
Series B in 2022
Hex is a software company that provides a collaborative data science and analytics platform. It helps individuals learn and organizations know things so they can make better decisions. Hex brings together SQL, Python, R, and no-code tools in powerful notebooks and allows users to publish projects as interactive data apps that anyone can use with one click.
Founded in 2016, dbt Labs develops an open-source analytics engineering tool that empowers data analysts with SQL knowledge to build and share organizational knowledge through data modeling. Its platform facilitates collaborative deployment of analytics code, adhering to software engineering practices.
Revelate is a developer of a data fulfillment platform that offers a comprehensive suite of capabilities for data sharing and commercialization. The platform is designed to alleviate the challenges faced by data teams in distributing data according to consumer needs, both within and outside their organizations. By seamlessly integrating into existing data ecosystems, Revelate empowers companies to prepare, package, and distribute data efficiently and effectively from any source to any recipient. This innovative approach helps organizations fully realize the value of their data assets.
Hunters is a cybersecurity company that specializes in developing an artificial intelligence-based platform for detecting and responding to cyber threats. Founded in 2018 and headquartered in Tel Aviv, Israel, the company offers its solution, Hunters.AI, which autonomously identifies cyberattacks that may evade traditional security measures across various IT environments, including cloud and network systems. Hunters.AI integrates diverse security telemetry and intelligence, enriching threat signals with detailed tactics, techniques, and procedures to enhance detection capabilities. By utilizing machine learning and cloud-based analytics, the platform correlates threat patterns and generates high-fidelity attack narratives, enabling cybersecurity teams to respond swiftly and effectively to potential breaches.
Labelbox, Inc. is a technology company that specializes in providing an AI-driven platform for data labeling and management. Founded in 2018 and headquartered in San Francisco, California, the company enables businesses to outsource their data annotation needs, facilitating the creation and management of datasets essential for machine learning applications. The platform features a visual workflow interface, annotation tools, quality control capabilities, and performance analytics, which collectively streamline the data labeling process. Labelbox supports teams in utilizing the latest advancements in generative AI and large language models, ensuring that AI systems receive appropriate human oversight and automation. Its services are utilized by prominent enterprises, including Walmart, Procter & Gamble, Genentech, and Adobe, as well as numerous leading AI teams across various industries.
8080 Labs
Acquisition in 2021
8080 Labs is a software development company that specializes in creating tools to enhance the accessibility of data science for users of all skill levels. The company's flagship product, bamboolib, is a user-friendly, UI-based data science tool that allows users to quickly and efficiently explore and transform data without the need for coding. By streamlining the data manipulation process, 8080 Labs empowers data scientists to boost their productivity, enabling them to focus on analysis rather than programming. The company offers both paid and open-source software, reinforcing its commitment to making data science tools available to a broader audience.
Redash
Acquisition in 2020
Redash, Ltd. is a company founded in 2015 and based in Tel Aviv-Yafo, Israel, that specializes in developing an open-source platform designed to facilitate data-driven decision-making for organizations. The platform enables data scientists and SQL analysts to integrate various data sources, such as operational databases and data lakes, into cohesive dashboards. By democratizing data access, Redash allows enterprises to visualize and share data insights effectively, fostering a culture of data utilization within organizations. In June 2020, Redash became a subsidiary of Databricks Inc., further enhancing its capabilities in the data analytics landscape.
Theom is an IT company that provides cloud and data security services to discover, track, and protect enterprise data in cloud environments. Its cloud-native security platform deploys quickly to uncover risks of data loss, prioritize corrective actions, and enable enterprises to securely use data in the cloud while focusing on growth. The company was founded in 2020 and is headquartered in San Francisco.