Two Sigma Ventures, established in 2012, is a New York-based venture capital firm that invests in early-stage companies. It focuses on sectors such as infrastructure tools, healthcare, real estate, consumer hardware, and artificial intelligence. The firm, a subsidiary of Two Sigma Investments, provides not only capital but also access to a network of experts and resources to support its portfolio companies. Two Sigma Ventures typically invests in 8 to 10 companies per fund, targeting seed through Series B rounds.
100 Avenue of the Americas, 16th Floor, New York, NY 10013, US
Colin Beirne
Founder and Partner
Jason Beverage
Managing Director
Kyra Durko
Principal
Villi Iltchev
Partner
David Joerg
Managing Director
Nikhil Namburi
Investor
Rohit Rao
Investor
Vin Sachidananda
Investor
Edward Schmidt
Venture Partner
Frances Schwiep
Partner
Past deals in Data Mining
Distributional
Series A in 2024
Distributional is a developer of an AI testing and evaluation platform focused on ensuring that artificial intelligence systems are safe, reliable, and secure. The company offers tools for continuous monitoring, which include generating test cases and analyzing results. These tools help organizations ensure that their AI systems perform as expected, thereby fostering trust in AI technologies. By providing these capabilities, Distributional enables customers to deploy AI products more frequently and effectively, allowing them to fully leverage the potential benefits of AI within their operations.
Anomalo
Series B in 2024
Anomalo is a developer of an artificial intelligence-based data validation tool that helps organizations continuously inspect and validate the data entering their warehouses. The company’s solution automatically detects and explains issues in enterprise data, facilitating a seamless connection to data warehouses. By employing automated machine learning technology, Anomalo enables companies to validate and document their data with minimal configuration, eliminating the need for users to write any code. This approach streamlines the data validation process, enhancing data integrity and reliability for enterprises.
Distributional
Seed Round in 2023
Crux
Series B in 2023
Crux Informatics Inc., founded in 2017 and headquartered in San Francisco, California, specializes in data processing and management services through its external data automation platform. This platform facilitates the integration, transformation, and observability of third-party data, effectively bridging the gap between data suppliers and consumers. Crux automates the development of data pipelines, ensures ongoing validation of data quality, and manages operations on a large scale across various cloud platforms. By leveraging advanced technology and a team of experienced data engineers, Crux enhances the efficiency of external data integration workflows, catering to the needs of companies seeking to optimize their data operations. The company has garnered support from prominent financial institutions, indicating its strong presence in the industry.
Hexagon Bio
Venture Round in 2022
Hexagon Bio, Inc. is a biotechnology company based in Menlo Park, California, focused on discovering novel drugs for diseases that lack effective treatments. Established in 2016, the company utilizes a data-driven approach that combines genomics, synthetic biology, and computational techniques. By mining genomic data from fungal genomes, Hexagon Bio aims to identify targeted small molecule therapeutics. Its proprietary platform leverages data science to engineer drugs from DNA sequences, enabling the identification of small molecule inhibitors that interact with specific target proteins. This innovative method allows researchers to tap into the global metagenome, facilitating the development of new medicines aimed at improving patient outcomes.
Flatfile
Series B in 2022
Flatfile Inc. specializes in data onboarding solutions that enable developers to efficiently validate, map, and import CSV data from various web applications. Founded in 2018 and based in Denver, Colorado, Flatfile offers a platform that integrates with software applications and lets users upload data in formats such as CSV, XLS, or TSV. Its key products include Portal, which provides an import button embedded via a JavaScript snippet, and Concierge, which creates secure collaborative workspaces for organizations to manage complex data ingestion challenges. The platform is designed to enhance developer productivity, reduce costs, and improve data quality by learning to identify and clean incoming data over time. Numerous companies, including AstraZeneca and Square, use Flatfile’s API-first platform to streamline their data import processes.
Hivemind Technologies
Acquisition in 2022
Hivemind Technologies AG is a company based in Cologne, Germany, specializing in the development of social video platforms and mobile applications. It provides a range of application programming interface (API) solutions that facilitate topic identification, social metrics, polarity analysis, language detection, and news monitoring. The company offers tools for publishers, including analytics for tracking and comparing articles, news widgets to enhance websites with relevant feeds, and a dashboard for monitoring social news performance. Additionally, Hivemind Technologies delivers consulting services to assist clients in understanding data flows, selecting appropriate technology stacks, designing automated processes, and building data science teams. With a focus on machine learning and data analytics, the company aims to help businesses modernize their technology and effectively manage large volumes of data. Founded in 2011, Hivemind Technologies was formerly known as Alpha 12.
Radar
Series C in 2022
Radar provides a location data infrastructure platform designed for app developers to integrate location-based services into their applications. The platform facilitates the creation of personalized and contextually aware experiences through features such as geofencing, place detection, and location tracking. It supports a variety of use cases across sectors like retail, travel, and logistics, and offers tools for easy integration, including mobile SDKs and APIs. Radar's capabilities include detecting home, work, and traveling locations, as well as geocoding addresses into geographic coordinates. The company emphasizes privacy and security, ensuring responsible handling of location data. Founded in 2016, Radar continues to innovate with new APIs in private beta that enhance the functionality of its platform.
Comet
Series B in 2021
Comet is doing for ML what GitHub did for code. It allows data scientists to automatically track their datasets, code changes, experimentation history, and production models, improving efficiency, transparency, and reproducibility. Comet.ml is the first platform built for ML that enables engineers and data scientists to efficiently maintain their preferred workflow and tools, while easily tracking previous work and collaborating throughout the iterative process. Comet.ml also optimizes models with Bayesian hyperparameter optimization, which saves time typically spent manually tuning ML models. As a result, teams gain greater visibility into data science work, ML results, and progress across the organization.
Anomalo
Series A in 2021
Hexagon Bio
Series B in 2021
Castor
Series B in 2021
Castor is an international health-tech company that provides a cloud-based clinical data platform designed to streamline the clinical trial process for researchers globally. Founded by CEO Derk Arts, MD, Ph.D., the platform is utilized by over 50,000 researchers in 90 countries, supporting more than 4,000 studies across diverse therapeutic areas, including diabetes, cardiovascular disease, rare diseases, infectious diseases, and oncology. Castor's platform facilitates the collection and analysis of extensive data from both traditional and remote trials, having reached significant milestones such as 180 million data points and 2 million enrolled patients. The company's mission is to make research data reusable, thereby enabling AI-driven clinical trials and enhancing the overall impact of data within the medical research community.
Comet
Series A in 2021
Flatfile
Series A in 2021
Hexagon Bio
Series A in 2020
Recursion Pharmaceuticals
Series D in 2020
Recursion Pharmaceuticals, Inc. is a clinical-stage biotechnology company based in Salt Lake City, Utah, that focuses on revolutionizing drug discovery through the integration of advanced technologies such as artificial intelligence, automation, and bioinformatics. Founded in 2013, the company has developed a comprehensive drug discovery platform that includes various tools and software, such as ReChem for chemical compound design, ReScreen for managing complex experimental workflows, and RePredict for modeling drug relationships using machine learning. These innovations enable Recursion to generate extensive biological and chemical datasets, facilitating the exploration of foundational biology and accelerating the development of new therapeutic solutions. Through its unique approach, Recursion aims to significantly enhance patient outcomes and streamline the drug discovery process.
Castor
Series A in 2020
Flatfile
Seed Round in 2020
Comet
Venture Round in 2020
Radar
Series B in 2020
Recursion Pharmaceuticals
Series C in 2019
Radar
Series A in 2019
Zymergen
Series C in 2018
Zymergen, Inc. is a biotechnology company that focuses on researching, developing, and manufacturing microbes for various industries, including agriculture, chemicals, materials, pharmaceuticals, electronics, and personal care. Founded in 2013 and headquartered in Emeryville, California, Zymergen employs a platform that integrates automation, machine learning, and genomics to enhance the efficiency of microbial strain optimization and production processes. This technology enables the company to improve existing manufacturing strains and facilitates the development of new products by engineering novel molecules from microbes. With additional offices in Boise, Idaho; Medford, Massachusetts; Seattle, Washington; and Tokyo, Japan, Zymergen aims to partner with nature to create innovative materials and products that deliver significant value across multiple sectors.
Crux
Series B in 2018
Enigma
Series C in 2018
Enigma Technologies, Inc. is an operational data management and intelligence company based in New York. It specializes in providing a searchable database of public records, information, and documents, facilitating streamlined operations and informed decision-making for its users. The company offers a suite of products, including Enigma Data Infrastructure, which features tools for data operations and metadata enhancement, and Enigma Solutions, tailored for specific industries such as financial services, pharmacovigilance, and insurance. Enigma also provides analysis-ready public data for sectors such as oil and gas and healthcare, as well as company reference data. Additionally, Enigma operates Enigma Labs, which focuses on developing open data tools for public use, and offers an API to support developers in creating data-rich applications. The company is recognized for its contributions to small business intelligence, delivering timely and accurate insights on the identity and risk profile of small businesses, thus aiding firms in areas such as insurance risk assessment and fraud prevention. Enigma Technologies was incorporated in 2011.
Comet
Seed Round in 2018
Crux
Corporate Round in 2018
Recursion Pharmaceuticals
Series B in 2017
Zymergen
Series B in 2016
Terbium Labs
Series A in 2016
Terbium Labs, Inc., founded in 2013 and based in Baltimore, Maryland, specializes in data intelligence solutions for information security. The company offers Matchlight, a comprehensive and fully private dark web monitoring system designed to detect stolen data and mitigate the impact of data breaches. This automated platform monitors for leaks of sensitive information, providing organizations with real-time alerts when their data appears in unauthorized locations on the internet or dark web. By enabling proactive risk management, Terbium Labs helps organizations safeguard their high-value data against theft and misuse.
Theorem
Venture Round in 2016
Theorem operates an investment management platform that leverages machine learning and data science to invest in loans from online lending platforms. Founded in 2014, the company initially managed $50,000 in assets and has since grown to over $800 million in assets under management. Theorem's approach combines statistical analysis with credit assessments and human insights, which enables it to generate strong yields across various economic conditions. This dual focus benefits both investors, who receive attractive returns, and borrowers, who gain access to low-interest loans. The leadership team includes experienced professionals with backgrounds in finance and technology.
Zymergen
Series A in 2015
Enigma
Series B in 2015
Switchboard Software
Seed Round in 2015
Switchboard Software, Inc. is a San Francisco-based company that offers a software-as-a-service data operations platform designed to help enterprises manage and utilize their data effectively. Founded in 2014 by a team that previously developed Google BigQuery, Switchboard enables organizations to consolidate disparate data into a single, reliable source in real time. Its platform provides tools for cloud data monitoring, dashboard creation, and customizable reporting, which assist clients in optimizing inventory yield and generating actionable insights. Market-leading companies, including Dotdash Meredith, Target, and the Financial Times, leverage Switchboard’s solutions to enhance customer insights and revenue operations while maintaining control over their data without the complexities of daily management.
Indico Data
Seed Round in 2014
Indico Data Solutions, established in 2013 and headquartered in Boston, Massachusetts, specializes in enterprise-level artificial intelligence solutions for automating unstructured content processing. Its platform empowers businesses to create tailored machine learning models using smaller datasets, streamlining workflows such as contract analysis, regulatory compliance, and customer support automation. The company's tools facilitate the extraction and analysis of data from documents, images, and other non-standard formats, enabling users to access relevant information for diverse applications.
Ufora
Seed Round in 2011
Ufora, Inc. is a New York-based company that designs and operates a data analytics platform focused on quantitative modeling and numerical computing applications. Founded in 2008, Ufora's platform facilitates the exploration, discovery, deployment, and iteration of data sets, empowering data scientists to address complex challenges in statistics, machine learning, and predictive analytics. The platform is built to operate at a modern scale, automatically managing computations and data across multiple machines, which allows users to work in a familiar coding environment without the need for specialized parallel programming. Ufora aims to provide seamless access to high-performance computing for big data analysis, enhancing the efficiency and capability of data professionals.