Best data lineage tools in 2025

Pachyderm

Automated solution for managing and tracking data workflows.

Secoda AI

Unifies data governance with cataloging, observability, and lineage.

Observo

Smart data pipeline for optimized observability and threat response.

Hexo AI

Automated data transformation for efficient analysis and insights.

SvectorDB

Serverless vector database optimized for AWS environments.

Amazon Redshift

Cloud data warehouse for efficient data storage and analysis.

UserSketch

Streamlined data management and communication for businesses.

Recall - Mobile Apps

Transform scattered information into a smart, organized database.

Tegile Intelligence

Storage solution optimized for AI workloads and real-time analytics.

OpenStack CLI

Open-source software for building and managing cloud infrastructure.

Alibaba Genie

Cloud infrastructure for scalable computing and data storage solutions.

GeoMesa

Efficiently manage and analyze large geospatial datasets.

BigDL

Run deep learning models efficiently on large datasets.

Hadoop

Framework for processing large data sets across multiple systems.

Amazon Aurora

Managed database solution with high performance and scalability.

Qubole

Cost-effective data lake solution for efficient analytics.

IBM Watson Natural Language Generation

Intelligent software that generates human-like written content.

Apache Samza

Real-time data processing framework for stateful applications.

Spark SQL

Run SQL queries on big data with ease and efficiency.

Meteron AI

Resource management for AI development and deployment.

Yugabyte DB

Distributed SQL database for mission-critical applications.

ClearGPT

Launch large language models with ease and efficiency.

Knowbase

Store and interact with knowledge in one organized space.

Appliful

Launch web applications within a day with minimal hassle.

Paid + from $149
open
Marple AI

Real-time data analysis for informed decision-making in engineering.