Overview
Data lineage shows the complete journey of your data - from source systems through transformations to final entities. Entegrata provides visual lineage tracking to help you understand data flow, troubleshoot issues, and maintain compliance. This guide covers how to view and interpret lineage information for your entities and fields.Understanding Data Lineage
What is Data Lineage?
Data lineage traces the lifecycle of data:- Origin: Where data comes from (source systems, tables, fields)
- Transformations: How data is modified (CONCAT, CASE, COALESCE, etc.)
- Dependencies: What data depends on other data
- Destination: Where data ends up (entities, fields, downstream systems)

Why Lineage Matters
Troubleshooting:- Trace data quality issues back to source
- Understand unexpected values
- See what breaks if you change a source
- Identify downstream dependencies before changes
- Plan migrations and updates safely
- Document data provenance for regulations
- Track sensitive data through systems
- Audit data access and usage
- Understand existing pipelines
- Onboard new team members
- Maintain institutional knowledge
Viewing Entity Lineage
Accessing Lineage View
Using Lineage for Troubleshooting
Tracing Data Quality Issues
Identify Problem
You notice incorrect data in an entity field.Example: Customer names are showing as “NULL NULL”
Trace Backward
Follow the lineage upstream to identify:
- Which source field(s) provide the data
- Where the issue is introduced
Check Each Step
Examine each node in the path:
- Check transformation logic (is CONCAT handling nulls correctly?)
- Verify join conditions (are records matching?)
Identify Root Cause
The lineage helps pinpoint:
- Bad source data
- Incorrect transformation logic
- Failed joins
- Missing default values
Impact Analysis for Changes
Identify Affected Entities
The lineage shows:
- All entities using this source field
- Indirect dependencies
Plan Updates
Create a list of all mappings that need updating:
- Direct field mappings
- Transformations using the field
- Validation rules referencing it
Lineage for Compliance
Data Privacy Compliance
Track sensitive data through your systems:Tag Sensitive Fields
In source systems or entity fields, add tags like “PII”, “PHI”, “Confidential”.
Trace Sensitive Data
Follow lineage to see:
- Where sensitive data originates
- Where it’s stored
- Who has access
Audit Trail
Lineage includes audit information:- Who created each mapping
- When it was created or modified
- What changes were made
- Why (from publish notes)
Lineage Best Practices
Lineage Limitations
Current Limitations:
- Lineage shows design-time flow, not runtime data paths
- External transformations (outside Entegrata) not included
- Very complex transformations may be simplified in visualization
- Cross-workspace lineage requires additional configuration
- Historical lineage shows current state only, not previous versions
Troubleshooting Lineage
Lineage Not Showing
Issue: Lineage view is blank or incomplete. Solutions:- Ensure entity has been published at least once
- Refresh the page
- Check that you have permission to view lineage
- Verify sources are still connected
- Try exporting - may show more than visualization
Missing Connections
Issue: Some connections don’t appear in lineage. Solutions:- Check if fields are actually mapped
- Verify transformations are saved
- Republish the entity to refresh lineage
- Look for indirect connections (via intermediate transformations)
