Documentation Index
Fetch the complete documentation index at: https://support.entegrata.com/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Entegrata connectors are the integrations that move data between your source systems and the Lakehouse. They fall into a few broad types — each with its own setup style, authentication options, and operational characteristics. This page covers connector types and the general requirements that apply to all of them. For the full, card-by-card catalog of every specific system Entegrata integrates with, see the Data Connectors page.Browse all connectors
Full catalog of specific systems Entegrata can collect from and emit to, with per-connector setup guides
Connector Types
SQL Connectors
SQL connectors read directly from relational databases. This includes generic engines (Microsoft SQL Server, Oracle, Microsoft Fabric SQL / Warehouse / Lakehouse) as well as purpose-built legal platforms whose data lives in SQL Server (Aderant, 3E, Iridium, Intapp, Foundation, and others). Key features:- Direct database access — query tables and views in the source with no middleware
- Schema discovery — tables, views, columns, and constraints are discovered automatically
- Incremental extraction — pull only changed rows using a cursor column
- Sync & Delete Operations — most SQL connectors can detect row-level changes without a cursor and soft-delete records removed from the source (see Sync Operation)
API Connectors
API connectors integrate with SaaS platforms through their REST APIs — DealCloud, Zoom, ADP, UKG, Microsoft Entra, Dynamics CRM, and others. They handle the operational complexity of REST automatically: authentication, pagination, rate limiting, and retries. Key features:- OAuth, API key, or bearer-token authentication
- Automatic pagination and rate-limit handling
- Schema discovery from API responses (or OpenAPI specs where available)
- Scheduled collection with incremental updates
Delimited File Connectors
Delimited file connectors ingest CSV / TXT files from a directory on SFTP, Azure Blob Storage, or a local drive. They read flat-file data plus a separate schema definition file that tells the collector how to interpret each file.Supported File Types
Data files:.csv.txt
.csv.xlsx(detected automatically by file contents)
Schema Modes
The schema file format depends on theHeaders in Schema parameter set on the connection:
true(Full Schema mode): The schema definition file contains the full column definition for every table — column names, data types, and primary key flags. Use this when you need explicit control over data types or when your data files do not include a header row.false(Inferred Schema mode): The schema definition file only maps tables to files. Column names are read from the first row of each data file, and data types are inferred from the first 100 rows. Use this when your data files have headers and you want quick setup.
Required Schema File Headers
- Full Schema Mode (Headers in Schema = true)
- Inferred Schema Mode (Headers in Schema = false)
The schema file must contain these columns in the header row:
Copy-paste header row:Example:
| Column | Required | Description |
|---|---|---|
TABLE_NAME | Yes | Name of the table. Rows with the same TABLE_NAME are grouped into a single resource. |
COLUMN_NAME | Yes | Field name in the data file. |
DATA_TYPE | Yes | SQL data type (e.g. varchar(50), int, bigint, date, timestamp, decimal, bool). |
PRIMARY_KEY | No | Whether the column is part of the primary key. Accepts Y/N, true/false, 1/0, y/n, yes/no. Empty = not PK. |
FILE_PATH | Yes | Path to the data file, relative to the connection’s directory. Each table needs at least one non-empty value. |
FILE_PATH only needs a value on one row per table — subsequent rows for the same table can leave it blank to avoid repetition. To load multiple files into the same table (e.g. yearly partitions), add additional rows with the same TABLE_NAME and different FILE_PATH values.