What is Data Ingestion?
gu1βs Data Ingestion system allows you to seamlessly import data from any source (CSV files, APIs, databases, or custom formats) by defining custom schemas and field mappings. This intelligent mapping system ensures your data is properly structured for risk analysis.How It Works
1
Define Your Schema
Create a custom schema that describes your data structure with field definitions, types, and validation rules.
2
Map Fields
Create field mappings that translate your data fields to gu1βs unified entity model.
3
Transform Data
Apply transformations (formatting, calculations, conditionals) as data flows through the mapping.
4
Import Entities
Use the mapped schema to create entities via API or bulk upload.
Key Features
Custom Schemas
Define your data structure with flexible field types and validation
Field Mappings
Map your fields to gu1βs unified model with transformations
Smart Detection
Auto-detect field types, patterns, and suggested mappings
Bulk Processing
Import thousands of records efficiently with batch processing
Schema Types
gu1 supports multiple schema types for different data sources:| Type | Description | Use Case |
|---|---|---|
| database | Relational database schemas | Direct database integration |
| api | API response structures | Third-party API integration |
| file | File formats (CSV, JSON, XML) | File-based imports |
| custom | Custom data structures | Proprietary formats |
Schema Categories
Organize schemas by business domain:Financial
Financial
Bank accounts, transactions, financial statements, payment data
Identity
Identity
Personal information, identity documents, KYC data
Compliance
Compliance
Sanctions lists, PEPs, adverse media, regulatory data
Transaction
Transaction
Payment transactions, wire transfers, transaction history
General
General
Any other type of structured data
Field Types
Supported field types for schema definition:| Type | Description | Example |
|---|---|---|
| string | Text data | βAcme Corpβ, β[email protected]β |
| number | Numeric values | 1000, 99.99, -50 |
| boolean | True/false | true, false |
| date | Date/timestamp | β2025-10-03T12:00:00Zβ |
| array | List of values | [βtag1β, βtag2β] |
| object | Nested structure | {"city": "NYC", "country": "US"} |
Transformation Types
Apply transformations during field mapping:Direct
Copy field as-is with no changes
Calculate
Perform mathematical calculations
Format
Format strings, dates, numbers
Conditional
Apply if/then logic based on conditions
Lookup
Look up values from reference tables
Custom
Custom JavaScript expressions
Validation Rules
Ensure data quality with built-in validations:minLength/maxLength- String length limitsmin/max- Numeric value rangespattern- Regular expression validationenum- Allowed values listrequired- Field is mandatory
Example: Banking Data Schema
Hereβs a complete example of defining a schema for banking customer data:Best Practices
Schema Design
Schema Design
- Use descriptive field names that match your source data
- Include detailed descriptions for complex fields
- Set appropriate validation constraints
- Version your schemas (1.0.0, 1.1.0, etc.)
Field Mapping
Field Mapping
- Start with direct mappings, add transformations as needed
- Test mappings with sample data before bulk import
- Document custom transformation logic
- Handle null/missing values gracefully
Data Quality
Data Quality
- Validate data at the source before importing
- Use strict mode for production environments
- Monitor failed imports and validation errors
- Implement data cleansing for known issues
Performance
Performance
- Use bulk processing for large datasets (>1000 records)
- Set appropriate batch sizes (100-1000 records)
- Schedule imports during off-peak hours
- Monitor processing times and adjust batch sizes
Common Use Cases
CSV File Import
Import customer data from CSV files with automatic field detection
API Integration
Connect third-party APIs and sync data in real-time
Database Sync
Synchronize data from your existing databases
Banking Onboarding
Complete KYB workflow with data mapping example
API Endpoints
Create Schema
POST /custom-schemas
List Schemas
GET /custom-schemas
Create Mapping
POST /custom-schemas/mappings
Smart Detection
POST /custom-schemas/detect-fields
Next Steps
1
Create Your First Schema
Follow the Custom Schemas guide to define your data structure
2
Map Your Fields
Learn how to map fields to gu1βs model in the Field Mappings guide
3
Import Data
Start importing entities using the Entities API