CCC ETL

The service provides capabilities for extracting, transforming, and loading (ETL) data across diverse sources and sinks. It supports batch and real-time streaming architectures, managed orchestration of data workflows as directed acyclic graphs (DAGs), and serverless execution engines for processing large-scale datasets with built-in fault tolerance.

Release Details

Version: DEV
Assurance Level:
Release Manager: Development Build

Contributors

Development Team

Change Log

  • Development build - no formal changelog available

Capabilities

ID | Title | Description | Threat Mappings
CCC.ETL.CP01 | Batch Processing | Supports the processing of bounded (batch) data sources using a consistent programming model or engine. | 0
CCC.ETL.CP02 | Stream Processing | Supports the processing of unbounded (streaming) data sources using a consistent programming model or engine. | 0
CCC.ETL.CP03 | Schema Evolution | Automatically detects source data structures and manages changes in schema (e.g., column additions) over time without pipeline failure. | 0
CCC.ETL.CP04 | Distributed Data Shuffling | Provides an internal service to re-partition and group data across distributed workers for complex operations like joins and aggregations. | 0
CCC.ETL.CP05 | Windowing and Event-Time Processing | Enables grouping of data based on time attributes, supporting tumbling, hopping, and session windows with late-data handling (watermarking). | 0
CCC.ETL.CP06 | Change Data Capture (CDC) Integration | Supports incremental data ingestion by tracking changes in source transaction logs rather than full table scans. | 0
CCC.ETL.CP07 | Connectivity and Connector Library | Provides pre-built, managed connectors for a variety of sources and sinks (e.g., Object Storage, RDBMS, NoSQL, Pub/Sub). | 0
CCC.ETL.CP08 | Job Bookmarks | Persists the state of a processing job (e.g., checkpointing or bookmarks) to ensure exactly-once processing and fault tolerance. | 0
CCC.ETL.CP09 | Pushdown Optimization | The ability to translate transformation logic into the native language of the source or sink (e.g., SQL) to minimize data movement. | 0
CCC.ETL.CP10 | Visual Orchestration | Provides a graphical interface to define dependencies between extraction, transformation, and loading tasks. | 0
CCC.ETL.CP11 | Data Lineage & Metadata Tracking | Captures and exports metadata regarding the data sources, the transformation steps, and the final destination (sink), showing the flow from source to destination for compliance and debugging. | 0
CCC.ETL.CP12 | User-Defined Function (UDF) Support | Allows developers to inject custom logic (Python, Java, SQL) into the managed processing pipeline for complex transformations. | 0
CCC.ETL.CP13 | Time-Based Job Triggering | Supports time-based (cron) mechanisms to initiate data processing workflows. | 0
CCC.ETL.CP14 | Event Based Job Triggering | Supports event-based (file arrival) mechanisms to initiate data processing workflows. | 0
CCC.Core.CP01 | Encryption in Transit Enabled by Default | The service automatically encrypts all data using industry-standard cryptographic protocols prior to transmission via a network interface. | 0
CCC.Core.CP02 | Encryption at Rest Enabled by Default | The service automatically encrypts all data using industry-standard cryptographic protocols prior to being written to a storage medium. | 0
CCC.Core.CP03 | Access Log Publication | The service automatically publishes structured, verbose records of activities performed within the scope of the service by external actors. | 0
CCC.Core.CP06 | Access Control | The service automatically enforces user configurations to restrict or allow access to a specific component or a child resource based on factors such as user identities, roles, groups, or attributes. | 0
CCC.Core.CP07 | Event Publication | The service automatically publishes a structured state-change record upon creation, deletion, or modification of data, configuration, components, or child resources. | 0
CCC.Core.CP09 | Metrics Publication | The service automatically publishes structured, numeric, time-series data points related to the performance, availability, and health of the service or its child resources. | 0
CCC.Core.CP10 | Log Publication | The service automatically publishes structured, verbose records of activities, operations, or events that occur within the service. | 0
CCC.Core.CP14 | API Access | The service exposes a port enabling external actors to interact programmatically with the service and its resources using HTTP protocol methods such as GET, POST, PUT, and DELETE. | 0
CCC.Core.CP18 | Resource Versioning | The service automatically assigns versions to child resources which can be used to preserve, retrieve, and restore past iterations. | 0
CCC.Core.CP20 | Resource Tagging | The service provides users with the ability to tag a child resource with metadata that can be reviewed or queried. | 0
CCC.Core.CP23 | Network Access Rules | The service restricts access to child or networked resources based on user-defined network parameters such as IP address, protocol, port, or source. | 0
CCC.Core.CP28 | Command-line Interface | The service includes a component that reads and translates text into commands that can be executed by the service. | 0
CCC.Core.CP29 | Active Ingestion | While running, the service can receive inputs, commands, or data streams from external sources such as dedicated APIs, exposed network ports, message queues, and persistent data ingestion channels. | 0
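
Illustrative sketch for CCC.ETL.CP05 (Windowing and Event-Time Processing). The catalog does not prescribe an implementation, so the following is only a minimal, engine-agnostic illustration of tumbling windows with watermark-based late-data handling. The class name TumblingWindower, its fields, and the rule "watermark = highest event time seen minus allowed lateness" are assumptions made for this example and are not taken from any specific ETL product.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class TumblingWindower:
    """Hypothetical helper: groups (event_time, value) pairs into fixed,
    non-overlapping (tumbling) windows keyed by window start time."""
    window_size: int        # window length, in the same unit as event_time
    allowed_lateness: int   # how far the watermark trails the max event time
    max_event_time: float = float("-inf")
    open_windows: dict = field(default_factory=lambda: defaultdict(list))
    closed: dict = field(default_factory=dict)
    dropped_late: list = field(default_factory=list)

    @property
    def watermark(self) -> float:
        # Assumed heuristic: no event older than this is expected any more.
        return self.max_event_time - self.allowed_lateness

    def add(self, event_time: int, value) -> None:
        window_start = (event_time // self.window_size) * self.window_size
        window_end = window_start + self.window_size

        # An event is late when the watermark has already passed its window.
        if window_end <= self.watermark:
            self.dropped_late.append((event_time, value))
            return

        self.open_windows[window_start].append(value)
        self.max_event_time = max(self.max_event_time, event_time)

        # Finalize every open window whose end the watermark has now reached.
        for start in sorted(self.open_windows):
            if start + self.window_size <= self.watermark:
                self.closed[start] = self.open_windows.pop(start)


# Usage: 10-unit windows, watermark trailing by 5 units.
w = TumblingWindower(window_size=10, allowed_lateness=5)
for ts, val in [(1, "a"), (4, "b"), (12, "c"), (8, "d"),
                (17, "e"), (3, "f"), (26, "g")]:
    w.add(ts, val)

print(w.closed)        # windows finalized by the watermark
print(w.dropped_late)  # events that arrived after their window closed
```

In this run the out-of-order event at time 8 is still accepted because the watermark has not yet passed the end of its window, while the event at time 3 arrives after that window has been finalized and is set aside. A real engine would typically route such late events to a side output or apply a configurable policy instead of simply collecting them in a list.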