Querona is a data virtualization platform that allows all applications to use data from multiple, heterogeneous data sources. Data is accessed and integrated in real-time across distributed data sources, without copying or moving data from its source. When required, data can be extracted from the source, cached on a mix of the supported database platforms and transparently used, instead of accessing data source directly.
The process of creating the data virtualization solutions requires the following steps:
|connect||Connect to data sources and virtualize data from the source|
|govern||Catalog metadata, apply data security and governance|
|transform||Clean and transform|
|share||Share business views for consumption by end users or other systems|
In connect step you register a connection to the data source(s) and virtualize data from the source(s). The data virtualization stage involves the creation of a virtual database (VDB) that wraps selected tables and views from the source using metadata and normalizes source data types by mapping them to SQL Server-compatible types used by Querona. This VDB type directly utilizes the connection to data source and is called a pass-through virtual database. By design it does not support caching. Caching can be configured by creating an integration virtual database.
In this step you apply necessary labels to metadata, configure data masking and apply other data security settings.
In transform step you create an integration virtual database and pick a connection to the data processing engine that you wish to use for caching of data. Because Apache Spark engine is built-into Querona, it is an always-available engine that can be used by any integration virtual database to store and process cached data.