Virtual Data Assistant
Virtual Data Assistant
  • Virtual Data Assistant
  • Overview
    • What we do
    • Features
      • Data Sources
      • Datasets
      • Dashboards
      • Workbooks
        • Expectations Book
        • Query Book
        • Document Book
          • API Documentation
  • Quickstart
    • VDA in Docker
  • How-to-guides
    • Data Catalog
      • DataSource
      • Datasets
      • Exploration
    • Data Quality
      • Expectations
        • Templated Expectations
        • Custom Expectations
      • Profiling
      • Reconciliation
    • Data Analytics
      • Data Modeling
      • Visualization
      • Data Ingestion
    • Governance
Powered by GitBook
On this page
  1. Overview
  2. Features

Data Sources

PreviousFeaturesNextDatasets

Last updated 11 months ago

A Datasource is an entity within the Virtual Data Assistant that serves as a container for a collection of metadata. Metadata refers to information about datasets, such as data source location, schema, data types, and other relevant properties. Essentially, a Datasource is like a virtual folder that groups related datasets together, making it easier for users to manage and access data efficiently.

Datasource creation using Connectors

Connectors are modules or plugins that establish connections to specific data sources or databases. VDA offers multiple connectors to popular datasources e.g PostgreSQL, MySQL etc. When users select a connector and provide the necessary connection details, it establishes a link to the data source, allowing access to the data within that source.

Searching Datasets within a Datasource

Once datasets are created and organized within a Datasource, users can perform searches to find specific entities or datasets within that Datasource. This search functionality simplifies data discovery, especially in cases where multiple datasets are stored within the same Datasource.An user can search using specific filters to find appropriate datasets.

List of Available Connectors

  • CSV

and anything built over it

(through dbapi or sql_alchemy)

Amazon Athena
Amazon EventBridge
Amazon Glue
Amazon Redshift
Apache Cassandra
Apache Druid
Apache Hive
dbt
Delta Lake
Elasticsearch
Google BigQuery
IBM DB2
Kafka Schema Registry
Microsoft SQL Server
MySQL
Oracle
PostgreSQL
PrestoDB
Trino (formerly Presto SQL)
Vertica
Snowflake
Create a datasource