Skip to main content

Apache Drill

Apache Drill is a schema-free, distributed SQL query engine that enables you to explore and analyze various data sources, including NoSQL databases, cloud storage, and file systems, without the need for predefined schemas. Drill provides flexibility and speed in querying diverse data formats such as JSON, Parquet, and more.

Timbr + Apache Drill

Integrating Apache Drill with Timbr allows you to leverage Timbr's semantic graph capabilities while maintaining Drill's ability to query heterogeneous data sources. This integration enables advanced features like semantic querying, data virtualization, and integrated data management, providing powerful insights and efficiencies.

Getting Started with Apache Drill

If you haven't yet set up Apache Drill, follow these steps to get started:

To connect Timbr to your Apache Drill datasource, you need to use the Apache Drill JDBC driver. For information on the Drill JDBC driver and how to set it up, refer to the Apache Drill JDBC Driver documentation.

Connecting Timbr to Apache Drill in the Timbr Platform

Once you've set up Apache Drill and have the JDBC driver ready, it's time to connect Timbr to Drill. Follow the steps below for a detailed tutorial on connecting Timbr to Apache Drill:

  1. Navigate to Manage Datasources: From the Timbr interface, click on the Manage tab and select Datasources.

  2. Add New Datasource: On the top right, click the Add New Datasource button. Alternatively, you can click the big add button beneath the knowledge graphs and datasources tabs.

Add new datasource

  1. Fill in Connection Details: A pop-up window will appear where you need to provide the relevant connection details to connect Apache Drill to Timbr.

There are two options in Timbr to enter the connection information: Simple or JDBC Url.

On the top right of the pop-up window , the toggle can be switched from Simple to JDBC Url in order to connect the datasource using the relevant JDBC URL.

tabs

Simple Tab Connection Configuration

  1. Select Simple Tab: Ensure the Simple tab is selected.

  2. Fill in the Following Details:

    • Database Type: Select Apache Drill.
    • Datasource Name: Enter a name for your datasource.
    • Description: Provide a description of the datasource (Optional).
    • Hostname / IP: Enter the hostname or IP address of your Apache Drill datasource.
    • Port: Default is 10000.
    • Username: Enter your Apache Drill username.
    • Password: Enter your Apache Drill password.
    • Additional Parameters: Add any additional connection parameters using ; as separators.

JDBC URL Tab Configuration

  1. Select JDBC URL Tab: Ensure the JDBC URL tab is selected.

  2. Fill in the Following Details:

    • Database Type: Select Apache Drill.
    • Datasource Name: Enter a name for your datasource.
    • Description: Provide a description of the datasource (Optional).
    • Username: Enter your Apache Drill username.
    • Password: Enter your Apache Drill password.
    • JDBC URL: Enter the JDBC connection URL (e.g., jdbc:drill:drillbit=[hostname]:port;).

jdbc tab

Save and Test Connection

  1. After entering the details, click the Save button to save the information, connecting the Apache Drill datasource to the Timbr environment.
  2. Once saved, on the top right, click the Test connection button to verify the datasource’s connectivity, ensuring it is active and responsive.

For a comprehensive guide on connecting Timbr to various datasources, refer to the detailed documentation: