Resources > FAQ

FAQ

Functionality

1. What data source connections does Qluster support?

‍Qluster has integrations with Amazon S3, MinIO, Google Cloud Storage, SFTP, and Dropbox. Qluster out of the box can read data from public addresses such as Google Sheets too.

2. What data destinations does Qluster support?

‍Postgres, S3, Google Cloud Storage, MinIO, and Snowflake.

3. Can data be transformed before arriving at their destination?

‍Yes, although Qluster is not designed to replace a traditional ETL tool, many of the common transformations can be achieved with Qluster.

4. What data formats does Qluster support?

‍We accept CSV, XLSX, JSON, JSONL. The files can be compressed and even GPG encrypted.

5. Which cloud environments does Qluster run on?

‍We currently support Google Cloud Services and AWS.

6. What infrastructure is required to run Qluster?

‍In Qluster SaaS deployment, you only need a Postgres database and an object storage layer such as AWS S3 or Google Cloud Storage.In Qluster enterprise deployment, a Kubernetes cluster is required in addition to the above.

Why use Qluster?

1. Why can't I require my customers or vendors to use a CSV template instead of Qluster for data ingestion?

While this might be fine for ad-hoc imports, companies consistently sending new data need more time and resources to update their data exports to match other systems manually. Qluster makes this process effortless and eliminates issues related to human error.

2. Does Qluster offer managed support services for rule building or
customization?

Yes, we can offer additional professional services to tailor the solution to your specific needs.

Data Security

1. What client data does Qluster keep?

a. The client's data belongs to the client.
b. The entire data flow lifecycle stays within your virtual private cloud in the on-prem deployment.
c. In the hosted version, ephemeral data specific to a data source may be used by a process, i.e., in the form of logs. This data stream is essential to the ingestion process and is retained for debugging for up to a few days, depending on the client's requirements.
d. In the hosted version, we can let you host the settings and logs in your infrastructure. Then there will be absolutely no traces of your data in our infrastructure except the metadata about your data.

2. How are the AI models trained?

Metadata from successful data imports will be used to train the AI model to help automate data quality rules and field mappings.

3. Does Qluster sell any client data to other companies?

Never have and never will! Client data security is our number one priority, and we take it very seriously.

Training Data

1. Is algorithm training required for each source?

No, training is required for the dataset, not for each individual data source.

2. Who is responsible for training the Qluster algorithm?

The Qluster team will handle the training for each client. This is not something clients need to worry about.

3. What is the minimum number of rows for a training file?

We recommend at least 20 rows of data to have an accurate baseline of your data. We can work with less, but it's not ideal.

4. What information from the training data file is kept by Qluster?

The training data is used to extract metadata about your data that explains how your data looks in aggregate. We use the training data to make our anomaly detection algorithm more accurate and better understand your typical data structure, shape, and other data properties. Qluster retains this metadata to enhance our product offering. We do not keep any of your actual data. Your data belongs to you.

Contact us!

FAQ

Functionality

Why use Qluster?

Data Security

Training Data

Questions?