A comprehensive data processing and analytics platform for managing SQL workflows across multiple engines with AWS EMR integration
Everything you need for enterprise-grade data processing and analytics
Execute queries across Hive, Presto, Spark SQL, MySQL, and PySpark with seamless integration
Create and manage complex data pipelines with dependency tracking and visual DAG representation
Comprehensive execution model with scheduled, on-demand, and fault-tolerant recovery
Intelligent S3-based data validation with adaptive algorithms based on file size
Native AWS EMR and S3 integration for scalable cloud-based data processing
Upload and manage custom User-Defined Functions with JAR file support
Built with industry-leading technologies for performance and reliability
Backend Development
REST API Framework
Frontend Dashboard
Database
Cloud Services
Build Tool
Get up and running with Data Phantom in minutes
Set up MariaDB and configure your environment
brew install mariadb
brew services start mariadb
Create database and run the DDL script
mysql -u root -p data_phantom < database.ddl
Build the application and start the server
mvn clean install
java -jar target/annihilator-data-phantom-1.0-SNAPSHOT.jar server config-dev.yml