Data Engineer SQL

Remotely
Full-time

We are looking for a strong Data Engineer who can work with a large amount of data and can prepare it for the ML pipelines, provide data analysis and visualization, and continuously do storage optimization tasks. Experience with a different type of SQL databases (MySQL/PostgreSQL/MSSQL), knowledge of JOINS, inner selects and index specification/creation to speed up requests. 


Responsibilities: 

  • Design, develop, and maintain scalable data pipelines and ETL processes to support analytics and business intelligence. 
  • Write complex SQL queries to extract, transform, and analyze large datasets from multiple sources. 
  • Build and maintain dashboards, reports, and other data visualizations to deliver actionable insights to business stakeholders. 
  • Collaborate with cross-functional teams including engineering, product, and business units to understand data needs and deliver robust solutions. 
  • Perform exploratory data analysis (EDA) to uncover trends, anomalies, and opportunities for deeper investigation. 
  • Ensure data quality and integrity through rigorous validation and data governance practices. 
  • Develop scripts and tools using languages like Python or R to automate workflows and conduct advanced data analyses. 
  • Support data infrastructure improvements and contribute to the overall data architecture and design. 
  • Document processes, data models, and pipelines for internal knowledge sharing and future reference. 


Requirements: 

  • Proficiency in SQL with the ability to write efficient, optimized queries for complex datasets. 
  • Strong analytical skills with experience working with large-scale data environments (e.g., data warehouses, relational databases). 
  • Programming experience in Python, R, or another data-focused language. 
  • Familiarity with ETL tools and frameworks (e.g., Airflow, dbt, Apache NiFi). 
  • Experience with BI platforms such as Tableau, Power BI, Looker, or similar. 
  • Solid understanding of data modeling, data warehousing concepts, and database design. 
  • Experience working with cloud-based data platforms (e.g., AWS Redshift, Google BigQuery, Snowflake). 
  • Strong problem-solving abilities and attention to detail. 
  • Excellent communication skills with the ability to translate technical findings into business insights. 
  • Bachelor's degree in Computer Science, Data Science, Statistics, Engineering, or a related field (or equivalent work experience).