Search...

Data Specialist

Skills

About the Role

You will turn raw, complex data into trusted, AI-enhanced intelligence that powers data products. You will explore and profile customer data, translate domain knowledge into data infrastructure requirements, and design end-to-end ETL/ELT pipelines with automation and self-healing. You will run automated and manual quality control, document assumptions, and build AI features such as automated insights, anomaly detection, and natural-language interfaces. You will evaluate AI/ML outputs against domain expectations, design human-in-the-loop checks, and deliver interactive analyses, reports, and visualizations to customers. You will also identify opportunities to improve internal data cleaning and quality-assessment tools and help automate repetitive data tasks.

Requirements

  • 3+ years of experience in a data-driven role or equivalent in data-related research projects
  • Bachelor’s or Master’s degree in science, mathematics, engineering, or a data-driven field
  • Competence in Python, R, or an equivalent programming language
  • Competence in at least two of: SQL/PostgreSQL, NumPy/pandas, Dagster/Airflow/dbt, AWS/Azure
  • Ability to translate complex data findings into clear, compelling narratives
  • Strong communication skills to decompose operational workflows into repeatable steps
  • Passion for data integrity and transforming raw inputs into trusted datasets
  • Self-starter with demonstrated ability to learn new technologies quickly
  • Experience with generative AI tools (e.g., AWS Bedrock, LangChain) is a plus
  • Experience with testing frameworks (e.g., pytest) is a plus

Responsibilities

  • Understand the data and the domain
  • Partner with client delivery and customers to translate domain knowledge into data infrastructure requirements
  • Explore raw customer data and profile files, columns, and statistical characteristics
  • Design and implement end-to-end AI-enhanced ETL/ELT pipelines
  • Coordinate with stakeholders and customers to resolve missing information and discrepancies
  • Run quality control on data and data products through automated tests and targeted manual review
  • Document assumptions and decisions to maintain traceability
  • Build AI into data products including automated insights, anomaly detection, and AI-assisted data quality checks
  • Evaluate AI/ML outputs and design human-in-the-loop verification
  • Deliver interactive data analysis, data quality reports, statistical analyses, and visualizations
  • Contribute improvements to internal tools for data cleaning and data quality assessment

Benefits

  • 401(k) plan with up to 5% employer contribution
  • Fully funded health benefits including vision and dental from day one for whole family
  • Up to 24 weeks paid parental leave, 4-week paid ramp-back, and a $10,000 family forming benefit
  • Flexible vacation policy with no set annual limit, Summer Fridays, extended December holiday period
  • Flexible work options with access to offices and ability to work remotely as needed
  • Opportunity to work remotely from eligible locations for up to two months per year
  • Individualized mentoring, growth opportunities, and access to learning programs
  • Dedicated wellness advisor
  • Transit benefits and in-person events for team connection