Data Specialist
Skills
About the Role
You will turn raw, complex data into trusted, AI-enhanced intelligence that powers data products. You will explore and profile customer data, translate domain knowledge into data infrastructure requirements, and design end-to-end ETL/ELT pipelines with automation and self-healing. You will run automated and manual quality control, document assumptions, and build AI features such as automated insights, anomaly detection, and natural-language interfaces. You will evaluate AI/ML outputs against domain expectations, design human-in-the-loop checks, and deliver interactive analyses, reports, and visualizations to customers. You will also identify opportunities to improve internal data cleaning and quality-assessment tools and help automate repetitive data tasks.
Requirements
- 3+ years of experience in a data-driven role or equivalent in data-related research projects
- Bachelor’s or Master’s degree in science, mathematics, engineering, or a data-driven field
- Competence in Python, R, or an equivalent programming language
- Competence in at least two of: SQL/PostgreSQL, NumPy/pandas, Dagster/Airflow/dbt, AWS/Azure
- Ability to translate complex data findings into clear, compelling narratives
- Strong communication skills to decompose operational workflows into repeatable steps
- Passion for data integrity and transforming raw inputs into trusted datasets
- Self-starter with demonstrated ability to learn new technologies quickly
- Experience with generative AI tools (e.g., AWS Bedrock, LangChain) is a plus
- Experience with testing frameworks (e.g., pytest) is a plus
Responsibilities
- Understand the data and the domain
- Partner with client delivery and customers to translate domain knowledge into data infrastructure requirements
- Explore raw customer data and profile files, columns, and statistical characteristics
- Design and implement end-to-end AI-enhanced ETL/ELT pipelines
- Coordinate with stakeholders and customers to resolve missing information and discrepancies
- Run quality control on data and data products through automated tests and targeted manual review
- Document assumptions and decisions to maintain traceability
- Build AI into data products including automated insights, anomaly detection, and AI-assisted data quality checks
- Evaluate AI/ML outputs and design human-in-the-loop verification
- Deliver interactive data analysis, data quality reports, statistical analyses, and visualizations
- Contribute improvements to internal tools for data cleaning and data quality assessment
Benefits
- 401(k) plan with up to 5% employer contribution
- Fully funded health benefits including vision and dental from day one for whole family
- Up to 24 weeks paid parental leave, 4-week paid ramp-back, and a $10,000 family forming benefit
- Flexible vacation policy with no set annual limit, Summer Fridays, extended December holiday period
- Flexible work options with access to offices and ability to work remotely as needed
- Opportunity to work remotely from eligible locations for up to two months per year
- Individualized mentoring, growth opportunities, and access to learning programs
- Dedicated wellness advisor
- Transit benefits and in-person events for team connection
