Job Summary

Our client has an immediate opening for a Data Scientist with a strong background in data engineering and a solid understanding of data science principles. The ideal candidate will play a critical role in designing, developing, and maintaining the client's data infrastructure, while also contributing expertise to enable advanced analytics and machine learning initiatives.

This position is based in Redmond, WA, and we are able to hire remote candidates in the following states: CA, CO, FL, GA, MO, NY, OR, SC, TN, TX, WA.

Job Responsibilities

Data Pipeline Development

Design, implement, and maintain scalable data pipelines to collect, process, and store data from various sources.
Ensure data quality, accuracy, and consistency throughout the pipeline.

Data Modeling

Design and implement data models for predictive analytics, machine learning, and data exploration.
Optimize data structures and storage to support efficient querying and analysis.

Data Integration

Work closely with cross-functional teams to integrate data from diverse sources, including databases, APIs, and external data providers.
Develop and maintain ETL processes that transform and enrich raw data so it can yield actionable insights.

Performance Tuning

Monitor and optimize the performance of data pipelines and databases to meet business requirements.
Identify and resolve bottlenecks and performance issues.

Continuous Learning and Mentoring

Stay up to date with the latest advancements in data engineering and data science technologies.
Share knowledge with team members.

Requirements

Must Have

3+ years of relevant experience in SQL Query Design, SQL Performance Tuning and Query Optimization
3+ years of relevant experience in Data Warehouse Design, Data Warehouse Technical Architectures, Development and Implementation
3+ years of relevant experience in ETL Development, ETL Implementation, Unit Testing, Troubleshooting and Support of ETL Processes
3+ years of relevant experience applying Data Science principles and data modeling

Knowledge and Skills

Proficiency in SQL Query Design and Implementation
Strong Experience with Relational Data Warehouse Systems
Data Warehouse Management Systems
Optimization by Indexing, Partitioning and Denormalization
Strong ability to build and optimize data sets, ‘big data’ pipelines, and architectures
Knowledge of data science concepts, machine learning algorithms, and statistical analysis
Programming skills: Python required (bonus for Java or C#)
Strong analytical and problem-solving skills
BONUS: Experience with Pandas, scikit-learn, and multi-agent systems (MAS)
BONUS: Experience working at scale in a production environment with Personally Identifiable Information (PII) data

Benefits

Compensation: $140,000 to $160,000 base pay
Paid Vacation Time and Paid Holidays
Medical/Vision/Dental Insurance, Voluntary Life & AD&D Insurance, Short-Term & Long-Term Disability, Critical Illness & Accident Insurance
401(k) with employer matching
Hybrid/remote with flexible work schedule