Job Summary
Our Client has an immediate opening for a Data Scientist with a strong background in data engineering and a solid understanding of data science principles. The ideal candidate will play a critical role in designing, developing, and maintaining our data infrastructure, while also adding expertise to enable advanced analytics and machine learning initiatives.
This position is based in Redmond, WA, and we are able to hire remote candidates in the following states: CA, CO, FL, GA, MO, NY, OR, SC, TN, TX, WA.
Job Responsibilities
Data Pipeline Development
- Design, implement, and maintain scalable data pipelines to collect, process, and store data from various sources.
- Ensure data quality, accuracy, and consistency throughout the pipeline.
Data Modeling
- Design and implement data models for predictive analytics, machine learning, and data exploration.
- Optimize data structures and storage to support efficient querying and analysis.
Data Integration
- Work closely with cross-functional teams to integrate data from diverse sources, including databases, APIs, and external data providers.
- Develop and maintain ETL processes to transform and enrich raw data into actionable insights.
Performance Tuning
- Monitor and optimize the performance of data pipelines and databases to meet business requirements.
- Identify and resolve bottlenecks and performance issues.
Continuous Learning and mentoring
- Stay up-to-date with the latest advancements in data engineering and data science technologies.
- Share knowledge with team members.
Requirements
Must Have
- 3+ years experience in SQL Query Design, SQL Performance Tuning and Query Optimization
- 3+ years of relevant experience in Data Warehouse Design, Data Warehouse Technical Architectures, Development and Implementation
- 3+ years of relevant experience in ETL Development, ETL Implementation, Unit Testing, Troubleshooting and Support of ETL Processes
- 3+ years of relevant experience with the application of Data Science principles and data modeling.
Knowledge and Skills
- Proficiency in SQL Query Design and Implementation
- Strong Experience with Relational Data Warehouse Systems
- Data Warehouse Management Systems
- Optimization by Indexing, Partitioning and Denormalization
- Strong Ability to build and optimize data sets, ‘big data’ data pipelines and architecture
- Knowledge of data science concepts, machine learning algorithms, and statistical analysis.
- Programming skills: Python required (bonus for Java or C#)
- Strong analytical and problem-solving skills
- BONUS: Experience with Pandas, scikit-learn and Multi-agent systems (MAS)
- BONUS: Experience working at scale in a production environment with Personally Identifiable Information (PII) data
Benefits:
- Compensation: $140,000-160,000 base pay
- Paid Vacation Time and Paid Holidays
- Medical/Vision/Dental Insurance, Voluntary Life & AD&D Insurance, Short-Term & Long-Term Disability, Critical Illness & Accident Insurance
- 401(k) with employer matching
- Hybrid/remote with flexible work schedule