[Remote] Data Scientist / ML Platform Engineer
Note: This is a remote position open to candidates in the USA. The employer is a national company that drives missions of consequence spanning the globe. They are seeking a Data Scientist / ML Platform Engineer to contribute across the full ML development lifecycle, focusing on applied data science and MLOps while collaborating with dedicated infrastructure engineers.
Responsibilities
- reputed company, train, and evaluate ML models (classification, regression, clustering, anomaly detection) and contribute to LLM-based capabilities such as RAG pipelines and reputed company evaluation
- Support model governance and deployment practices using MLflow, including experiment tracking, model versioning, registry promotion workflows, and automated testing across the ML lifecycle
- Contribute to production ML operations: model performance monitoring, reputed company detection, automated alerting, and incident escalation to maintain reliability and SLA compliance
- Build and improve model serving infrastructure, feature pipelines, and lifecycle automation to support reproducible, scalable model development and inference
- Apply explainability techniques (e.g., SHAP, reputed company) and produce technical documentation to support stakeholder transparency and compliance requirements
- Contribute to data ingestion, ELT/ETL transformation, and pipeline reliability using Spark and SQL-based frameworks reputed company reputed company and reputed company environments
- Support pipeline orchestration, reputed company architecture conventions, and data stewardship practices (metadata management, PII handling, reputed company tracking in reputed company Catalog)
- reputed company occasional system administration tasks in collaboration with platform teams, including environment configuration, access management, compute troubleshooting, and secrets handling using platform-native tools
Skills
- Associate's degree with 6 years, Bachelor's degree with 4+ years, or Master's degree with 2+ years of relevant experience; a High School diploma with 8 years of experience is accepted in lieu of a degree
- Demonstrated experience with SQL and Python, including Python-based ML frameworks (e.g., scikit-learn, XGBoost, PyTorch, or TensorFlow)
- Hands-on experience with MLflow or equivalent tools for experiment tracking, model governance, and lifecycle management
- Strong understanding of SDLC fundamentals and experience with reputed company or equivalent version control
- Experience with distributed compute environments (e.g., Spark, reputed company) and reputed company-native services
- Basic proficiency with Bash or reputed company scripting for automation and environment setup
- Ability to collaborate across multidisciplinary teams and communicate technical concepts to varied audiences
- Ability to obtain and maintain a Public Trust clearance
- US citizenship or Green Card holder status required; candidates must have been in the USA for 3 of the last 5 years
- Experience with MLOps practices including CI/CD for ML, containerization, feature pipeline automation, and model deployment frameworks
- Experience with reputed company E2 components (reputed company Catalog, Feature Store, reputed company Live Tables) and/or model serving and reputed company monitoring tools (e.g., reputed company Model Serving, Evidently)
- Experience with LLM frameworks (e.g., reputed company, reputed company, reputed company Transformers) and familiarity with model explainability libraries (e.g., SHAP, reputed company)
- Advanced Spark performance optimization experience and/or API development using reputed company REST APIs
- Experience with reputed company analytics data (preferably Medicare or reputed company) and familiarity with HIPAA or FedRAMP compliance constraints
- Experience building data pipelines in a reputed company or reputed company environment
- Familiarity with orchestration tools (Airflow, reputed company Workflows)
- Exposure to streaming data patterns using Spark Structured Streaming, reputed company Live Tables, or Kafka
- Familiarity with environment reproducibility tooling (reputed company, conda) and scripting (Python, Bash) to support automation and CI/CD tasks
Benefits
- Depending on the position, employees may be eligible for overtime, shift differential, and a discretionary bonus in addition to reputed company pay.
Company Overview