See all roles

Principal Data Scientist (AI)- REMOTE (US) (Houston, TX, US, 77086)

Work from home Full-time role Hiring

Responsibilities

Hexagon's ETQ division is seeking a hands-on Data Scientist to build predictive models, implement Generative AI and Agentic AI features, and architect data-driven solutions for our document-based compliance management platform. This role requires a technical expert who can develop, deploy, and maintain ML systems in production environments.

  • Build and deploy Generative AI features using foundation models (AWS Bedrock, OpenAI, Anthropic Claude) and RAG architectures with vector databases for compliance document understanding
  • Design agentic AI systems that autonomously handle compliance workflows, document review, regulatory mapping, and multi-step reasoning tasks
  • Implement comprehensive LLM evaluation frameworks with automated pipelines, custom metrics, benchmark datasets, and safety guardrails ensuring regulatory compliance
  • Build end-to-end MLOps pipelines for model training, deployment, monitoring, versioning, and automated retraining with drift detection
  • Develop predictive models for compliance risk scoring, regulatory change impact, anomaly detection, and time-series forecasting
  • Write production-quality Python code for data processing, feature engineering, API development (FastAPI/Flask), and ETL/ELT workflows
  • Lead A/B experiments and product analytics to measure AI feature impact and drive data-driven decision-making
  • Create explainability frameworks (SHAP/LIME) and monitoring dashboards ensuring transparency and regulatory adherence
  • Collaborate with cross-functional teams to translate business needs into ML solutions and communicate insights to stakeholders Python (5+ years): Production-level experience with Pandas, NumPy, scikit-learn, XGBoost, TensorFlow/PyTorch, Hugging Face Transformers, FastAPI/Flask, MLflow, and pytest SQL: Advanced proficiency with complex queries, window functions, and optimization Machine Learning & NLP: Strong foundation in supervised/unsupervised learning, deep learning, document understanding, text classification, and semantic analysis Generative AI & LLMs: Hands-on experience with foundation models (GPT, Claude, Llama), prompt engineering, RAG architectures, and vector databases (Pinecone, Weaviate, Chroma) MLOps & ModelOps: End-to-end experience with ML pipelines, experiment tracking (MLflow, W&B), model versioning, feature stores, drift detection, CI/CD for ML, and Docker containerization LLM Evaluation: Experience with evaluation frameworks (RAGAS, DeepEval), custom metrics, benchmark datasets, and human-in-the-loop validation Cloud & AWS: Experience with AWS services including SageMaker, Bedrock, S3, Lambda, EC2, and CloudWatch Statistics & Experimentation: Strong foundation in statistics, A/B testing, causal inference, and experimental design Visualization: Proficiency with Tableau, Power BI, or Python visualization libraries Education / Qualifications Experience & Education
  • 7+ years in data science, ML engineering, or related roles
  • 3+ years building NLP/generative AI applications and implementing MLOps in production
  • Bachelor's or Master's degree in Data Science, Computer Science, Statistics, or related field (PhD preferred)
  • Track record of deploying ML systems processing large-scale datasets with proper monitoring and governance Preferred Qualifications
  • Experience with agentic AI frameworks (LangGraph, LangChain, AutoGen, CrewAI)
  • Knowledge of Life Sciences/regulated industries (FDA, EMA, ISO, GxP) and compliance management systems
  • Familiarity with big data tools (Spark, Databricks, Snowflake), orchestration (Airflow, Kubeflow), and monitoring tools (Datadog, Prometheus)
  • Experience with LLM fine-tuning, document processing libraries, multi-modal AI, or distributed training
  • Understanding of ML governance, bias detection, model risk management, and data privacy regulations (GDPR, CCPA, HIPAA)
  • Experience working in agile environments with Jira
  • AWS ML certifications or similar credentials Key Competencies
  • Strong communication skills explaining complex models to technical and non-technical audiences
  • Ability to work independently and collaboratively in fast-paced environments
  • Proven ability to convert POCs into production-grade solutions
  • Understanding of ethical AI and building trustworthy, explainable systems for regulated environments Hexagon will NOT be able to provide visa sponsorship at any time during employment. If you will require visa sponsorship at any point in time, please refrain from applying. #LI-KK1 #LI-Remote About Hexagon Hexagon is a global leader in digital reality solutions, combining sensor, software and autonomous technologies. We are putting data to work to boost efficiency, productivity, quality and safety across industrial, manufacturing, infrastructure, public sector, and mobility applications. Hexagon’s Asset Lifecycle Intelligence division helps clients design, construct, and operate more profitable, safe, and sustainable industrial facilities. We empower customers to unlock data, acc

Apply tot his job Apply To this Job

You might like

Southwest Airlines Careers Remote (Principal Data Scientist) $20-25 An Hour

Work from home Full-time role

Principal Data Scientist- Global Marketing Measurement and Optimization job at McCormick & Company in Hunt Valley, MD

Work from home Full-time role

Pricing Analyst - (100% Remote)

Work from home Full-time role

Principal Data Scientist - Hybrid in MN or DC, Remote Elsewhere

Work from home Full-time role

Principal Consultant – FinTech Platform Delivery

Work from home Full-time role

Associate Principal Data Scientist (Remote) Job at Blizzard Entertainment in San Francisco

Work from home Full-time role

(USA) Principal, Data Scientist

Work from home Full-time role

Principal Engineer, Developer Experience (DevEx)

Work from home Full-time role

Remote Prior Authorization Pharmacy Technician

Work from home Full-time role

Principal Engineer & Software Architect

Work from home Full-time role

Associate Director, Software Engineering – Data & Platform Modernization

Work from home Full-time role

Sr. Director, Analyst, Chief Information Officer Specialist, Midsize Business Transformation – Remote US in Arlington, TX

Work from home Full-time role

Immediately Require Certified English Teacher (Remote) in Kenner, LA

Work from home Full-time role

MANAGER DE RAYON EPICERIE (H/F)

Work from home Full-time role

Experienced Part-Time Remote Data Entry Clerk – Precision Typing and Data Management

Work from home Full-time role

Associate Creative Director

Work from home Full-time role

Experienced Customer Service Representative – Wells Fargo Work from Home Opportunity with Competitive $26 Hourly Rate

Work from home Full-time role

Analyst, Digital Marketing

Work from home Full-time role

Remote Customer Service Representative – Female‑Focused Pet Care Support Specialist at arenaflex

Work from home Full-time role

Senior Client Partner - Life Sciences (Pharma & Biotech Focus)

Work from home Full-time role