Data Scientist - Job Opportunity at Homethrive, Inc.

Remote, US
Full-time
Mid-level
Posted: July 19, 2025
Remote
USD 110,000 - 140,000 per year based on mid-level experience requirements, specialized AI/ML skills including Graph RAG systems, remote work arrangement, and venture-backed healthcare technology company compensation patterns

Key Responsibilities

Design and implement sophisticated graph data models and ontologies to revolutionize caregiving intelligence, establishing the foundation for next-generation AI systems that will transform how families navigate complex healthcare decisions
Build and maintain enterprise-grade data ingestion pipelines that synthesize structured and unstructured healthcare data into actionable knowledge graphs, directly impacting the company's ability to deliver personalized member experiences
Develop, fine-tune, and deploy production-ready Graph RAG systems that serve as the intelligent backbone for AI-powered caregiving solutions, ensuring contextually rich and explainable responses that build member trust
Lead cross-functional collaboration with product and engineering teams to seamlessly integrate advanced AI systems into customer-facing products, driving measurable improvements in member engagement and care outcomes
Acquire, clean, and transform complex healthcare datasets from diverse sources to fuel both traditional ML models and knowledge graphs, establishing data quality standards that enable reliable AI decision-making
Conduct comprehensive exploratory data analysis to uncover hidden patterns and relationships within caregiving data, translating findings into actionable insights that inform product strategy and member interventions
Design and implement statistical and machine learning models that solve complex business problems in the caregiving domain, complementing Graph RAG systems to deliver comprehensive AI-powered solutions
Create compelling data visualizations and executive dashboards that effectively communicate the value of graph-based AI systems to both technical teams and business stakeholders, influencing strategic decision-making

Requirements

Education

BS/MS in Computer Science, Data Science, another related discipline or equivalent experience

Experience

3+ years of related professional experience required

Required Skills

Strong proficiency in programming languages such as Python and SQL Proven, hands-on experience designing, building, and deploying Retrieval Augmented Generation (RAG) systems in a production environment Deep expertise in graph database technologies (e.g., Neo4j, Neptune, TigerGraph) and graph data modeling/ontology design Strong background in Natural Language Processing (NLP), Natural Language Understanding (NLU), and vector databases (e.g., Pinecone, Weaviate) Experience with Large Language Models (LLMs) and state-of-the-art NLP libraries and frameworks (e.g., Hugging Face, Transformers, LangChain, LlamaIndex) Expertise in machine learning algorithms, statistical modeling, and data mining techniques Experience with data visualization tools (e.g., Tableau, Power BI) Familiarity with cloud computing platforms (e.g., AWS, GCP, Azure) and big data technologies Strong knowledge of Data Warehousing (Snowflake) and Data Modeling techniques A successful history of integrating source systems and building data pipelines, especially for complex AI systems Self-directed and comfortable supporting the data needs of cross-functional teams, systems, and products in a high-pressure environment Passion for learning & results-oriented
Advertisement
Ad Space

Sauge AI Market Intelligence

Industry Trends

The healthcare technology sector is experiencing unprecedented growth in AI-powered personalization, with companies increasingly investing in knowledge graph technologies to create more intelligent and contextually aware healthcare solutions. This trend is particularly pronounced in the aging care and family caregiving space, where complex relationships between patients, caregivers, and healthcare providers require sophisticated data modeling approaches. Retrieval Augmented Generation (RAG) systems are becoming the gold standard for healthcare AI applications, as they provide the explainability and accuracy required in medical contexts. Organizations are moving beyond simple chatbots to implement graph-enhanced RAG systems that can navigate complex healthcare ontologies and provide evidence-based recommendations. The convergence of large language models with domain-specific knowledge graphs is creating new opportunities for healthcare technology companies to deliver more precise and trustworthy AI solutions. This trend is driving significant investment in data science roles that combine traditional ML expertise with cutting-edge graph technologies and NLP capabilities.

Role Significance

Likely part of a 3-8 person data science and AI engineering team within a growing healthcare technology startup, with direct collaboration across product management, engineering, and potentially clinical teams
This is a mid-level individual contributor role with significant technical ownership and cross-functional influence. The position carries substantial responsibility for architecting core AI infrastructure that directly impacts product capabilities and member experiences, indicating a role that bridges senior technical execution with strategic product development.

Key Projects

Development of comprehensive healthcare knowledge graphs that model complex relationships between patients, conditions, interventions, and outcomes Implementation of production-scale Graph RAG systems that power customer-facing AI assistants and recommendation engines Creation of data pipeline architectures that integrate disparate healthcare data sources into unified, AI-ready datasets Design and deployment of machine learning models that predict care needs and optimize intervention timing for aging adults and their caregivers

Success Factors

Ability to translate complex healthcare domain knowledge into effective graph data models and ontologies that capture the nuanced relationships inherent in family caregiving scenarios Proficiency in bridging the gap between cutting-edge AI research and production-ready systems that can scale to serve thousands of members while maintaining accuracy and explainability Strong communication skills to effectively convey the value and capabilities of advanced AI systems to non-technical stakeholders, including product managers, executives, and potentially clinical advisory boards Adaptability to work in a fast-paced startup environment where priorities can shift rapidly and the ability to deliver results under pressure is essential for company growth and member satisfaction

Market Demand

Very High - The combination of healthcare AI specialization, graph database expertise, and RAG system experience represents a highly sought-after skill set with limited talent supply in the current market

Important Skills

Critical Skills

Graph database expertise and ontology design skills are absolutely essential as they form the foundation of the company's AI architecture. The ability to model complex healthcare relationships through graph structures directly impacts the effectiveness of all downstream AI applications and member experiences. Production-ready RAG system development experience is crucial because this technology serves as the primary interface between the company's knowledge base and member-facing AI assistants. The quality and reliability of these systems directly affect member trust and product adoption. Python programming proficiency and cloud platform experience are fundamental requirements for implementing and maintaining the scalable data infrastructure needed to support the company's growth trajectory and handle increasing volumes of healthcare data. Natural Language Processing and Large Language Model expertise are critical for developing AI systems that can understand and respond to the complex, nuanced communication patterns common in healthcare and family caregiving contexts.

Beneficial Skills

Healthcare domain knowledge and familiarity with clinical workflows would accelerate the development of more effective AI solutions and improve collaboration with potential clinical advisory teams Experience with healthcare data standards and compliance requirements (such as HIPAA) would be valuable for ensuring that AI systems meet regulatory requirements while maintaining data privacy and security Product management and user experience design understanding would enhance the ability to develop AI features that truly meet member needs and drive engagement with the platform Startup experience and comfort with rapid iteration cycles would be beneficial for thriving in the fast-paced environment where technical priorities may shift based on member feedback and market opportunities

Unique Aspects

This role offers the opportunity to pioneer the application of Graph RAG systems specifically in the healthcare and caregiving domain, potentially establishing new industry standards for AI-powered family support services
The position combines cutting-edge AI research with meaningful social impact, as the technology being developed directly supports families navigating complex caregiving challenges and helps older adults maintain their independence
Working with a leadership team that has proven experience scaling healthcare businesses to multi-billion-dollar valuations provides unique exposure to both technical excellence and strategic business development
The role offers significant technical ownership in building core AI infrastructure from the ground up, providing valuable experience in architecting production-scale AI systems that serve real-world healthcare needs

Career Growth

Progression to senior individual contributor roles typically occurs within 2-3 years, while advancement to management or director-level positions generally requires 4-6 years of continued growth and demonstrated leadership impact

Potential Next Roles

Senior Data Scientist or Principal Data Scientist roles with expanded scope over AI product strategy and technical team leadership AI/ML Engineering Manager positions overseeing data science teams and AI product development initiatives Director of Data Science or Head of AI roles in healthcare technology companies, with responsibility for setting technical vision and building organizational capabilities Healthcare AI consultant or advisor roles leveraging deep expertise in graph-based AI systems and healthcare domain knowledge

Company Overview

Homethrive, Inc.

Homethrive, Inc. operates as a well-funded healthcare technology startup focused on revolutionizing family caregiving and aging-in-place solutions. The company has secured significant venture capital backing from prominent healthcare-focused investment firms including 7Wire Ventures, Human Capital, and Pitango, indicating strong investor confidence in their market approach and growth potential.

Positioned as an emerging leader in the family caregiving technology space, with a unique value proposition that combines high-touch human services with high-tech AI-powered solutions. The company appears to be in a growth phase, actively expanding their technology capabilities and team to scale their impact in the aging care market.
Operating as a remote-first organization with a distributed team across the United States, allowing access to top talent nationwide while serving a national market of family caregivers and aging adults who need support to remain in their homes
Characterized by urgency, collaboration, and continuous improvement, reflecting the startup environment where rapid iteration and results-oriented execution are essential. The leadership team's experience building multi-billion-dollar businesses suggests a culture that values both innovation and operational excellence.
Advertisement
Ad Space
Apply Now

Data Sources & Analysis Information

Job Listings Data

The job listings displayed on this platform are sourced through BrightData's comprehensive API, ensuring up-to-date and accurate job market information.

Sauge AI Market Intelligence

Our advanced AI system analyzes each job listing to provide valuable insights including:

  • Industry trends and market dynamics
  • Salary estimates and market demand analysis
  • Role significance and career growth potential
  • Critical success factors and key skills
  • Unique aspects of each position

This integration of reliable job data with AI-powered analysis helps provide you with comprehensive insights for making informed career decisions.