Work
Selected projects and platform contributions
These projects show how database systems, scientific platforms, distributed execution, and startup delivery connect in practice.
Flagship projects
Project
AWESOME Polystore
Heterogeneous database integration and query processing for relational, graph, text, and analytical systems.
Project
National Data Platform Search
Federated scientific-data discovery with metadata, ontology-driven search, and national-scale APIs.
Project
TemPredict
Wearable and multimodal data infrastructure for predictive-health research and clinical support.
Detailed contributions
San Diego Supercomputer Center / UC San Diego
AWESOME Polystore
A research and platform effort focused on making heterogeneous databases work as one system for scientific and operational workloads.
Problem
How can heterogeneous data sources be ingested, indexed, and queried without forcing every workload into a single storage model?
Approach
Built polystore ingestion and planning methods across databases, graph stores, text indexes, and analytical engines.
Contributions
- Designed heterogeneous ingestion and integration workflows.
- Developed query-processing methods for cross-store access.
- Contributed to patentable database-system ideas.
- Translated research concepts into platform-oriented system design.
Outcomes
- Enabled published work on polystore ingestion and query planning.
- Connected to patentable ingestion and query-processing methods.
San Diego Supercomputer Center / National Data Platform
National Data Platform Search
A search and discovery layer for scientific data platforms that emphasizes metadata, semantic access, and community-scale usability.
Problem
How can a national data platform make distributed scientific datasets discoverable without flattening their semantics or provenance?
Approach
Built federated-search services around metadata normalization, text and semantic retrieval, APIs, and policy-aware access.
Contributions
- Built or shaped search-facing data models for scientific discovery.
- Worked on metadata and semantic access patterns.
- Helped design APIs for community use.
Outcomes
- Connected distributed scientific catalogs to a reusable discovery layer.
- Aligned with the National Data Platform's AI-ready data and workspace model.
UCSF Osher Center for Integrative Health / UC San Diego
TemPredict
A system focused on ingesting and organizing multimodal health data for analysis, collaboration, and research workflows.
Problem
How do wearable and clinical data streams get structured so they can support reproducible analysis without overfitting the platform to one study?
Approach
Supported continuous wearable-sensor ingestion, harmonization, monitoring, data-quality control, and analytical workflows.
Contributions
- Supported scalable ingestion for wearable and multimodal data.
- Contributed to analytics-oriented data organization.
- Helped enable predictive-health research workflows.
Outcomes
- Enabled a published conference paper and a large-scale wearable-sensing study.
- Connected platform work to symptom-prediction and monitoring workflows.
UCSB Wilson Group / UC San Diego / SDSC
Quantum Data Hub
A research data platform for organizing materials-science information across discovery, curation, and reuse.
Problem
How can materials-science data remain discoverable and reusable across projects, workflows, and collaborators?
Approach
Built a collaborative data and analysis platform with metadata storage, FAIR search/access, JupyterHub analysis, and workflow logging.
Contributions
- Supported collaborative metadata design.
- Worked on research data discovery flows.
- Helped shape scientific workflow support.
Outcomes
- Published as a conference paper on collaborative quantum material science data management.
- Connected to a reusable quantum materials discovery and analysis workflow.
Advanced Database and Intelligence Lab / SDSC
NOURISH
A platform concept for combining conversational access, knowledge representation, and secure data services in health and research settings.
Problem
How can people ask complex public-health questions without requiring deep technical knowledge of the underlying systems?
Approach
Connected knowledge-based conversational engines to heterogeneous data integration and privacy-aware access patterns.
Contributions
- Supported secure data-service design.
- Contributed to knowledge-graph thinking for discovery.
- Helped make multimodal health data easier to explore.
Outcomes
- Aligned with the lab's AI-enabled query processing and decision-support direction.
Kaavo
Kaavo IMOD
Startup platform work focused on product architecture, application deployment across clouds, and engineering-team formation.
Problem
How should an early-stage cloud startup build a usable platform before requirements and market expectations are fully stable?
Approach
Build the product and engineering foundation with enough rigor to survive changing requirements while keeping delivery practical.
Contributions
- Helped build the multi-cloud management platform.
- Contributed to product development in a startup setting.
- Helped establish the India engineering operation.
Outcomes
- Established the bridge between startup product delivery and operational engineering.
- Supported a cloud-management product across deployment, provisioning, monitoring, and releases.