Data Scientist – infectious disease research – United States of America

Exciting opportunity to work with the multidisciplinary infectious disease research group EPPIcenter in San Francisco. The EPPIcenter is a dynamic team conducting international field studies in Africa and Latin America, state-of-the-art laboratory experiments generating high-dimensional data, and computational analyses synthesizing these data for research, training and public health purposes. We are based at Zuckerberg San Francisco General Hospital in the Division of HIV, Infectious Diseases and Global Medicine at UCSF, with many regional and international partners.

You will be a data specialist for the research group, with responsibilities spanning the collection, organization and analysis of data, as well as building and maintaining data systems. These will include:

  • Work with international field teams (mostly remotely) to collect, store and process data, including QC, for multiple studies
  • Build and maintain a database infrastructure to track biological samples, related metadata and derived experimental data across multiple studies
  • Organize, store and curate high-dimensional data sets, including genomic and immunological data generated locally and potentially with international partners, and align them with public repositories for dissemination

There is a strong social justice component to this work – your work will directly contribute to projects that impact millions of people. You’ll work with a leading infectious disease team that pushes boundaries and has high expectations of everyone on board. No prior knowledge of infectious diseases is required.

Description of the department

ABOUT THE DIVISION OF HIV, INFECTIOUS DISEASES AND GLOBAL MEDICINE

The Division of HIV, Infectious Diseases, and Global Medicine at the University of California, San Francisco (UCSF) is an internationally recognized leader in clinical care, research, and education. With its headquarters located at Zuckerberg San Francisco General Hospital (ZSFG), the Division has been ranked as the nation’s top facility for AIDS care for more than a decade. The Division’s work is frequently featured in the media, and the Division is often visited by international, national, state and local dignitaries and delegations. The Division consists of approximately 36 faculty members and 170 academic employees and staff. It is the largest division of ZSFG’s Department of Medicine (DOM), with a total expenditure budget of approximately $45 million.

The EPPIcenter aims to advance the understanding of infectious diseases to reduce global morbidity and mortality. We believe that the greatest success in the fight against infectious diseases will come from a highly interdisciplinary systems epidemiology approach, linking theoretical work, technology development, empirical data generation and collection, and analysis using statistical and mathematical models.

Qualifications required

  • Bachelor’s degree in Computer Science or Data Science, or in a related science with a specialization in computing or data, or equivalent experience.
  • Minimum of 3 years of related experience.
  • Intermediate or advanced knowledge of and experience with relational databases and data servers.
  • Demonstrated experience and ability to collaborate effectively with all levels of staff: technicians, students, professors and administrators.
  • Proven ability to understand research computing/data/CI needs, map use cases to requirements, and determine how systems/software/infrastructure can support those needs and meet those requirements. Demonstrated ability to develop and implement such solutions.
  • Proven ability to successfully work on multiple concurrent projects.
  • Extensive experience working in a complex computing/data/CI environment encompassing any or all of the following: HPC, data science infrastructure and tools/software, and scientific applications from various fields.
  • Self-motivated and able to work both independently and in a team. Able to learn effectively and meet deadlines.
  • Proven skills and experience in independently solving IT/data/CI problems using introductory and/or intermediate principles.
  • Proven communication and interpersonal skills. Demonstrated ability to communicate technical information to technical and non-technical staff at various levels of the organization and to external research and education audiences.
  • Demonstrated ability to interact regularly with management.
  • Intermediate knowledge of HPC / data science / CI.

Preferred qualifications

  • Familiarity with biological data ontologies, including those for immunological and genomic data
  • Extensive demonstrated experience in one or more of the following areas: optimization, benchmarking, HPC performance and power modeling, hardware, software and application analysis for HPC/data/CI.
  • Demonstrated ability to contribute research and technical content to grant proposals.
  • Advanced skills and demonstrated experience associated with one or more of the following: analyzing and researching the power and performance of HPC hardware and software; designing, modifying, implementing and deploying HPC, data science or CI applications and tools.

About UCSF

The University of California, San Francisco (UCSF) is a leading university dedicated to advancing global health through advanced biomedical research, higher education in the life sciences and health professions, and excellent patient care. It is the only campus in the 10-campus UC system dedicated exclusively to the health sciences. We bring together the world’s top experts in almost every area of health. We are home to five Nobel laureates who have advanced the understanding of cancer, neurodegenerative diseases, aging and stem cells.

