This is a 100% Contract position for 1 year with the possibility of extension or conversion to career status.
The mission of the Halicioglu Data Science Institute (HDSI) is to advance the scientific foundations for this exciting new field, and to support education and training of students who will be the new leaders in this area. Digital data have emerged as central to addressing critical societal needs and enhancing the quality of life in the coming decades. Data science touches virtually all aspects of life on our planet. At the campus level, UC San Diego already has tremendous strength in this area, ranging across all segments of the university. For this reason, the Institute has been established as an independent unit that works collaboratively with schools, divisions, departments, centers, and faculty and students across the entire campus. The Institute is lead by a Director who reports to an Oversight Committee chaired by the Chancellor. The Institute is the administrative home for the newly established undergraduate B.S. major and minor in Data Science, and will work with departments and the Academic Senate to create additional graduate programs in data science. The institute supports collaborations between researchers across campus and provides critical resources to support innovation in data science.
The Amariuta Lab focuses on methods development in the area of statistical genetics and population genomics. We create integrative approaches to infer mechanisms of genetic risk for complex traits and polygenic diseases. We specifically focus on the development of cross-population methods, as non-European individuals are historically severely underrepresented in genetics and genomics studies.
The incumbent will apply his/her skills as a seasoned, experienced IT research professional using computational, computer science, data science, and CI software research and development principles, with relevant domain science knowledge where applicable, along with professional programming concepts for medium-sized projects or portions of larger projects. S/he will develop and optimize a variety of computational, data science, and CI research tools and components; perform research on current and future HPC, data, and CI technologies, hardware and software projects; as well as work on algorithm development, optimization, programming, performance analysis and / or benchmarking assignments of moderate scope where the tasks involve knowledge of either domain / computer science research requirements and / or CI design / implementation requirements.
The incumbent will work on implementation and optimization of statistical and population genetics tools designed by the Amariuta Lab. This will involve benchmarking, time/memory optimization, and user interface design as well as the continued maintenance of these tools for use by the genomics community, such as on Github. The incumbent will also frequently process and manipulate genetic and genomics data from publicly available sources of data or approved biobanks. The incumbent will also maintain database organization of files, datasets, and software used by the Amariuta Lab. The incumbent will use standard genomics processing tools and interfaces, such as bash, plink, bedtools and design software in R, python, or other appropriate programming language with the ability to perform mathematical and statistical inference.
Bachelor's degree in Computer / Computational / Data Science / Statistics / Mathematics, or Domain Sciences with computer / computational / data specialization / programming / software development or equivalent experience.
Intermediate knowledge of HPC / data science / CI. Knowledge of unix and at least one programming language (e.g. R, python, C++, Java). Experience with software development, maintenance, benchmarking, and complex data structures.
Thorough experience working in a complex computing / data / CI environment encompassing all or some of the following: HPC, data science infrastructure and tools / software, and diverse domain science application base. Thorough experience working and programming/scripting on virtual machines and remote servers.
Demonstrated broad experience in one or more of the following: optimizing, benchmarking, HPC performance and power modeling, analyzing hardware, software, and applications for HPC / data / CI. Demonstrated experience designing automatic analysis pipelines for benchmarking code and performance analytics.
Advanced skills, and demonstrated experience associ
UC San Diego is an academic powerhouse and economic engine, recognized as one of the top 8 public universities by U.S. News and World Report. Innovation is central to who we are and what we do. Here, students learn that knowledge isn't just acquired in the classroom - life is their laboratory. UC San Diego's rich academic portfolio includes six undergraduate colleges, five academic divisions and five graduate and professional schools. The university's award-winning scholars are experts at the forefront of their fields with an impressive track record for achieving scientific, medical and technological breakthroughs.