The Data Engineer will be part of a diverse interdisciplinary team of computer scientists and neuroscientists with broad expertise spanning computer science, neuroscience, psychology, psychiatry, neuropsychology, cognitive neuroscience, neuroimaging, bioinformatics, biostatistics, epidemiology, and neurophysiology. The project will involve data processing and coordination for new multisite data collection networks, as well as the development and application of stratification tools to identify and validate biomarkers that predict outcome trajectories in individuals at high risk of developing psychosis.
This position joins the data management, processing and archiving team. The successful candidate will help develop, deploy and maintain the bioinformatics infrastructure of several large projects. Our group (spanning MGH and BWH) is also actively developing new technologies to characterize brain structure and function, which has led to the design of state-of-the-art image analysis pipelines capable of robustly processing hundreds of neuroimaging datasets. The role of the engineer will be to maintain and enhance existing image processing pipelines and to develop new pipelines based on the latest research with an emphasis on version tracking, data provenance, and high performance computing.
PRINCIPAL DUTIES AND RESPONSIBILITIES:
Relevant activities include, but are not limited to, the following:
Maintain and enhance existing image processing pipelines.
Design new image processing pipelines, with an emphasis on version tracking, data provenance, and high performance computing.
Develop neuroinformatics tools to track data provenance and support project management.
Test and evaluate a range of neuroimaging packages to determine their suitability for research goals.
Interact regularly and directly with neuroscientists from within and outside the DPACC to assist them with neuroimaging data analysis using a range of tools, including FSL, SPM, Slicer and other specialized packages.
Design, implement, test, maintain and support applications to capture, manage, archive and monitor multi-site, multi-modal study data. Applications may include but are not limited to study monitoring systems, data management systems, workflow execution and monitoring systems, interactive viewers, and reporting tools.
Support web and application server configuration and deployment.
Support data engineering efforts, including database and API design, data extraction/transformation/load, and data aggregation/integration.
Containerize and deploy software and workflows on local high performance computing platforms and cloud computing infrastructure (AWS).
Qualifications Required:
Bachelor's Degree in Computer Science, Mathematics, Physical Sciences, Engineering, or related field
Excellent programming skills in Python, Bash, and MATLAB
Superior Linux/Unix skills and comfort with command line programs - the ability to get new programs and packages running, overcoming hurdles as they arise, is particularly helpful.
Familiarity with standard software development practices: version control (Git), pull requests, code reviews, and issue and release management
Ability to work in an interdisciplinary, diverse, and international team in a highly collaborative and intellectually challenging environment.
Excellent oral and written communication skills
Basic knowledge of neuroscience and neuroanatomy.
Understanding of structural, diffusion, and functional Magnetic Resonance Imaging.
Preferred:
Master's Degree in Computer Science, Mathematics, Physical Sciences, Engineering, or related field
Experience with database management systems (e.g., SQL, PostgreSQL, MongoDB, CouchDB)
Experience with Linux container engines (e.g., Docker, rkt) and container orchestration systems (e.g., Kubernetes)
Experience with JavaScript libraries for interactive data visualization (e.g., D3, Recharts, Chart.js)
Experience with at least one web framework for building single-page web applications (e.g., React, Angular, Vue)
EEO Statement: Massachusetts General Hospital is an Equal Opportunity Employer. By embracing diverse skills, perspectives and ideas, we choose to lead. Applications from protected veterans and individuals with disabilities are strongly encouraged. Partners HealthCare is acting as an Employment Agency in relation to this vacancy.
MGH Institute of Health Professions, founded by Massachusetts General Hospital in 1977, is an innovative and independent graduate school in Boston that is a member of Partners HealthCare. A progressive leader in developing comprehensive models of health care education, the MGH Institute prepares advanced practice professionals in the fields of nursing, physical therapy, occupational therapy and communication sciences and disorders through a distinctive combination of academic study, clinical practice, and research. More than 1,200 students are enrolled in graduate level and certificate programs, with an increasing number of courses available online. The Institute is accredited by the New England Association of Schools and Colleges (NEASC).