The Department of Ophthalmology at Stanford University School of Medicine is seeking a highly motivated, hard-working, and professional Data Scientist to facilitate research efforts in ophthalmology. The incumbent will be part of the department of Ophthalmology; however, the position will be in a collaborative environment, engaging with other Stanford faculty and staff across multiple departments, including Biomedical Data Science, Research IT, and Research Informatics Center. The incumbent will work with a combination of structured and unstructured (text, imaging) data from several sources, including Stanford's STARR and STARR-OMOP clinical research databases, the ophthalmology IRIS (Intelligent Research InSight) national clinical data registry, the AllOfUs national cohort, commercial and Medicare claims data, national survey data, and other sources.
The position will require an incumbent who is comfortable working with some independence; consulting with and advising investigators to refine research questions, define hypotheses and project objectives, design studies and devise analysis plans; and working with project team members-including clinicians, trainees, and other statisticians/informaticists-to implement analysis plans and publish findings. The incumbent must be proficient at balancing involvement in multiple simultaneous projects and prioritizing to manage competing priorities. The incumbent will work closely with others to interrogate databases to create analytic files, perform quality control and data cleaning, and manage and analyze data. The incumbent must be an excellent and timely communicator, able to present results in oral and written form to clinical investigators.
Work directly with investigators and independently identify appropriate data analytic approaches; assist in study design and proposal development
Create analytic files with detailed documentation. Prepare data for analysis by cleaning, identifying cohorts, reshaping data, creating new variables, merging multiple data tables, creating and maintaining new databases as needed.
Implement data analyses using predictive modeling approaches (machine-learning, deep-learning), or inferential statistical methods as appropriate to the project
Develop reusable and well-documented code for all projects, that can be maintained in a repository (e.g. GitHub) for collaborative use
Quickly learn new skills as needs arise, such as new programming or statistical packages
Communicate and present results for investigators using graphs and tables.
Develop oral and written dissemination of findings for conference presentations and peer-reviewed journal articles.
* - Other duties may also be assigned
Strong background in machine learning, biostatistics, and bioinformatics
Intellectually curious; willing and eager to learn new skills
Experience with large datasets and database use
Experience with analysis of real-world observational health data (e.g., electronic medical records, insurance claims)
Manipulation and analyses of complex high-dimensional data
Ability to perform careful data cleaning and preparation, including identifying and handling data discrepancies, duplicates, missing values, outliers, etc; developing cohorts of patients based on inclusion and exclusion criteria, such as those based on billing code diagnoses, age or other demographics, length of follow-up, or other characteristics; creating new variables, including coding relevant outcomes, combining sparse variables, normalizing/standardizing variables; merging datasets on multiple key values; reshaping data from long to wide or vice versa as the befits the analysis needs; loading data into analysis programs, saving data into different file formats
Experience with at least 2 of the following: 1) Machine learning predictive models (gradient boosted trees, random forest etc.); 2) Deep learning neural networks, transfer learning; 3) Hierarchical/multilevel modeling, propensity score matching/weighting
Experience with free-text data (e.g., natural language processing) is a plus, or else willingness to learn
EDUCATION & EXPERIENCE (REQUIRED):
Master's degree in biostatistics, statistics or related field and at least 3 years of experience.
KNOWLEDGE, SKILLS AND ABILITIES (REQUIRED):
Proficient in R (preferred), or alternatively either SAS or STATA for statistical analyses and visualization.
Proficient in SQL
Python experience, including packages such as Jupyter Notebook, matplotlib, pandas, scikit-learn, and either tensorflow/keras or pytorch or both.
Able to use GitHub, write reusable and well-documented code
Familiarity with using cloud computing platforms for data analysis, such as Google Cloud Platform and/or Amazon Web Services
Outstanding ability to communicate in written and oral English how data analyses were performed, to both technical and non-technical audiences.
Demonstrated excellence in at least one area of expertise, which may include statistical methodologies such as missing data, survival analysis, or informatics; statistical computing; database design (e.g., Oracle databases, SQL); predictive modeling (machine learning and deep learning).
CERTIFICATIONS & LICENSES:
Frequently perform desk-based computer tasks, seated work, and use light/ fine grasping.
Occasionally stand, walk, and write by hand, lift, carry, push pull objects that weigh up to 10 pounds.
* - Consistent with its obligations under the law, the University will provide reasonable accommodation to any employee with a disability who requires accommodation to perform the essential functions of his or her job.
May work extended or non-standard hours based on project or business cycle needs.
Stanford is an equal opportunity employer and all qualified applicants will receive consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other characteristic protected by law.
Copyright 2021 Jobelephant.com Inc. All rights reserved.
Located between San Francisco and San Jose in the heart of Silicon Valley, Stanford University is recognized as one of the world's leading research and teaching institutions. Leland and Jane Stanford founded the University to "promote the public welfare by exercising an influence on behalf of humanity and civilization." Stanford opened its doors in 1891, and more than a century later, it remains dedicated to finding solutions to the great challenges of the day and to preparing students for leadership in a complex world. The University's thriving diverse community is comprised of nearly 7000 undergraduate students, 12000 graduate students, 2000 faculty members, 1900 postdoctoral scholars, and over 11,000 academic and administrative staff in seven schools including several interdisciplinary research centers and institutes. The campus spreads over 8000 contiguous acres and nearly all undergraduates live on campus. Stanford offers bachelor's and master's degrees in addition to doctoral degrees (PhD, MD, DMA and JD) plus a number of professional and continuing education programs and certifications. More at http://facts.stanford.edu and http://www.stanford.edu. Stanford is an equal empl...oyment opportunity and affirmative action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected by law. Stanford also welcomes applications from others who would bring additional dimensions to the University’s research, teaching and clinical missions.