1. Serves as the biostatistician and the primary data manager for the division in a bioinformatics leadership capacity, as well as the administrator for a large departmental data repository currently in the planning stages of development, which will house all current and future research study data. 2. Writes policy and standard operating procedures for data monitoring, data management, data repository development, data verification and oversees compliance with information security etc. 3. Designs data architecture, establishes data dictionaries, uses statistical programming and transitions old databases into the new RedCap system. 4. Manages and constructs limited use de-identified datasets for permitted, shared use, overseeing proper authorization for data use agreements and secure release of information from the repository according to regulatory policy. 5. Works with research project teams to determine needs for coding conventions, data summary sheets and develops scoring algorithms for raw data sets. 6. Converts data stored in existing databases (i.e. Excel, MS Access, SQL etc.) into one universal format for RedCap database use. 7. Uses statistical computer programs (e.g. SAS, SPSS) and existing database programs to construct and manage limited datasets relevant for immediate analyses in CND projects. 8. Develops data conventions for managing incomplete data sets, runs query checks and resolves edits and out of range values with study teams, documenting all processes; overseeing this process when done by others. 9. Works in conjunction statisticians at collaborating research institutions (i.e. Strong Star, Mount Sinai, Yale etc.) to ensure uniformity in data entry and storage methods for shared projects across sites. 10. Conducts interim statistical analyses on study data to assist investigators with go, no-go decision making for their research projects, as well as final analyses for publication of results. 11. Provides expert advice with regard to data management strategies, data storage strategies and forms development to division staff, overseeing this work done by others. 12. Serves as subject matter expert and data liaison for all aspects of the division's data collection and management, including: statistical sections of study protocols, final study reports, final statistical analyses for research projects and data repository utilization. 13. Summarizes complex neuroimaging, EEG, laboratory, behavioral and other statistical data into tables and graphs for papers, presentations and meetings. 14. Supervises two staff members directly and indirectly influences and oversees the work standards of all research staff within the division who enter, manage, verify and monitor data.15. Coordinates and oversees the set-up and utilization of standardized electronic assessment tools and cognitive testng batteries on VA computers. 16. Develops and maintains data systems designed to ensure that databases used for research projects comply with administrative policies, procedures and requirements set forth by both the incumbent as well as the Research Office, Institutional Review Boards, Office of Research Oversight and the data security guidelines outlined by the Information Security and Compliance Offices, as well as overseeing proper authorization for data use agreements and secure release of information from the data repository according to regulatory policy.17. Prepares regular workload reports for Core review and performs other related duties as assigned.
Required Skill/ability 1: Familiarity with RNAseq pipelines (not limited to deseq2, EdgeR and Limma packages). At least completed 1 course on single cell RNAseq data analysis (with at least 8 contact-hours). Familiarity with proteomics pipelines (not limited to MAxQuant, Proteome Discoverer, MSFragpipe).
Required Skill/ability 2: Ability to independently troubleshoot analytic pipelines, and custom build pipelines to suit experimental needs. Ability to adopt existing code/pipelines from open-source forums (Ex.GitHub) and implement for in-house use.
Required Skill/ability 3: Demonstrated ability in Bioinformatics pipeline development, including RNAseq and proteomics analyses, including differential expression analysis, network analysis, gene set enrichment and pathway analysis. Quantitative academic focus and >2 years of data analysis/programming experience; or equivalent combination of education and experience.
Required Skill/ability 4: Extensive knowledge of and ability to apply standard software development principles, theories, concepts and techniques to data analysis. Strong programming skills in R, and/or Python. Demonstrated ability with data handing and processing using Amazon Cloud services (AWS). Managing large data sets.
Required Skill/ability 5: Excellent written and oral communication skills. Ability to effectively communicate with bioinformatics personnel/experts from other institutions to independently plan analytic pipelines. Ability to assemble high-quality figures for grants and publications. Ability to assemble slides and posters for talks/presentations
Preferred Education: Master's Degree in Biostatistics or Statistics AND two years of experience; or equivalent combination of education and experience, in the analyses of mass spectrometry proteomics and RNAseq data, including bulk RNAseq and single cell RNAseq.
Work Week: Standard (M-F equal number of hours per day)
Posting Position Title: Data Manager, Biostatistics
University Job Title: Data Manager, Biostatistics
Preferred Education, Experience and Skills: Master's Degree in Biostatistics or Statistics AND two years of experience; or equivalent combination of education and experience, in the analyses of mass spectrometry proteomics and RNAseq data, including bulk RNAseq and single cell RNAseq.
Master's Degree in statistics, bioinformatics, epidemiology or a related field and 5 years of related experience or an equivalent combination of education and experience.
Yale University is an American private Ivy League research university located in New Haven, Connecticut. Founded in 1701 in the Colony of Connecticut, the university is the third-oldest institution of higher education in the United States.