Columbia University's Department of Systems Biology is seeking a highly motivated genomics staff member to join the Rabadan Lab. The incumbent will take part in cutting-edge projects centered on the theme of untangling the molecular biology of cancer. These projects, though varied, all approach this problem by analyzing exomic, genomic, and transcriptomic data from both public and private sources. The staff member will contribute to this task as well as develop visualization tools for these data. The candidate will work in a highly multidisciplinary environment and will support several projects from different lab members.
The incumbent will be coordinating research computational efforts and data management for the Center for Topology of Cancer Evolution and Heterogeneity. He or she will also be in charge of genomic data coordination for the Program for Mathematical Genomics (PMG) within the Department of Systems Biology.
Prototype, develop, improve, maintain, and apply tools and pipelines for the management and analysis of large amounts of new generation sequencing data---in particular, whole genome and whole exome cancer data.
Design, develop, and maintain databases of genetic variants identified through NGS in large collections of the individuals, both from private and public sources.
Develop novel web-based data visualization and user interfaces strategies for enabling effective use of the large data sets of sequence data and genetic variants.
Manage the AWS cloud computing environment and storage.
Manage the general IT administration and software/hardware needs for the lab, including purchasing and construction of custom hardware.
Design figures for scientific publications.
Lead the data sharing and working group of grant sponsored projects (U54) and update information on the shared website.
Attend scheduled grant sponsored meetings on a bimonthly/monthly basis.
Review and confirm quarterly invoices for computer service and monthly invoices for internal and Amazon storage and computing services.
Running genomic pipelines 30%
Web development 30%
Data management 15%
AWS managements 10%
IT administration 10%
Bachelor's in computer science, bioinformatics or related field; or equivalent in education and experience, plus two years of related work experience.
Master's degree in computer science and relevant quantitative experience is preferred.
Demonstrated computational and software development skills required. Experience in processing and analysis of large genomics data sets (WGS, WES, RNA-seq, or ChIP-seq) in bioinformatics/molecular biology setting. Ability to produce well designed, documented code in Python, R, Matlab or similar.
Proficiency in using build, test, version control, and documentation tools is a plus; Proficiency working in a Linux environment and with Unix shell scripting; Proficiency with the SGE cluster environment.
Ability to multi-task and work independently as well as on a team within a multidisciplinary environment.
Equal Opportunity Employer / Disability / Veteran
Columbia University is committed to the hiring of qualified local residents.
Internal Number: 512152
About Columbia University
Columbia University is one of the world's most important centers of research and at the same time a distinctive and distinguished learning environment for undergraduates and graduate students in many scholarly and professional fields. The University recognizes the importance of its location in New York City and seeks to link its research and teaching to the vast resources of a great metropolis. It seeks to attract a diverse and international faculty and student body, to support research and teaching on global issues, and to create academic relationships with many countries and regions. It expects all areas of the university to advance knowledge and learning at the highest level and to convey the products of its efforts to the world.