USC Information Technology Services (ITS) is seeking a Data Scientist with a strong background in quantitative research methods and computer programming to join its Center for Advanced Research Computing (CARC). The Data Scientist contributes to the work of the team through his/her knowledge of statistics and programming that is both broad and deep.
The ideal candidate should have experience with multiple steps in the data life cycle, including data acquisition, data processing, data analysis, and visualization, preferably in a research environment; and brings technical programming expertise in algorithmic approaches using machine learning, deep learning, and/or other computational methods in artificial intelligence. S/he should have a history of successful collaborations, particularly in a service delivery environment.
The ITS vision aligns strategy, business, and services; affirms ITS cultural values; empowers cross-functional teamwork; embraces world-class best practices; and promotes innovation, excellence, agility, and efficiency. To achieve this vision, ITS is committed to providing a modern technology infrastructure that is resilient and delivers the performance necessary to meet the demands of a growing customer base, training in the latest technologies for its highly productive and motivated workforce, outstanding customer experience, and technology services that are aligned with the universityâs mission to provide exceptional learning opportunities for students. ITS is creating a workplace where employees can develop cutting-edge skills, take pride in the services they provide, and have access to the roles and career paths that align to their abilities and potential.
We are looking for top talent to join us on our journey.
USCâs ITS organization represents a diverse and talented team, committed to supporting a collaborative culture and delivering secure and innovative IT services, core to the mission of USC. ITS values accountability, excellence, and commitment to exceptional customer experience. ITS strives for a supportive and inclusive culture that encourages employees to do their best work every day and where individuals are recognized and celebrated for their contributions.
USC is the leading private research university in Los Angelesâa global center for arts, technology, and international business. With more than 47,500 students, we are located primarily in Los Angeles but also in various US and global satellite locations. As the largest private employer in Los Angeles, responsible for $8 billion annually in economic activity in the region, we offer the opportunity to work in a dynamic and diverse environment, in careers that span a broad spectrum of talents and skills across a variety of academic and professional schools and administrative units. As a USC employee and member of the Trojan Familyâthe faculty, staff, students, and alumni who make USC a great place to workâyou will enjoy excellent benefits, including a variety of well-being programs designed to help individuals achieve work-life balance.
Come join the ITS team and work as a trusted partner in shaping an environment of innovation and excellence for the university.
The candidate for the position of Data Scientist must meet the following qualifications:
Five years of experience in a relevant field such as data science, information technology, cognitive engineering, etc.
Proficiency in using statistical computer languages, such as R, Python, SQL, etc., to manipulate data and draw insights from large data sets.
Knowledge of current data modelling tools.
Knowledge of a variety of machine-learning techniques and algorithms, such as clustering, decision tree learning, artificial neural networks, etc.
Experience with common data science toolkits.
Experience working with and creating data models and data architecture.
Experience using data visualization tools such as Tableau, ArcGis, and D3.
Experience working with relational (e.g., MySQL, SQL Server, Oracle, Snowflake, Redshift) and non-relational (e.g., Mongo, NoSQL) databases.
Knowledge of applied statistical concepts and skills such as distributions, statistical testing, regression, etc.
Experience with scripting and programming in several languages.
Excellent written and verbal communication skills. Ability to provide both detailed information and summaries to management-level individuals and groups.
Experience developing customer relationships and delivering customer-focused service.
Proven problem-solving and decision-making skills with the ability to uncover root causes and evaluate different solution options.
Proven ability to successfully manage and prioritize multiple projects and plans, implement project specifications, report project status, identify delays, and raise concerns.
Exceptional interpersonal skills, including demonstrated ability to interact credibly with a variety of stakeholders including researchers, IT staff, IT leadership, and university administrators.
Awareness of compliance regulations (HIPAA, FERPA, CUI, and FISMA) surrounding research data including federal requirements and granting agency standards.
Ability to develop positive working relationships and a strong rapport with team members.
The ideal candidate for the position of Data Scientist has the following qualifications:
Advanced degree (PhD preferred) in a relevant field, such as computer science, data science, statistics, computational science and engineering, etc.
Demonstrated knowledge of the research process, proposal lifecycle, data lifecycle, and workflows.
Proficiency in using build, test, version control, and documentation tools is a plus; Proficiency working in a Linux environment and with Unix shell scripting; Proficiency with HPC cluster environment.
Familiarity with emerging Cloud Capabilities on Azure, GCP and/or AWS; Experience with command-line scripting, data structures, and algorithms.
Familiarity with how to optimize code to run in parallel environments, including those that leverage GPUs.
Expert level knowledge of the extraction, transformation, loading and manipulation of large data and databases (e.g. multi-terabyte files).
In-depth understanding in any of the following numerical techniques:Â causal inference, selection bias, dimensionality reduction (Singular value decomposition, Principal component analysis)
Extensive programming experience in Machine Learning tools such as Tensorflow, PyTORCH, Scikit, CNTK/Microsoft Cognitive Toolkit, Power AI, Theano, Caffe, etc.
Understanding of Machine Learning/AI methods such as random forest, neural networks, Markov models, etc.
Experience with any of the following:Â Compilers, Makefiles, and common build chains (autoconf/automake, CMake, pip, Lmod, Spack).
Demonstrated computational and software development skills required; Ability to produce well designed, documented code in Python, R, Matlab or similar
Experience in processing and analysis of large genomics data sets (WGS, WES, RNA-seq, or ChIP-seq) in bioinformatics/molecular biology setting is a plus.
Demonstrated experience conducting research in a higher education environment.
More than five years of experience in a relevant field such as data science, information technology, cognitive engineering, etc.
Published in the areas of artificial intelligence or data science.
THE WORK YOU WILL DO
Reporting to the Director of Center for Advanced Research Computing (CARC), the Data Scientist is responsible for identifying and implementing technical solutions for meeting researcher statistical and data science needs, both as an individual contributor and as part of a team. The Data Scientist will assist USC research community with the use of technology in research, including but not limited to large data manipulation and management, complex statistical analysis and modeling, data mining and other research methodologies. He/she exhibits a high level of independence, judgment, intellectual curiosity, and technical skills. The primary responsibilities include:
The Data Scientist:
Works directly with faculty on statistical and data science projects.
Serves the data needs of faculty and other researchers, identifying and applying computational and statistical tools, and advising and training researchers in the use of these tools.
Resolves short-term research-related questions and problems, or takes on faculty research projects that may involve statistical analysis and programming, or large/complex data sets management solution development.
Stays abreast of new developments in statistical methods and procedures, artificial intelligence, and computer programming used in top journals; stays current with new versions of statistical and data science software packages (e.g., Stata, R, Python, SAS)
Works independently with sound judgment, directly accountable for accuracy of methods and results.
Often works on multiple projects simultaneously, balancing short and long-term projects to meet deadlines and faculty needs.
Ensures adherence to policies related to the use of data from external sources.
Participate in a larger group to provide workshops on the use of data and data manipulation tools.Â This will include creating documentation of how to use tools in our supported environments such.
Collaborates with other team members through knowledge sharing and promotion of best practices (e.g., code review, technology showcase).
Serves as a statistical and computing resource for other members of the HPC team.
Contributes to an inclusive environment that values differences by building and maintaining collaborative relationships with team members, peers, and ITS leaders. Actively embodies ITS values and behaviors, including accountability, ethics, and best-in-class customer service.
Performs other related duties as assigned or requested. The university reserves the right to add or change duties at any time.
Come join the USC ITS team and work as a trusted partner in shaping an environment of innovation and excellence.
Minimum Education: Bachelor's degree, Combined experience/education as substitute for minimum education Minimum Experience: 2 years Minimum Field of Expertise: Experience using statistical computer languages (R, Python, SQL, etc.) to manipulate data and draw insights from large data sets. Experience working with and creating data models and data architecture, and using data visualization tools (Tableau, ArcGis, D3). Knowledge of current data modeling tools and various machine-learning techniques and algorithms (clustering, decision-tree learning, artificial neural networks, etc.). Experience scripting and programming in several languages, and with common data science toolkits. Proficient use of query languages (e.g. SQL, MDX) and experience working with relational (e.g., MySQL, SQL Server, Oracle, Snowflake, Redshift) and non-relational (e.g., Mongo, NoSQL) databases. Knowledge of applied, statistical concepts and techniques skills, including distributions, statistical testing, regression, and more. Excellent written and oral communication skills. Ability to provide both detailed information and summaries to management-level individuals and groups. Experience developing customer relationships and delivering customer-focused service. Proven problem-solving and decision-making skills, and the ability to uncover root cause and evaluate different solution options.
USC is the leading private research university in Los Angeles—a global center for arts, technology and international business. With more than 47,500 students, we are located primarily in Los Angeles but also in various US and global satellite locations. As the largest private employer in Los Angeles, responsible for $8 billion annually in economic activity in the region, we offer the opportunity to work in a dynamic and diverse environment, in careers that span a broad spectrum of talents and skills across a variety of academic and professional schools and administrative units. As a USC employee and member of the Trojan Family—the faculty, staff, students, and alumni who make USC a great place to work—you will enjoy excellent benefits, including a variety of well-being programs designed to help individuals achieve work-life balance.