Senior Computational and Data Science Research Specialist (6028U), Berkeley Public Health - 46111
University of California Berkeley
Posted: 08-Dec-22
Location: Berkeley, California
Type: Full-time
Salary: Open
Internal Number: 3705269
About Berkeley
At the University of California, Berkeley, we are committed to creating a community that fosters equity of experience and opportunity, and ensures that students, faculty, and staff of all backgrounds feel safe, welcome and included. Our culture of openness, freedom and belonging makes it a special place for students, faculty and staff.
The University of California, Berkeley, is one of the world's leading institutions of higher education, distinguished by its combination of internationally recognized academic and research excellence; the transformative opportunity it provides to a large and diverse student body; its public mission and commitment to equity and social justice; and its roots in the California experience, animated by such values as innovation, questioning the status quo, and respect for the environment and nature. Since its founding in 1868, Berkeley has fueled a perpetual renaissance, generating unparalleled intellectual, economic and social value in California, the United States and the world.
We are looking for equity-minded applicants who represent the full diversity of California and who demonstrate a sensitivity to and understanding of the diverse academic, socioeconomic, cultural, disability, gender identity, sexual orientation, and ethnic backgrounds present in our community. When you join the team at Berkeley, you can expect to be part of an inclusive, innovative and equity-focused community that approaches higher education as a matter of social justice that requires broad collaboration among faculty, staff, students and community partners. In deciding whether to apply for a position at Berkeley, you are strongly encouraged to consider whether your values align with our Guiding Values and Principles, our Principles of Community, and our Strategic Plan.
At UC Berkeley, we believe that learning is a fundamental part of working, and our goal is for everyone on the Berkeley campus to feel supported and equipped to realize their full potential. We actively support this by providing all of our staff employees with at least 80 hours (10 days) of paid time per year to engage in professional development activities. To find out more about how you can grow your career at UC Berkeley, visit grow.berkeley.edu.
Departmental Overview
Berkeley Public Health (BPH) aims to improve population health, especially for the most vulnerable, through interdisciplinary collaborations, preeminent education, and transformational research. Established in 1943, BPH is a professional school on the UC Berkeley campus that comprises six academic divisions and nearly 30 research centers and programs. Our department's values include social justice, health as a right, challenging conventional thought, embracing diversity, and creating meaningful impact. We honor our principles of community by centering and valuing everyone in our community; prioritizing prevention while remaining grounded in social justice; promoting safety and respect; practicing self-care and kindness; and remaining optimistic, hopeful, and committed to change. Learn more at publichealth.berkeley.edu
The Forum is a multi-stakeholder initiative focused on advancing the regulatory sciences for the treatment of NAFLD/NASH, PSC, liver fibrosis, HBV, HIV, TAVI, and Ocular Diseases. The Forum brings together experts in transplantation medicine, infectious diseases, virology, immunology, and diagnostics from academia, regulatory agencies, industry, and professional societies to discuss, deliberate, and generate consensus on issues such as disease definitions, standardization of diagnostic approaches, and clinical trial design.
As the Forum's lead data scientist, this position will develop novel statistical methods and analyses for use in biomedical and public health projects of The Forum for Collaborative Research's Data & Analysis Center (Forum's D&A Center) at the University of California Berkeley (UCB) School of Public Health (SPH). The Center provides a curated repository of clinical data and an innovative set of analytical tools to facilitate answering critical questions of drug safety and efficacy in novel cost-effective ways that will reduce time and cost of drug evaluation while maintaining or enhancing the scientific basis of that evaluation. The Center also works to translate and disseminate new knowledge through convening opportunities to discuss 'Innovation in Data Use and Analysis' for Forum stakeholders, including academia, regulatory agencies, industry, patient organizations, and professional societies, to share lessons learned and provide opportunities for cross-comparison of analytic approaches and a framework for training in novel analytic approaches in a disease-specific context.
This position is located in either Berkeley, CA or Washington, DC.
Application Review Date
The First Review Date for this job is: December 21, 2022
Responsibilities
Applies advanced High Performance Computing (HPC) concepts to clinical data research and development in order to plan, design, develop, modify, debug, deploy and evaluate highly complex HPC (software and / or hardware), data science, computational science, or continuous integration (CI) software and technologies, alone or in combination.
May require working with collaborators on the development of such concepts and software. Also, responsible for working with Berkeley Research IT and other campus units to ensure smooth operations of data systems administered by the university and available to the Forum's D&A Center.
Works in collaboration with stakeholders and datacenter staff to formulate logic for new and highly complex systems and new algorithms. Applies highly complex programming principles. Performs highly complex analysis and tests, and debugs highly complex software and hardware.
Manage collaboration with relevant science experts to initiate large and complex research projects under the Forum's D&A Center.
Performs or directs highly complex HPC, computational and data modeling, performance and integration testing.
Works with Forum collaborators and leadership to create, maintain, and update databases with clinical trials data. Also oversees the development and implementation of computational and data analysis. In addition, is responsible for maintaining and, when needed, developing analytics software, algorithms, or tests, as well as optimizing and developing complex CI software. May work within Laboratory Information Management System (LIMS) and software automation that supports sponsors or regulators providing data.
Document data management processes and procedures for regulatory filings.
Works closely with regulatory partners to ensure data standards are acceptable to regulators. Works according to CDISC standards for enhanced accessibility, interoperability, and reusability of data.
Optimize SQL queries, create/maintain database indexes. Also interprets results of profiling, identifies inefficiencies, and develops algorithmic, coding, and workflow optimizations to enhance performance and capabilities.
Create specific reports, develop or enhance workflows, write and run queries to extract data, including writing advanced PL/SQL scripts.
Monitor data entry and database performance to meet regulatory requirements. Also performs in-depth evaluation of usage modes, capabilities, characteristics and performance of multiple highly complex databases.
Troubleshoot issues and recommend improvements.
Curate data, including authentication, archiving, management, preservation, retrieval, and formatting.
Oversees ongoing collaboration between Forum stakeholders, collaborating statisticians, and internal organization representatives participating in projects under the Forum's D&A Center.
May represent the organization as part of a team at national and international meetings, conferences and committees.
Provide regular updates to Forum leadership, the Center's Steering Committee, and Working Groups on database/dataset status and overall progress.
Initiates and contributes to research proposals with Forum stakeholders. These could include proposals written in partnership with internal units, other cross-campus units and centers, and external collaborators from industry, other academic institutions, regulatory agencies, and patient organizations.
Performs regular audits to ensure data integrity and quality.
Ensure that information is backed up, secured, and protected.
Ensures that all data complies with legal regulations.
Enforces and manages research and development project plans with interested collaborators, stakeholders and users as applicable. May serve as technical lead for multiple research and development projects of moderate to broad scope and lead a team of researchers and technical staff. Understands and applies advanced research and development practices, community standards and department policies and procedures.
Attends trainings as needed and required by the university and unit.
May supervise data entry, database management, and research analysis of students, support Forum staff and other statisticians that collaborate on Forum projects.
May supervise data entry, database management and research analysis of students, support staff and / or lower level analysts.
Other duties as assigned by the Forum leadership.
Required Qualifications
Advanced knowledge of research computing (e.g., HPC / data science / CI).
5+ years of clinical data management experience in a pharmaceutical or medical device/diagnostic company or a clinical research organization.
Strong understanding of good laboratory practice and knowledge of regulatory requirements as they pertain to data management.
Highly advanced skills and demonstrated experience associated with one or more of the following: HPC hardware and software power and performance analysis and research; design, modification, implementation and deployment of HPC or data science or CI applications and tools of large-scale scope.
Demonstrated ability to regularly, effectively communicate with unit-level management.
Demonstrated ability to initiate research proposals and work as part of a team to secure funding.
Demonstrated ability to communicate technical information to technical and non-technical staff members at various levels in the organization and to external research, academic, or private sector partners.
In depth skills and experience in independently resolving complex computing / data / CI problems using introductory and / or intermediate principles.
Self-motivated and works independently and closely as part of a small internal team but also able to work with large diverse networks.
Advanced experience working in a complex computing / data / CI environment encompassing all or some of the following: HPC, data science infrastructure and tools / software, and diverse domain science application base.
In depth ability to successfully work and / or lead multiple concurrent projects.
Demonstrated research and technology project leadership and management skills.
In depth experience assessing a broad spectrum of technical and research needs and demands, establishing priorities, and delegating and / or leading development of solutions to meet such needs.
Demonstrated advanced experience in one or more of the following: optimizing, benchmarking, HPC performance and power modeling, analyzing hardware, software, and applications for HPC / data / CI.
Experience in machine learning, causal inference, and interest in application of related estimators in big data situations.
Demonstrated working knowledge of a statistical programming language such as R or Python, as well as general issues in computational statistics.
Experience formulating research questions, developing research plans, and writing papers describing research findings.
Education/Training:
Bachelor's degree in related area and / or equivalent experience / training.
Preferred Qualifications
Experience working with regulatory agencies, academic institutions or with pharmaceutical, biotech or diagnostic companies highly preferred.
Master's degree in Computer / Computational / Data Science, or Domain Sciences with computer / computational / data specialization and/or equivalent experience/training preferred.
Ph.D. in one of a variety of fields, including but not limited to Statistics, Biostatistics, or related Data Science field and/or equivalent experience/training.
Salary & Benefits
This is an exempt, monthly paid position. The annual salary is commensurate with experience within the range of $112,100.00 - $216,700.00.
This is a one-year, full-time (40 hours/week), Contract appointment.
For information on the comprehensive benefits package offered by the University visit:
Please submit your cover letter and resume as a single attachment in the Resume section of your application.
Equal Employment Opportunity
The University of California is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status. For more information about your rights as an applicant see:
The University of California was chartered in 1868 and its flagship campus - envisioned as a "City of Learning" - was established at Berkeley, on San Francisco Bay. Today the world's premier public university and a wellspring of innovation, UC Berkeley occupies a 1,232-acre campus with a sylvan 178-acre central core. From this home its academic community makes key contributions to the economic and social well-being of the Bay Area, California, and the nation.