Home Office, Home Office, USA
12 hours ago
Data Curator - Childhood Cancer Data Initiative
REQ#: RQ213543
Public Trust: None
Requisition Type: Regular

Your Impact

Own your opportunity to work alongside federal civilian agencies. Make an impact by providing services that help the government ensure the well-being of U.S. citizens.

Job Description

Job Summary:

We are seeking a highly skilled Data Curator to join our team and support the National Cancer Institute’s (NCI’s) Childhood Cancer Data Initiative (CCDI). The primary goal is to develop a Childhood Cancer Data Ecosystem (CCDE) that maximizes the access, use, and interoperability of childhood cancer data. The successful implementation of the CCDE will enhance the pediatric cancer community’s efforts to improve prevention, diagnosis, and treatment outcomes for childhood cancer patients.

Key Responsibilities:

- Curate and quality check (QC) data submissions to ensure accuracy and completeness.
- Facilitate the process of CCDI data submissions through various stages to ensure proper deposition in repositories such as the NCI’s Childhood Cancer Data Commons and NIH’s dbGaP.
- Conduct QC checks of CCDI data against defined data models and permissible values.
- Write Python and Linux scripts to manipulate and format data into files.
- Verify submissions against original data manifests to ensure accuracy.
- Communicate and collaborate with data submitters and ensure submissions accurately represent the original data.
- Map cancer molecular and clinical data to appropriate data fields using defined permissible values.
- Report on the progress and status of data curation tasks.

Required Skills and Experience:

- Bachelor’s degree in Data Science, Bioinformatics, Molecular Biology, or a related field.
- Strong understanding of molecular and clinical data and its implications for research.
- Proven experience in data curation and quality control processes.
- Knowledge of and/or experience in data standards and data models such as caDSR, OMOP, MONDO, LOINC, SNOMED, etc.
- Experience working with data repositories and submission processes.
- Proficiency in data mapping and transformation.
- Experience with ETL of data using Python and Linux commands, including use of pandas, polars, or another data frame package to manipulate tabular/row-oriented structured data files for cleaning and QC.
- Familiarity with AWS S3 and the movement of files to and within that environment.
- Excellent communication skills to interact with data submitters and stakeholders.
- Attention to detail and strong analytical skills.

Preferred Skills and Experience:

- Master’s degree in Data Science, Bioinformatics, Molecular Biology, or a related field.
- Experience with cancer data.
- Knowledge of data harmonization and compliance standards.
- Familiarity with Cancer Research Data Commons (CRDC) and dbGaP registration processes.
- Proficiency in using data curation tools and software.

#GDITFedHealthJobs

#GDITHealth
