Description
Curate, annotate, and maintain scientific databases supporting bioinformatics, biotechnology, and medical research. Design schemas, metadata standards, and ingest/validation pipelines to integrate, quality-check, and link genomic and other biological information; collaborate with researchers and software engineers to ensure accuracy, usability, and compliant access.
- Recommend new data models, ontologies, and curation workflows to improve database operations.
- Stay current with data standards, ontologies, instrumentation, and curation software.
- Coordinate with research, product, and operations teams to prioritize database features and content.
- Collaborate with software developers to implement and modify curation tools and APIs.
- Test and validate new and updated database features, pipelines, and tools.
- Provide controlled vocabularies, metadata templates, and guidance for data submission and annotation.
- Generate summary statistics, release notes, and data dictionaries for database content.
- Train users and staff in data submission, querying, and curation best practices.
- Improve user interfaces and search capabilities for databases and portals.
- Lead or coordinate the work of curators, technicians, and data managers.
- Develop ETL scripts and pipelines to ingest, normalize, and de-duplicate datasets.
- Design and maintain database schemas and indexes for performance and integrity.
- Create or modify web-based data access and curation tools.
- Develop and apply data validation, parsing, and record-linkage algorithms.
- Create novel approaches for data harmonization, provenance tracking, and integration.
- Aggregate, annotate, and harmonize datasets (genomics, proteomics, metabolomics, clinical) for inclusion.
- Document and communicate database updates through reports, documentation, and publications.
- Ingest, cross-reference, and reconcile public, commercial, and proprietary datasets.
- Consult with researchers to define metadata standards, access policies, and computational strategies.
- Perform quality control audits and integrity checks on large molecular and clinical datasets.
Source
Tasks & skills: O*NET occupational data (work activities, skills, knowledge).
Sources & Standards:
This site includes information from O*NET by the U.S. Department of Labor, Employment and Training Administration (USDOL/ETA), used under the CC BY 4.0 license. Career Clutch has modified some of this information for student readability. USDOL/ETA has not approved, endorsed, or tested these modifications. O*NET® is a trademark of USDOL/ETA.
Last reviewed: Jan 2026