The Division of Hematology is a progressive and dynamic center for basic and translational science, clinical research, patient care, and teaching, where clinicians, scholars, investigators, and trainees work together to harness the power and resources of Stanford Medicine in the pursuit of excellence.
Duties include:
- Prioritize and extract data from a variety of sources such as notes, survey results, medical reports, and laboratory data, and maintain its accuracy and completeness.
- Design and customize reports based upon data in the database. Oversee and monitor regulatory compliance for utilization of the data.
- Use system reports and analyses to identify potentially problematic data, make corrections, and eliminate root cause for data problems or justify solutions to be implemented by others.
- Create complex charts and databases, perform statistical analyses, and develop graphs and tables for publication and presentation.
- Serve as a resource for non-routine inquiries such as requests for statistics or surveys.
DESIRED QUALIFICATIONS:
- Master’s-level degree in statistics, data science, biostatistics, computer science, or a related field is highly preferred.
- 0-2 years of relevant work experience in data science, data engineering, or analytics.
- Experience working with healthcare data, clinical databases, or electronic health records is highly preferred but not required.
- Demonstrated experience integrating data from multiple sources or building data pipelines through coursework, internships, or professional work.
- In-depth knowledge and experience using and applying analytical software, database management system software, database reporting software, database user interface and query software, and data mining software.
- Proficiency in Python programming for data manipulation, extraction, and ETL processes. Experience with libraries such as pandas, sqlalchemy, numpy, and PDF parsing tools (PyPDF2, pdfplumber, or similar).
- Intermediate to advanced SQL skills, including experience with complex queries, joins across multiple tables, and database design principles. Experience with healthcare or clinical databases strongly preferred.
- Basic to intermediate knowledge of statistical analysis methods and their application.
- Experience with REDCap or similar electronic data capture systems for research data management.
- Experience integrating and merging datasets from multiple disparate sources with different schemas and data quality levels.
- Understanding of relational database design, data modeling, and database schema development.
- Strong data quality assurance skills including ab