Lilly Consultant-Scientific Data Engineering in Indianapolis, Indiana
Lilly generates a large volume of biomedical data through discovery research and through the development of medicines in clinical trials. Massive public data sets are also available that provide additional insight into the disease and treatment areas of interest to Lilly. Data science and informatics play a key role in driving actionable insights for discovering new medicines for our patients.
The Research Data Sciences and Engineering team at Lilly is seeking a qualified and experienced individual to help design, integrate, and feature engineer a wide variety of clinical, biomarker, and genomic datasets. You would collaborate closely with our team of software engineers, research scientists, and subject matter experts. Your work would make a meaningful contribution to data science, the drug discovery process, and the research behind it.
Marshal, transform, and integrate internal and publicly available research data for scientists.
Feature engineer data for ML/AI and analytics use by analysts and statisticians
Lead the mining, annotation, and loading of new research data sources, including finding and learning about new resources and vetting their applicability to the task
Participate as a technical and scientific lead on multi-functional research and IT teams. Drive the adoption of data by understanding use cases in Lilly Research
Influence and craft data science strategy by providing regular inputs to projects, creative methods and the best data engineering practices in a changing research environment
Deep knowledge of SQL/NoSQL databases including data pipelines, schema design, performance trade-offs and tuning for large-scale, heterogeneous scientific data
Experience in developing and delivering cloud-based computing technologies (AWS) and related ‘big data’ systems (e.g. S3, Redshift, Presto, Spark)
Deep understanding of, and practical experience with, agile software practices (e.g., Scrum)
Interest in growing knowledge of biological/chemistry sciences and desire to learn about genetics & genomics
Good interpersonal and communication skills; proven ability to work effectively within a team; ability to communicate and understand complex concepts in both technical and non-technical terms
Publications or invited talks demonstrating knowledge of scientific data (NGS data or similar)
Experience with Amazon Web Services (AWS), Google Cloud, Azure
Familiarity with bio/chem informatics tools (GATK, Galaxy, CWL) and techniques
Applied knowledge of ML/AI in the context of scientific research
Lilly is an EEO/Affirmative Action Employer and does not discriminate on the basis of age, race, color, religion, gender, sexual orientation, gender identity, gender expression, national origin, protected veteran status, disability or any other legally protected status.
Scientific Data Engineer
B.S. in computer science, bioinformatics, or similar data-centric science concentration + 2 years of applicable experience or M.S. in computer science, bioinformatics, cheminformatics or similar data-centric science concentration + 1 year of applicable experience
Experience with one or more data science languages: R, Python, Java
Qualified candidates must be legally authorized to be employed in the United States. Lilly does not anticipate providing sponsorship for employment visa status (e.g., H-1B or TN status) for this employment position
At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our 39,000 employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world.