About
Job Description
Description:
DUTIES AND RESPONSIBILITIES:
* Work in a team to design, build and maintain modeling and backend data collection/integration systems.
* Write robust production quality code and take responsibility for system quality, documentation, high uptime, and prompt collaborator assistance.
* Drive continual improvement of modeling techniques with high impact and interpretability.
* Partner with lab scientists to collaborate on Intrexon's metabolic engineering and maintain high adoption of the tools we develop.
* Contribute to design of experiment and steering of lab technologies compatible with model development.
* Prepare technical presentations to communicate with project teams, leadership, and other stakeholders.
* Present findings to stakeholders in a way that can be easily understood and leveraged by non-data scientists.
* Learn aggressively and strive to stay at forefront of data engineering and scientific advances.
EDUCATION AND EXPERIENCE:
* PhD (preferred), MS (2+ years' experience) or BS (5+ years' experience) in bioengineering, computer science, mathematics, genetics, engineering, bioinformatics, or a related quantitative field.
TECHNICAL SKILLS:
* Ability to write reliable production-ready Python code and troubleshoot rapidly.
* Good understanding and depth of experience with classic machine learning techniques and methods.
* Rapid data analysis and prototyping skills in Python/R and experience with notebooks (RStudio, Jupyter, Zeppelin, etc.)
* Functional experience with Linux systems and virtualization: bash scripting, environment setup, installs, Docker images, Kubernetes, and profiling.
* Ability to work in a software development team: git, JIRA ticket writing, testing, deployment.
* Experience with relational databases (Oracle, MySQL), design patterns, and optimization
* Ability to understand complex code bases and troubleshoot open source software.
* Desired skills:
* High dimensional data analysis (p >> n, dimensional reduction, clustering, etc.)
* Statistical analysis experience and probabilistic programming
* Experience modeling real processes with systems of ordinary differential equations
* Experience applying advanced methods (RNN's, causal networks, image analysis, etc) to biological problems
* Experience with scalable computing (Spark, parallelization, disk/memory/network/cloud usage)
* Knowledge of fermentation, bioprocessing, and process control
* Experience with Next Generation Sequencing (NGS), omics methods and associated datasets
* Proficiency in other programming languages: Java, C++, Scala, functional programming
DESIRED KEY COMPETENCIES:
* Ability to learn both new data engineering techniques and science quickly
* Maintain a high degree of accuracy and attention to detail.
* Ability to understand and execute on the company's mission and values.
* Fosters innovation through creative solutions and group collaboration.
* Successful at communicating in both oral and written forms.
* Maintain a high degree of ethical standards and trustworthiness.
* Capable of fostering change in an organization.
* Deals with conflict in a direct, positive manner.
* Ability to think and adapt to a rapidly changing environment.
* Able to reach rational conclusions through complex processing of information.
* Fosters constructive dialogue and feelings toward the company, coworkers, and tasks.
* Well organized and capable of clear communication through technology (i.e. Outlook, PowerPoint, and other programs used to create and distribute reports and key information).
* Energized by accomplishments and excellence in the workplace.
* Capable of high performance in independent work as well as in team setting.
* Effective organization and implementation of group projects.
* Up-to-date knowledge on industry current events.
EOE MFDV