Design, implement and maintain reliable and scalable data infrastructure, including design and development of industrial-scale data pipelines to capture, store, process and publish platform telemetry for performance and utilisation analytics, and support the development and build the automation of system performance and metrics
Collaboration with portfolio data platform teams to utilize existing data products, ingestion patterns, or automation to avoid bespoke development while contributing to the enhancement and creation of these shared assets when gaps are identified
Own the end-to-end technical data lifecycle and corresponding data technology stack for their data domain and have a deep understanding of the bp technology stack.
Write, deploy and maintain software to build, integrate, manage, maintain, and quality-assure data, and responsible for deploying secure and well-tested software that meets privacy and compliance requirements; develops, maintains and improves CI / CD pipeline.
Adhere to and advocate for software engineering best practices (e.g. technical design, technical design review, unit testing, monitoring & alerting, checking in code, code review, documentation).
Requirements
Deep and hands-on experience designing, planning, implementing, maintaining and documenting reliable and scalable data infrastructure and data products in complex environments.
Development experience in one or more object-oriented programming languages (e.g. Python, Go, Java, C++)
Experience designing and implementing large-scale distributed system
Desired Skills
Data Manipulation: debug and maintain the end-to-end data engineering lifecycle of the data products; design and implementation of the end-to-end data stack, including designing complex data systems, e.g. interoperability across cloud platforms; experience on various types of data (streaming, structured and unstructured) is a plus.
Coding: contribute to internal and/or external libraries through raising issues, adding documentation and/or contributing new features; write clean, reusable code in a language that runs in production at bp.
Software Engineering: hands-on experience with SQL and NoSQL database fundamentals, query structures and design best practices, including scalability, readability, and reliability; you are proficient in at least one object-oriented programming language, e.g. Python [specifically data manipulation packages - Pandas, seaborn, matplotlib], Apache Spark or Scala;
Scalability, Reliability, Maintenance: proven experience in building scalable and re-usable systems that are used by others; knowledge and experience in automating operations as much as possible and identifying and building for long-term productivity over short-term speed/gains, and execute on those opportunities to improve products or services.
Data Domain Knowledge: proven understanding of data sources and data and analytics requirements and typical SLAs associated with data provisioning and consumption at enterprise scale, with interest and experience in the analysis of data or other enterprise platform operations activities.
Cloud Engineering– Recent experience utilizing data analytics offerings and services from Azure and AWS
Hiring criteria
You should have or be completing the following to apply for this opportunity.
Bachelor's Degree
Degree or Certificate
Minimum Level of Study
Associate Degree or higher
From an Institution in
Malaysia
Study Field
I
Artificial Intelligence
Bioinformatics
Computer Graphics & Animation
Computer Science (all other)
Computer Systems and Networks
Cyber Security
Work rights
The opportunity is available to applicants in any of the following categories.