Description
Career Area:
Engineering
Job Description:
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
Caterpillar products generate and process large amounts of data from sensors, on-board models, and control algorithms in real time, creating real-world machine usage information. As part of its overall digital strategy, Caterpillar has developed an integrated family of machine-based information technology products that interconnect machines to provide real-time guidance, tracking, monitoring, and supervisory control of the operational mining fleet. This family of products is branded MineStar. The MineStar product range links with other mining and business information systems and is an integral part of the Mining Technology Enabled Solutions vision.
We’re seeking an experienced Senior Developer to join a specialist team focused on building a scalable data platform which ingests high-frequency telemetry data from mobile field machines and supports data science and machine learning workloads.
You’ll be instrumental in transforming a legacy Java Spring Boot monolith into a containerized, scalable, cloud-ready architecture, while enabling stream and batch data processing, lakehouse persistence, and low-latency predictive model hosting.
Responsibilities:
Design and implement robust, scalable components for ingesting, processing, and persisting high-frequency telemetry data.
Collaborate with data scientists to host, orchestrate and optimize workloads in Python, Scala, and Java.
Design and build components using technologies like Apache Spark, Delta Lake, Redis/Valkey, MQTT, and PostgreSQL.
Drive modernization efforts, including containerization and deployment on Kubernetes and integration with S3-compatible object stores (e.g., Ceph).
Evaluate and integrate emerging technologies (e.g., Flink, Trino, Kafka, DuckDB, Dask, Daft) to optimize performance and scalability.
Use your experience to contribute to architectural decisions involving event sourcing, CQRS, and hybrid cloud deployments.
Ideal Candidate Profile
Extensive experience in backend development with languages such as Java, Scala and Python.
Proven track record working in teams to develop large, complex applications.
Deep understanding of streaming and batch data processing, ideally with Apache Spark or similar.
Experience with containerization (Docker) and orchestration (Kubernetes).
Familiarity with data lake/lakehouse architectures, especially Delta Lake.
Strong knowledge of message brokers (MQTT, Kafka) and caching systems (Redis/Valkey).
Comfortable working across multiple languages (Java, Python, Scala).
Experience in designing systems for scalability, multi-tenancy, and hybrid deployments.
Prior experience in data engineering roles and expertise in machine learning algorithms and statistical modelling will be highly regarded.
Mining industry knowledge is desirable but not mandatory.
Why Join Us?
Work on a technically challenging and meaningful application in the mining domain.
Be part of a forward-thinking team driving innovation in data-centric systems.
Influence architectural direction and technology choices.
Collaborate with experts in autonomy, data science, and distributed systems.
Posting Dates:
June 1, 2026 - June 8, 2026
Caterpillar is an Equal Opportunity Employer. Qualified applicants of any age are encouraged to apply.

