In Australia, The Peter MacCallum Cancer Centre and the John Holland Group, an infrastructure and construction firm, have turned to cloud data and AI platform Databricks to solve significant data fragmentation problems that were hindering their ability to draw insights from business data.
Speaking at Databricks’ Data + AI World Tour in Sydney, Australia last month, tech leaders at both organisations reported facing challenges such as siloed data, competing business areas, data integration issues, and legacy systems, prompting the need to seek a cloud data solution.
Peter Mac’s legacy data infrastructure limited its ability to effectively leverage big data and AI across its extensive clinical and research operations. The legacy technology also jeopardized its mission to improve the lives of people with cancer, including the use of AI to improve clinical decision making and accelerate biological insights and drug discovery.
During the conference, Jason Li, head of the bioinformatics core facility in Peter Mac’s cancer research division, said that:
SEE: Informatica claims data fragmentation a barrier to AI in APAC
Li said Peter Mac selected Databricks to help it harmonise data across the centre and support advanced analytics, including AI, while meeting data security and privacy requirements in health care.
Peter Mac first tested the AI potential of the Databricks platform with an AI transformation pilot project:
Li said speed across the project was a big advantage: “We estimate that with Databricks, we have sped up the development process by fivefold, and reduced communication overheads across stakeholders by tenfold, allowing us to bring innovations to the market earlier to benefit patients.”
AI has grown into a larger part of Peter Mac’s strategy. Databricks is supporting the cancer centre in three additional use cases: genomics, radiation oncology, and cancer imaging. Additionally, Peter Mac is:
Meanwhile, John Holland managed 80 large-scale infrastructure projects worth AUD $13.2 billion in 2023. However, Travis Rousell, the company’s head of data and analytics, said its legacy data warehouse environment was fragmented and difficult to integrate.
SEE: How to improve data quality in data lakes
“We’ve got all the typical problems everybody’s had historically with data warehouses and data problems,” Rousell said. “Our legacy data warehouse environment was built incrementally over 20 years. It’s slowly evolved and developed out, and we’ve created this really swampy set of data silos.”
Rousell added: “We could build BI [Business Intelligence] and reports on the front of those, but joining that data together to be able to create insights into the flow of activities and behaviors that are occurring so that we can drive change across our business has been a really difficult process for us.”
John Holland set out to create a unified data platform to unlock data for business value. This was part of the group’s effort to drive innovation and competitive advantage in its industry through modern data and digital practices as part of a broader digital transformation push.
The organisation has sought to:
John Holland has so far delivered several core business processes to Databricks’ data lake, including project management, project operations, project controls, safety, and fleet analytics.
As a result of using Databricks, Rousell said that John Holland had:
Rousell said that Databricks ensures IT and technology do not constrain the business from progressing.
“I think the biggest thing for me that we’re achieving by doing this is we’re creating this data culture of ‘yes’ within John Holland,” Rousell explained. “Historically, the difficulty in provisioning new and innovative products has meant we’ve had to stand up large slow projects and underdeliver for the business.
“Now, if the business has an idea, we can say yes; we can deploy them a data workspace that gives them access to all the capability and tooling they’ll need, and they can go and build that at the speed.”