With an ecosystem of high-throughput instruments generating multi-terabyte datasets, the data management capacity of a leading global gene therapy innovator had reached breaking point. It was too slow, too siloed, and too reliant on proprietary formats that locked data away from scientists who needed it most.
The data congestion crisis
The biotechnology company faced several challenges in collecting and storing analytical data, particularly large datasets generated by techniques such as flow cytometry, next-generation sequencing (NGS), western blot analysis, and high-resolution microscopy images. As part of the western blot process, QA scientists had to log in to the instrument to review the data and audit history. This involved booking time on an instrument which requires hours to complete a single run, reducing instrument availability.
The company’s Scientific Data Management System (SDMS) had historically performed well, but struggled to handle files around 30TB, making it prohibitively slow to store or retrieve large datasets. The SDMS also encrypted the data and did not enable viewing, requiring use of additional systems, causing further effort and cost. The system also couldn’t easily generate meta tags, relying on users to name the file correctly to ensure accurate tagging.
A ‘Swiss Army Knife’ for laboratory data management
The organisation turned to Splashlake to deliver a unified system that integrates instruments, manages data and supports digital archiving on a single platform. Moving away from fragile, custom-coded integrations, the team implemented a unified approach that captures, contextualises, stores and enables efficient access to all laboratory and production data.
Key technical pillars:
- Instrument integration and automated data capture: Splashlake connects the LIMS to all analytical instruments, automating data capture without taking instruments offline for audit review. This frees instrument capacity, supports data integrity and traceability and enables seamless integration with LIMS and lab workflows.
- Large data storage and cataloguing: Splashlake enables scientists to store and manage large, complex data files while providing version control, metadata tagging and audit history. Splashlake’s metadata tagging supports data cataloguing and search; scientists can also directly view the data they query.
- Digital archiving for GLP/GMP: Splashlake can be used to implement secure, compliant digital archiving, addressing patent, legal and audit-readiness needs while enabling structured, easy data retrieval. Splashlake converts proprietary formats to human-readable files, allowing laboratories to retire legacy systems with confidence, knowing data remains protected and accessible.
Maximising value from data
Splashlake provides a flexible, open and scalable data platform for capturing, contextualising, storing and accessing laboratory and production data. By replacing multiple single-purpose tools, it simplifies system landscapes while avoiding vendor lock-in.
The incumbent SDMS was fully replaced, with legacy data ingested from local storage on scheduled routines. The new solution provides full audit trails, version control and improved performance. Training followed a “train-the-trainer” model to ensure adoption and long-term success.
Measurable benefits
By adopting Splashlake, the company has:
- Improved data integrity and visibility across large, complex datasets
- Freed up instrument time, reducing delays and operational costs
- Established compliant digital archiving for GLP and 21 CFR Part 11
- Consolidated multiple functions into a single, lower-cost platform
- Future-proofed its data ecosystem for advanced analytics and AI
- Maintained full control of data without proprietary constraints.
The company is now exploring expanded use cases, including spectral compound library management, replacing spreadsheets with structured, searchable datasets to support collaboration and AI-driven discovery.
Splashlake provides a refreshing approach to scientific data management. By bridging the gap between instruments, LIMS, ELN and SAP, Splashlake helps organisations achieve seamless connectivity and data-driven insights.