Provenance Guard: Leveraging IPFS And Scalable Blockchain For Secure And Transparent Data Provenance
Abstract
Data provenance in the cloud environment faces significant challenges due to concerns over storage and
processing management. In this paper, we propose a scalable data provenance solution employing
Blockchain and the InterPlanetary File System (IPFS). The system leverages the distributed Ethereum ledger
for storing provenance data and associated hashes, while IPFS manages entry record hashes. This separation
enhances mutual trust between cloud service providers and users, improving data management transparency
and security. We implemented our solution as a full-stack web application using the MERN stack
(MongoDB, Express, React, and Node.js), integrated with a local Geth instance running a modified
proof-of-work algorithm. The application communicates with the Geth instance via the Web3.js library, using
Solidity smart contracts to link the application and the blockchain. Our performance analysis results
demonstrate improved transaction latency and overall performance, indicating the system’s effectiveness in
addressing data provenance challenges in a cloud environment. Future work involves securing the mapping
file, extending the service to all user-uploaded files, and investigating scalability and performance in a larger
blockchain network. We also plan to examine the potential of Hyperledger for managing data provenance
and the feasibility of implementing Blockchain as a Service (BaaS) provided by cloud service providers.
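
The separation between a content-addressed store and a ledger of hashes, as summarized in the abstract, can be illustrated with a minimal Node.js sketch. It is not the implementation described in the paper: `ContentStore` and `ProvenanceLedger` are hypothetical in-memory stand-ins for IPFS and the Ethereum ledger, a plain SHA-256 hex digest is assumed in place of an IPFS content identifier, and the record fields (`file`, `user`, `op`) are invented for illustration.

```javascript
const crypto = require("crypto");

// SHA-256 hex digest of a string (assumption: real IPFS uses multihash-based CIDs).
function sha256(text) {
  return crypto.createHash("sha256").update(text).digest("hex");
}

// Hypothetical stand-in for IPFS: content is stored under its own digest.
class ContentStore {
  constructor() {
    this.blocks = new Map();
  }
  add(content) {
    const id = sha256(content);
    this.blocks.set(id, content);
    return id;
  }
  get(id) {
    return this.blocks.get(id);
  }
}

// Hypothetical stand-in for the ledger: append-only, hash-chained entries.
class ProvenanceLedger {
  constructor() {
    this.entries = [];
  }
  append(fileId, action, contentId) {
    const last = this.entries[this.entries.length - 1];
    const prevHash = last ? last.entryHash : "0".repeat(64);
    const body = { fileId, action, contentId, prevHash };
    const entry = Object.freeze({ ...body, entryHash: sha256(JSON.stringify(body)) });
    this.entries.push(entry);
    return entry;
  }
}

// A record is valid if the stored content still hashes to the id kept on the ledger.
function verifyRecord(store, entry) {
  const content = store.get(entry.contentId);
  return content !== undefined && sha256(content) === entry.contentId;
}

// Usage: record an upload event, then detect tampering in the store.
const store = new ContentStore();
const ledger = new ProvenanceLedger();
const record = JSON.stringify({ file: "report.pdf", user: "alice", op: "upload" });
const entry = ledger.append("report.pdf", "upload", store.add(record));
console.log(verifyRecord(store, entry)); // true
store.blocks.set(entry.contentId, "tampered");
console.log(verifyRecord(store, entry)); // false
```

Because the ledger keeps only fixed-size digests, the bulky record can live off-chain, and any change to the stored record is detected when its digest no longer matches the ledger entry.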