Next Article in Journal
An Evaluation Method of Gas-Bearing Properties Based on Gaussian Bimodal Function Pore Structure Characterization: A Case Study of Tight Sandstone in the East China Sea Basin
Next Article in Special Issue
Thermodynamic Assessment of the Fe–Mn–Ni System and Diffusion Mobility of Its Face-Centered Cubic Phase
Previous Article in Journal
Advances in Ascorbic Acid (Vitamin C) Manufacturing: Green Extraction Techniques from Natural Sources
Previous Article in Special Issue
Machine Learning Aided Prediction of Glass-Forming Ability of Metallic Glass
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

InterMat: A Blockchain-Based Materials Data Discovery and Sharing Infrastructure

1
Central Iron & Steel Research Institute Co., Ltd., Beijing 100081, China
2
Beijing MatDao Technology Co., Ltd., Beijing 100081, China
3
Material Digital R&D Center, China Iron and Steel Research Institute Group, Beijing 100081, China
*
Author to whom correspondence should be addressed.
Processes 2023, 11(11), 3168; https://doi.org/10.3390/pr11113168
Submission received: 22 September 2023 / Revised: 25 October 2023 / Accepted: 1 November 2023 / Published: 7 November 2023
(This article belongs to the Special Issue Digital Research and Development of Materials and Processes)

Abstract

:
Material research and development driven by data analysis necessitates a substantial volume of data. However, conventional material data sharing platforms encounter challenges in sharing and integrating data across multiple platforms. This article proposes a blockchain-based materials data discovery and sharing infrastructure—InterMat, which is a material big data management and sharing framework model integrating cloud platforms and blockchain. It could support the full lifecycle of materials data sharing, including data generation, management, discovery, sharing, traceability, and valuation. The architecture of the InterMat, its unique method of constructing a consortium chain, and the protocol for data discovery are presented in this paper. Additionally, the method for materials data identifier and blockchain certification is established, which allows for a unified identifier on the blockchain and cloud-based data addressing from various organizations. InterMat has data discovery algorithms for various materials to achieve the discovery of similar materials data from different nodes. Furthermore, we have designed some blockchain smart contracts for InterMat to encourage data sharing across nodes. These contracts include a proof smart contract that records data sharing activities, ensuring transparency and traceability in the materials data flow. The other contract is a value-estimating contract to encourage high-quality data sharing. Finally, this article introduces the application case of InterMat, using steel materials as an example to demonstrate its applications in data management, data discovery, data valuation, etc. This study successfully addresses various challenges associated with the cross-platform sharing of materials data, such as issues related to data discovery, data rights and control, and willingness to share. InterMat can assist material researchers in discovering and accessing more data, which would create a new ecology for sharing data.

1. Introduction

The advancement of the Internet and computer technology has significantly enhanced the data collection and management capabilities of the materials industry [1]. Big data technology allows for the analysis, mining, and application of massive materials data, thereby maximizing their value [2]. Currently, the methodology of materials research is entering the fourth paradigm through empirical science, theoretical science, and computational simulation, which is big-data-driven material research and development [3]. At present, there are many materials data platforms around the world, such as AFLOW [4], NOMAD CoE [5,6], and OQMD [7]. These platforms work variably based on their positions, some of which focus on providing data services, while others provide data analysis and algorithm workflows.
The materials industry is a typical process industry with multiple key stages [8]. The full life cycle of materials involves research and development, production, equipment manufacturing, and service. Various types of data are generated at each stage, and these data are interconnected and complementary. By integrating and conducting multi-dimensional analysis, the full potential of the data can be realized. However, these data are spread across various platforms, and they are operated and owned by different authorities. Therefore, finding a solution to enable data sharing across platforms is the priority issue in materials data infrastructure.
As for material data, sharing across platforms will face several challenges. Firstly, the materials data discovery, which refers to the process of finding similar materials data in different heterogeneous databases, is difficult to deal with due to various data heterogeneity and data standards in different platforms [9]. Thus, to enable materials data sharing, it is fundamental to establish appropriate metadata standards and efficient data discovery algorithms. Secondly, confirming data rights and ensuring controllable sharing is crucial, as materials data are one kind of firsthand scientific data that are collected during research and production. These data are considered as valuable assets [10]. Therefore, it is crucial to ensure the protection of intellectual property rights and allow users to have independent control over the data during its flow. Lastly, the willingness to share, which includes data quality issues and driving forces for sharing, should be well treated. The accuracy of material analysis and evaluation depends on data quality, so a mechanism to assess data reliability should be designed. Additionally, incentive mechanisms should be implemented to encourage sharing behavior.
The sharing and utilization of large amounts of materials data are hindered by the aforementioned issues. However, the emergence of blockchain technology offers new solutions. Blockchain technology is a decentralized database technology that combines various computer technologies, including distributed data storage, peer-to-peer transmission, and encryption algorithms. It uses distributed consensus algorithms to generate and update data and utilizes peer-to-peer networks for data transmission between nodes. The distributed ledger, which combines cryptography theories and timestamps, ensures the tamper-proof and traceability of stored data and utilizes automated script code or smart contracts to implement upper-layer application logic. If traditional databases implement unilateral maintenance of data, blockchain implements multi-party maintenance of the same data to ensure data security and business fairness. For this reason, although blockchain was initially used in digital currencies [11,12,13], its decentralization and traceability make it an effective solution for sharing data across different organizations [14,15,16,17,18,19].
In the industrial field, many researchers are exploring the application of blockchain. Liu et al. [20] proposed a framework for product lifecycle management (PLM) based on industrial blockchain to facilitate data exchange and shared services throughout the product lifecycle. The study introduced the concept of industrial blockchain and provided an information storage and exchange platform that enables cross-enterprise data sharing services. Mandolla et al. [21] proposed a blockchain system for creating digital twins for additive manufacturing and used aviation materials as an example to illustrate the benefits of combining blockchain and 3D printing in coordinating material usage time and supply time. Blockchain can securely transmit data to verified 3D printers, thereby saving costs associated with inventory, procurement, and logistics. Joshi et al. [22] researched the adoption of blockchain technology for privacy and security in the context of Industry 4.0. They have explored the fundamentals of blockchain, blockchain applications, the key costs impacted by blockchain, and privacy-preserving approaches. Furthermore, several possibilities for IoT and Industry 4.0 applications that leverage blockchain are being investigated. Ding et al. [23] proposed a manufacturing system based on blockchain for an I4.0 workshop by studying the autonomous trusted trigger mechanism of smart contracts and the distributed storage method of manufacturing data in a workshop manufacturing system and integrating the workshop service system, a system operation mechanism for manufacturing data security and trusted triggers in the manufacturing process. In general, the current research on blockchain applications in the field of industry is very limited, especially in the field of materials. It is necessary to further explore the characteristics of materials data, data discovery and sharing models, and privacy protection methods in order to promote the sharing and application of materials big data.
This paper introduces a blockchain-based infrastructure called InterMat (as shown in Figure 1). It is built on a blockchain foundation and supports the entire lifecycle of materials data, including generation, management, certification, discovery, sharing, traceability, application, and valuation. InterMat is designed to facilitate data sharing and circulation throughout the entire lifecycle of the materials industry, from research and development to production and service, enabling data-driven material research and development.

2. Construction of InterMat Model

2.1. System Architecture

The diagram in Figure 2 illustrates the structure of the InterMat architecture. It consists of three layers: the client, the cloud platform, and the blockchain. These layers work together and communicate through multiple interfaces.
Client: including data providers and data users. The data provider uploads the data through the client, and the complete data are stored in the cloud platform, while at the same time the metadata are stored in the blockchain. The data user also utilizes the client to request data, which are then sent to the blockchain. The blockchain sends the storage location information to the data requester, and then the data requester obtains the materials data.
The cloud platform: cloud computing provides on-demand network access of configurable computing resources, enabling users to pay for the resources or services they use [24]. The cloud platform offers wide accessibility, abundant resources, flexible distribution, and strong reliability. As a result, many materials data platforms utilize the cloud platform as their infrastructure for storing data. Various organizations store the data from each stage of the material industry chain in different cloud computing platforms. In the cloud platform of InterMat, complete materials data are stored, including metadata and master data. Furthermore, the data release and data flow events can be stored through the certificate storage smart contract to achieve the purpose of data confirmation.
Blockchain: blockchain technology is utilized to create a system of trust between various platforms. A smart contract is employed to extract and encode metadata based on a predefined strategy, which are then stored in the blockchain state database. When a data requester initiates a data request, the blockchain is used for data discovery and addressing. Furthermore, the data proof smart contract can perform blockchain certification on data release and data sharing events to confirm data ownership rights.

2.2. The Construction of Consortium Blockchain

InterMat employs consortium blockchain technology [25] to create a network with three layers: ‘node-industry chain-InterMat’. This network enables the sharing of materials data across different organizations and industries, creating a collaborative ecosystem. The architecture of its application network model is depicted in Figure 3.
  • Nodes
Nodes are the most basic units of the InterMat network and can be divided into full nodes and light nodes. A full node is a node that has the function of storing materials data and participates in the construction of a new block. The ledger includes the block header and the block body that stores the list of transactions. Full nodes are suitable for large enterprises, organizations, and institutions with many people. A light node is a node that only stores materials data. The storage method includes independent node servers or third-party public cloud accounts. It does not participate in consensus but receives block header information sent by full-node users. It is suitable for small teams or individuals in scientific research institutions.
  • Material industry chain
The material industry chain is composed of multiple nodes. It is a collaborative blockchain network of organizations within a specific industry that work together to share materials data. This industry chain utilizes materials data to connect various enterprise nodes, such as research and development, production, procurement, equipment construction, and application. Taking the petrochemical industry chain as an example, it connects petrochemical material R&D institutions, petrochemical material production plants, petrochemical equipment manufacturing enterprises, and petrochemical terminal enterprises. These organizations can share petrochemical materials data through blockchains and then use data-driven material R&D methods to optimize materials.
  • InterMat
InterMat is a blockchain-based network that enables the decentralized or multi-centralized sharing of materials data. It connects nodes and different material industry chains to create a network. This allows for the efficient sharing of materials data between different industries, forming a network ecology for interconnecting multiple industries. InterMat utilizes smart contracts to ensure secure and reliable data management and sharing services. These services include materials data identification and proof, data discovery, shared data traceability, data valuation, and so on.

2.3. Materials Data Discovery Protocol

The data sources and types vary across different stages of the materials industry chain. In the research and development stage, data are primarily sourced from scientific research institutes and universities, encompassing composition, performance, structure, and process-related research data. Material production plants are the main source of data in the production stage, providing data on material production processes, equipment parameters, and factory inspection data. Equipment manufacturing data are mainly obtained from equipment enterprises, including data on material processing, welding, heat treatment, and so on. Material service data primarily originate from downstream application enterprises, consisting of data generated by equipment and materials in service, such as material corrosion and failure data. The combination of heterogeneous data sets from different stages and organizations in the industrial chain can complement each other for a specific material. Establishing a secure sharing mode can significantly enhance research and development efficiency. However, the challenge lies in the fact that data are scattered across different database systems, each using different data standards [26,27]. Therefore, finding and accessing data between different systems is a problem that needs to be addressed.
Metadata refer to information that describes other data [28,29] and are essential for finding and understanding data [30,31]. Traditional metadata typically consist of keywords, but different organizations have different standards for assigning keywords. For instance, a specific type of low-temperature steel with nine percent nickel content may be labeled as 9Ni by institution A, 06Ni9DR by institution B, and X8Ni9 by institution C. This demonstrates that keyword-based matching is not appropriate for sharing materials data across different organizations.
In this paper, we propose a method for discovering and sharing materials data across different nodes using pattern recognition. The method is illustrated in Figure 4. Despite the complexity and various forms of materials data, we categorize them into tetrahedral elements known as material feature data, which include composition, performance, structure, and process [32]. We use multidimensional perception to analyze the materials data, extract the relevant material features, and represent them as a set of mathematical features. By calculating the similarity of material feature data between different materials, we can identify data with high similarity and provide them to the data requester, enabling the retrieval of similar materials. This approach allows for the effective correlation of materials data across different stages of research, development, production, manufacturing, and service. Authorized users can then share and collaborate on the data.
It is worth mentioning that the traditional implementation mode of this algorithm is the index data outsourcing calculation mode, which requires all parties to send material feature data to a third-party server for similarity calculation. However, this centralized model is susceptible to single point failures and lacks the ability to verify the credibility of third-party servers. This article uses distributed ledgers of blockchain for data discovery, which can effectively address the various shortcomings of centralized servers.

3. System Implementation

3.1. Materials Data Identification and Blockchain Certification

Materials data are typically composed of two parts: the master data and the metadata. The master data refer to the main content of the materials data, such as detection reports, calculation results, and production data. They can take various formats, including documents, images, tables, videos, audio files, and so on. Metadata are additional information that describes the material master data. They include materials data attributes, owners, characteristics, and other index information. Metadata serve two purposes in the sharing of data. Firstly, they serve as a means to identify the origin of the data, thereby enhancing the reliability and trustworthiness of the shared information. Secondly, metadata provide a description of the characteristics and attributes of the data, enabling data users to effectively locate and access the desired information.
Since the data come from different data sources, we first need to develop a set of identifier coding rules to uniquely mark the shared data and maintain the information. Then, in order to achieve the addressing of shared data, it is necessary to develop a set of identifier resolution methods, resolve the identifier according to a standardized process, and locate the location of shared data. Metadata are the main component of the identifier. In InterMat, we divide the metadata of materials into metadata of data source and metadata of master data.
Each materials data platform can include multiple data sources, so it is necessary to describe the data sources through metadata, as shown in Table 1. The example is shown in Box 1.
Box 1. Example: Experimental Data of R&D Data Platform from X company.
{
 “MaterialDataCollect”:
 {
 “SourceID”: “Steel_SEM photo of_R&D Data Platform of X company_definite interval”,
 “SourceType”: “Lab”
 “SourceName”: “S-4300 SEM”
 “Provider”: “Testing Technology Center of X company”
 “CollectionTime”: “YYYYMMDDThhmmss”
 “CollectURL”: “https://www.xxxxx.com/xxxxx”
 }
}
The metadata of the master data are used to describe the information of the master data, including the ownership information of the data, the index information used for data discovery, the location of the data, etc., as shown in Table 2.
In the metadata table of the master data, material feature data are essential. They are also the core of the materials data discovery protocol, serving as an index for material discovery calculations. Material feature data are the basic features of materials, which can be summarized as tetrahedral elements of materials, including composition, property, structure, and process, as shown in Table 3. The example is shown in Box 2.
Box 2. Example: SEM photo of pipeline steel.
{
 “MaterialData”:
 {
  “DataID”: “DB74989E-C6AA-4E3B-B209-04550154AC46”,
  “DataName”: “SEM photo of pipeline steel”
  “DataType”: “.jpg”
  “Unit”: “MB”
  “Size”: “25.2”
  “Owner”: “Research Institute of X company”
  “CreateTime”: “YYYYMMDDThhmmss”
  “MaterialName”: “06Ni9DR”
  “MaterialType”: “Low temperature steel”
  “MaterialFeature”: “C0.06Si0.3Mn0.5Ni9Cr0.2Mo0.1Cu0.2V0.01”
  “Description”: “This data is a SEM photo of X80 thermal simulation sample”
 }
}
Material metadata should be stored on the blockchain and stored in the blockchain status database. On the one hand, the data release time and content should be recorded and the data rights verified; on the other hand, metadata are also used for data discovery calculations to discover materials data with similar properties on the chain.
The process of materials data identification, certification, and resolution is shown in the Figure 5.
  • Data storage: The materials data are uploaded by the data provider to the cloud, where they are stored and a URL is provided. Afterwards, the data provider sends a request to the blockchain, including metadata from Table 1 and Table 2, material feature data from Table 3, master data hash, and data signature. According to the identification rules, the blockchain generates the TxID, user identity certificate hash (CaHash), and transaction hash of the transaction (TxHash). Then the TxID and the TxHash are returned to the data provider as the result of the blockchain certification.
  • Data request: The data user makes a request to the blockchain for the data. The blockchain then finds the necessary data based on the data discovery protocol, resolves the URL using the identifier resolution rules, and sends the data back to the user. Data users use this URL request to access materials data in the cloud platform.

3.2. Materials Data Discovery Algorithm

Data users can initiate data discovery requests based on their own materials data and discover similar materials data in InterMat. The discovery of materials data can be realized by algorithms, such as material pattern recognition based on the data schema of material metadata and feature data. The algorithm mainly includes two parts: material feature data extraction and data similarity calculation.
  • Material feature data extraction
Materials data are abstracted as a set of feature vectors, including composition, properties, and keywords (process, structure, shape), in the following form:
X i = x i 1 , x i 2 , , x i N
K i = k i 1 , k i 2 , , k i n
where:
N—the number of components (or the number of performance indicators);
xij—the value of the component (or property) j in the material i;
n—the total number of keywords.
  • Data similarity calculation
First, we need to normalize the data.
Y i j = X i j X j min X j max X j min 0 Y i j 1  
where:
Xij—the jth component (or property) data of material i;
X j max —the maximum value of the component (or performance) j in the data set;
X j min —the minimum value of the component (or performance) j in the dataset.
Second, we need to calculate the similarity of the data. For the two sets of materials data A and B from different nodes, we recommend the distance function method to calculate the similarity:
D S C A B = 1 j = 1 N ( Y B j Y A j ) 2 N
where:
Yj—the data after the jth component standardization;
N—the total number of composition.
The calculation of mechanical properties similarity DSP is the same as that of DSC. The keyword similarity DSKAB is calculated as follows:
D S K A B = N m a t c h e d N t o t a l
where:
Nmatched—the number of matched keywords in materials A and B;
Ntotal—the total number of keywords that participated in the match.
Then the total similarity:
D S A B = D S C A B × D S P A B × D S K A B
By calculating the similarity of the materials that can be found in each node one by one, sorting the data according to the similarity from high to low, and selecting the data with the highest similarity for display, the function of data association of similar materials in each node can be realized.
If there is a need for higher data privacy protection and the material feature data and other metadata used as an index also need to be kept confidential, we can use privacy calculations or searchable encryption methods to perform confidential calculations on the data. There are various encryption methods available, and we take the inner product encryption approach as an example to explain the similarity calculation method of material feature data in the encryption state in this paper. Inner product encryption is a type of function encryption that supports nearest neighbor search of data in an encrypted state [33,34], and it is suitable for scenarios where a similarity calculation of material feature data is performed. The process is illustrated in Figure 6.
When encryption calculation is required for material feature data, a trusted organization needs to be used as the key distribution node in the system. The key distribution node is responsible for initializing the system and generating the public key (pk) and master private key (msk). Then, the data provider encrypts the material feature data y through msk to generate ciphertext cty. Then the data provider sends the cty to the blockchain, which stores the cty and pk. When the data user initiates a data discovery request based on the material feature data x, the request is sent to the key distribution node, which sends the msk to the data user. The data user encrypts the feature data x to obtain the ciphertext skx, which is then sent to the blockchain. Blockchain uses pk, skx, and cty for decryption calculations to obtain the inner product values of x and y. Then, through calculation, the similarity between the two material feature data is obtained in plaintext.

3.3. Materials Data Traceability

Ensuring the traceability of materials data is crucial for facilitating data sharing among different organizations. To enhance the credibility of the data, we utilize blockchain technology to document the data flow process. In line with the characteristics of the materials data sharing system, this article proposes the creation of a data chain and a behavior chain, as depicted in Figure 7. The data chain primarily captures version updates of the data, while the behavior chain focuses on recording data flow events. The events recorded in the data chain and behavior chain are stored in the blockchain, making it difficult to tamper with and enabling reliable traceability of data during sharing.
The data chain records the modifications made by each user to the data. The data chain mainly includes:
  • Data identifier;
  • The version number of the current version;
  • Data hash;
  • Timestamp: the time of initial upload and the time of each modification.
The behavior chain records the sharing events of data, such as exchanges, references, etc. After discovering the data, the data user can request permission from the data owner. After being authorized by the data owner, data can be shared with the data user. Sharing, browsing, and other behaviors are recorded through transaction logs and stored in the blockchain, which is difficult to tamper with. The behavior chain mainly includes:
  • Data identifier;
  • Owner of data;
  • User: the user who requests data access;
  • Event types: including browsing data, providing feedback on data quality (positive feedback, negative feedback), and citing data;
  • Timestamp: the time when the behavior occurred.
The blockchain network regularly packages the data chain and behavior chain data and forms a chain structure. A block consists of a block header and a block body. The block header records the block height, the hash of the previous block, the Merkel root, and the timestamp. In this way, the blockchain certification of materials data in the process of sharing and transmission is realized, and the right to use and ownership of data are clarified, making the data sharing more secure and credible.

3.4. Data Valuation

The value of materials data needs to be evaluated in terms of data quality, data influence, and other aspects. Traditional data evaluation relies on expert judgment, which is inefficient and has large subjective factors. In this paper, we model the value of data through interactive data (as shown in Table 4), such as browsing, referencing, feedback, and so on, and objectively calculate the value of the data. Interactive data can indirectly reflect the importance and quality of data. Valuing materials data through interactive data offers two benefits. Firstly, it fosters a community atmosphere that boosts user engagement and encourages data sharing. Secondly, by using blockchain smart contracts to label low-value data, it incentivizes the sharing of high-value data, leading to improved data quality and the establishment of a high-quality data sharing ecosystem. It is worth mentioning that since the interactive data have already been verified in the blockchain, the data are reliable, so the calculated materials data value is also reliable.
The data value V can be calculated through the items in the above table:
V = E + R + F P F n + log 2 P + 1
This model is an initial model, which can be adjusted according to the actual situation in combination with the machine learning algorithm after the data are online.
This model is the initial model and provides a feasible model for calculating the value of data. In the future, we can adjust the parameters according to the actual situation.

4. Application Case

According to the approach described in this article, we have carried out real-world tests. The setup involves four entities: a public cloud for materials, a research and development institution A, a steel enterprise B, and a large equipment manufacturing plant C. We have built a steel industry chain within the consortium blockchain, created smart contracts of steel data discovery and data proof, and developed a cloud platform that allows users to manage the entire lifecycle of steel materials data, including management, discovery, sharing, traceability, and valuation.
  • Data management
Users can store and manage materials data within the system. When the data are submitted, it will trigger the data proof smart contract automatically, ensuring the protection of the data’s intellectual property rights. The new data page, depicted in Figure 8, allows users to input metadata, upload master data, and gain fine-grained permission control. A part of the metadata is filled in by the data owner in the table of basic information and material feature data, while others are automatically generated by the system, including ID, URL, size, time, and other details. The data details page after submission is shown in Figure 9. The data owner and authorized users can view this page and use related applications, such as data details, similar data, data tracing, data valuation, etc.
  • Similar material discovery
According to the Materials Data Discovery Protocol, users can find materials data that are similar to their own material in the InterMat network, as depicted in Figure 10. The search results include material name, node name, data type, similarity, and data value. The search results are arranged in descending order based on similarity. Data that are accessible can be viewed directly, while data that require permission can be requested. Through this module, users in this example can discover various data related to 9Ni steel from different organizations, such as experimental data, production line data, calculation data, and standard data.
  • Data traceability and valuation
In the InterMat system, data users can request, share, and exchange data, while data owners can expose and modify data. These actions are recorded in the blockchain and are difficult to alter. On the data details page, there are traceability information and data valuation modules, as shown in Figure 11. By clicking the Trace button, three parts of data flow information are displayed. The first part includes basic data information, such as data digest, signature, and public key. The second part contains the traceability information of data modifications, recording the data signature after each modification using Data-chain technology. The third part includes data transmission information, which records the exchange, reference, and other details using Behavior-chain technology. Based on the popularity and evaluation of the data, the system calculates their value according to the model, which is also displayed on this page.

5. Evaluation and Discussion

Verified through practical application cases, the approach in this paper effectively addresses the issues related to data sharing, including sharing technology and driving force. And it possesses the following features:
  • Materials data scene adaptation.
In this paper, we start from the characteristics of materials data through sorting out the material metadata, building the data identifier and parsing method, and establishing the standard data set of materials data, and we manage the data standardization of multi-source heterogeneous data. In addition, we quantify the value change of data in the sharing process through the value model, which promoted the sharing of materials data.
  • High security of the system.
This approach uploads metadata onto a blockchain, while the master data are stored locally or in a private cloud, thus achieving controllability of the master data. Data version traceability and copyright traceability can be realized through the design of Datachain and Behaviorchain so that data copyright can be well protected.
  • Reasonable data sharing incentives.
The system sets up a data valuation model and evaluates data value and node value through interactive data, which promotes the sharing of high-quality data.
  • High scalability.
For materials data, through the metadata setting of different materials, the data can be expanded from steel to non-ferrous alloys, polymer materials, and other materials. For the network structure, through the hierarchical design of the nodes, industry chains, and InterMat, multi-organization node data can be added, and the scalability of the data source can be realized.

6. Conclusions

The establishment of an infrastructure for sharing materials data must not only facilitate efficient communication between different entities, but also incorporate effective measures for copyright protection and incentivize data sharing. This article examines the shortcomings of traditional data sharing solutions and proposes a blockchain-based solution for the discovery and sharing of materials data. By leveraging the unique features of blockchain technology, this solution addresses the challenges that traditional approaches struggle to overcome, thereby facilitating the comprehensive collection of multidimensional materials data. InterMat provides an inner product encryption approach for data discovery, which offers a level of privacy protection for the data. However, with the growing volume of shared material data and the need for enhanced privacy protection, it is necessary to investigate additional methods of privacy protection. In the future, on the basis of materials data sharing, we can utilize privacy computing technology to apply various applications for data. In the state of data encryption, we can collaborate, analyze, and model data from different organizations. The integration of blockchain and privacy computing technologies can further safeguard data privacy, establish trust mechanisms for multi-organizational data sharing, and serve as a foundational technology for the next generation of materials data infrastructure.

Author Contributions

Conceptualization, methodology, and investigation, C.W.; validation, H.S. and L.D.; formal analysis, C.W.; resources, H.L.; data curation, L.D. and H.L.; writing—original draft preparation, C.W.; writing—review and editing, H.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Industry and Information Technology of China (Grant No.TC2208076).

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Conflicts of Interest

Author Changchang Wang is a PHD. student at Central Iron & Steel Research Institute Co., Ltd. and was employed by the company Beijing MatDao Technology Co., Ltd. Author Hang Su was employed by the company Material Digital R&D Center of China Iron and Steel Research Institute Group and Central Iron & Steel Research Institute Co., Ltd. Author Linna Duan and Author Hao Li were employed by the company Beijing MatDao Technology Co., Ltd. All the authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Wang, G.; Liu, Z.; Zhang, D.; Chu, M. Transformation and development of materials science and technology and construction of iron and steel innovation infrastructure. J. Iron Steel Res. Int. 2021, 33, 1003–1017. [Google Scholar]
  2. Himanen, L.; Geurts, A.; Foster, A.S.; Rinke, P. Data-driven materials science: Status, challenges and perspectives. Adv. Sci. 2019, 6, 1900808. [Google Scholar] [CrossRef]
  3. Agrawal, A.; Choudhary, A. Perspective: Materials informatics and big data: Realization of the “fourth paradigm” of science in materials science. APL Mater. 2016, 4, 053208. [Google Scholar] [CrossRef]
  4. Curtarolo, S.; Setyawan, W.; Hart, G.L.W.; Jahnatek, M.; Chepulskii, R.V.; Taylor, R.H.; Wang, S.; Xue, J.; Yang, K.; Levy, O.; et al. AFLOW: An automatic framework for high-throughput materials discovery. Comput. Mater. Sci. 2013, 58, 218–226. [Google Scholar] [CrossRef]
  5. Draxl, C.; Scheffler, M. NOMAD: The FAIR Concept for Big-Data-Driven Materials Science. MRS Bull. 2018, 43, 676–682. [Google Scholar] [CrossRef]
  6. The Novel Materials Discovery (NOMAD) Laboratory. Available online: https://nomad-coe.eu/ (accessed on 20 September 2023).
  7. Saal, J.E.; Kirklin, S.; Aykol, M.; Meredig, B.; Wolverton, C. Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD). JOM 2013, 65, 1501–1509. [Google Scholar] [CrossRef]
  8. Das, A.; Suman, S.; Sinha, A.K. Development of multivariate process monitoring strategy for a typical process industry. Int. J. Product. Perform. Manag. 2017, 22, 1–21. [Google Scholar] [CrossRef]
  9. Austin, T. Towards a digital infrastructure for engineering materials data. Mater. Discov. 2016, 3, 1–12. [Google Scholar] [CrossRef]
  10. Corallo, A.; Lazoi, M.; Lezzi, M. Cybersecurity in the context of industry 4.0: A structured classification of critical assets and business impacts. Comput. Ind. 2019, 114, 103165. [Google Scholar] [CrossRef]
  11. Rohr, J.; Wright, A. Blockchain-Based Token Sales, Initial Coin Offerings, and the Democratization of Public Capital Markets. Hastings Law J. 2018, 70, 463. [Google Scholar] [CrossRef]
  12. Eyal, I. Blockchain Technology: Transforming Libertarian Cryptocurrency Dreams to Finance and Banking Realities. Computer 2017, 50, 38–49. [Google Scholar] [CrossRef]
  13. Park, K.; Youm, H.-Y. Proposal of Decentralized P2P Service Model for Transfer between Blockchain-Based Heterogeneous Cryptocurrencies and CBDCs. Big Data Cogn. Comput. 2022, 6, 159. [Google Scholar] [CrossRef]
  14. Zhang, G.; Li, T.; Li, Y.; Hui, P.; Jin, D. Blockchain-Based Data Sharing System for AI-Powered Network Operations. J. Commun. Inf. Netw. 2018, 3, 1–8. [Google Scholar] [CrossRef]
  15. Chenli, C.; Tang, W.; Gomulka, F.; Jung, T. ProvNet: Networked bi-directional blockchain for data sharing with verifiable provenance. J. Parallel. Distrib. Comput. 2022, 166, 32–44. [Google Scholar] [CrossRef]
  16. Ma, X.; Wang, C.; Chen, X. Trusted Data Sharing with Flexible Access Control based on Blockchain. Comput. Stand. Interfaces 2021, 78, 103543. [Google Scholar] [CrossRef]
  17. Zheng, B.K.; Zhu, L.H.; Shen, M.; Gao, F.; Zhang, C.; Li, Y.; Yang, J. Scalable and privacy-preserving data sharing based on blockchain. J. Comput. Sci. Technol. 2018, 33, 557–567. [Google Scholar] [CrossRef]
  18. Chowdhury, M.J.M.; Colman, A.; Kabir, M.A.; Han, J.; Sarda, P. Blockchain as a Notarization Service for Data Sharing with Personal Data Store. In Proceedings of the 2018 17th IEEE International Conference on Trust, Security and Privacy in Computing and Communications/12th IEEE International Conference on Big Data Science And Engineering (TrustCom/BigDataSE), New York, NY, USA, 1 August 2018. [Google Scholar]
  19. Ugochukwu, N.A.; Goyal, S.B.; Rajawat, A.S.; Islam, S.M.N.; He, J.; Aslam, M. An Innovative Blockchain-Based Secured Logistics Management Architecture: Utilizing an RSA Asymmetric Encryption Method. Mathematics 2022, 10, 4670. [Google Scholar] [CrossRef]
  20. Liu, X.L.; Wang, W.M.; Guo, H.; Barenji, A.V.; Li, Z.; Huang, G.Q. Industrial blockchain based framework for product lifecycle management in industry 4.0. Robot. Comput.-Integr. Manuf. 2020, 63, 101897. [Google Scholar] [CrossRef]
  21. Mandolla, C.; Antonio Messeni, P.; Percoco, G.; Andrea, U. Building a digital twin for additive manufacturing through the exploitation of blockchain: A case analysis of the aircraft industry. Comput. Ind. 2019, 109, 134–152. [Google Scholar] [CrossRef]
  22. Joshi, S.; Pise, A.A.; Shrivastava, M.; Revathy, C.; Kumar, H.; Alsetoohy, O.; Akwafo, R. Adoption of blockchain technology for privacy and security in the context of industry 4.0. Wirel. Commun. Mob. Comput. 2022, 2022, 4079781. [Google Scholar] [CrossRef]
  23. Ding, K.; Fan, L.; Liu, C. Manufacturing system under I4.0 workshop based on blockchain: Research on architecture, operation mechanism and key technologies. Comput. Ind. Eng. 2021, 161, 107672. [Google Scholar] [CrossRef]
  24. Dorsala, M.R.; Sastry, V.N.; Chapram, S. Blockchain-based solutions for cloud computing: A survey. J. Netw. Comput. Appl. 2021, 196, 103246. [Google Scholar] [CrossRef]
  25. Sato, T.; Shimosawa, T.; Himura, Y. Operations Smart Contract to Realize Decentralized System Operations Workflow for Consortium Blockchain. IEICE Trans. Commun. 2022, 105, 1318–1331. [Google Scholar] [CrossRef]
  26. Henry, S.D.; Berardinis, L.A. A Review of Materials Data Infrastructure Projects. In Proceedings of the World Congress on Integrated Computational Materials Engineering: ICME 2015, Colorado Springs, CO, USA, 31 May 2015. [Google Scholar]
  27. Su, Y.; Yang, M.; Zhu, W.; Zhou, K.; Xue, D.; Wang, H.; Xie, J. Development of Key Technologies for Intelligent Research and Development of New Materials. Strat. Study CAE 2023, 25, 161–169. [Google Scholar] [CrossRef]
  28. Heery, R. Review of metadata formats. Program Electron. Libr. Inf. Syst. 1996, 30, 345–373. [Google Scholar]
  29. Harris, P.A.; Taylor, R.; Thielke, R.; Payne, J.; Conde, J.G. A metadata-driven methodology and workflow process for providing translational research informatics support. J. Biomed. Inform. 2009, 42, 337–381. [Google Scholar] [CrossRef] [PubMed]
  30. Chen, C.; Xie, X. Metadata-driven data description and searchability. Big Data Min. Comput. Data 2017, 3, 59–74. [Google Scholar]
  31. Medina-Smith, A.; Becker, C.A.; Plante, R.L.; Bartolo, L.M.; Dima, A.; Warren, J.A.; Hanisch, R.J. A Controlled Vocabulary and Metadata Schema for Materials Science Data Discovery. Data Sci. J. 2021, 20, 1–10. [Google Scholar] [CrossRef]
  32. Cross, J.O.; Opila, R.L.; Boyd, I.W.; Kaufmann, E.N. Materials characterization and the evolution of materials. MRS Bull. 2015, 40, 1019–1033. [Google Scholar] [CrossRef]
  33. Abdalla, M.; Bourse, F.; CARO, A.D.; Pointcheval, D. Simple functional encryption schemes for inner products. In Proceedings of the IACR International Workshop on Public Key Cryptography, Gaithersburg, MD, USA, 30 March 2015–1 April 2015. [Google Scholar]
  34. Kim, S.; Lewi, K.; Mandal, A.; Montgomery, H.; Roy, A.; Wu, D.J. Function-Hiding Inner Product Encryption is Practical. In Proceedings of the Arctic Crypt, Longyearbyen, Svalbard, 17–22 July 2016. [Google Scholar]
Figure 1. Schematic diagram of InterMat.
Figure 1. Schematic diagram of InterMat.
Processes 11 03168 g001
Figure 2. Architecture of InterMat.
Figure 2. Architecture of InterMat.
Processes 11 03168 g002
Figure 3. InterMat application network model.
Figure 3. InterMat application network model.
Processes 11 03168 g003
Figure 4. Schematic diagram of materials data discovery protocol.
Figure 4. Schematic diagram of materials data discovery protocol.
Processes 11 03168 g004
Figure 5. The process of materials data identification, certification, and resolution.
Figure 5. The process of materials data identification, certification, and resolution.
Processes 11 03168 g005
Figure 6. Inner product encryption calculation process.
Figure 6. Inner product encryption calculation process.
Processes 11 03168 g006
Figure 7. Data chain and behavior chain.
Figure 7. Data chain and behavior chain.
Processes 11 03168 g007
Figure 8. New data page.
Figure 8. New data page.
Processes 11 03168 g008
Figure 9. Data details page.
Figure 9. Data details page.
Processes 11 03168 g009
Figure 10. Similar material discovery page.
Figure 10. Similar material discovery page.
Processes 11 03168 g010
Figure 11. Data traceability and valuation module.
Figure 11. Data traceability and valuation module.
Processes 11 03168 g011
Table 1. The metadata table of the data source.
Table 1. The metadata table of the data source.
No.ComponentsExplanation
1SourceIDID of data source, unique identifier of materials data source
2SourceTypeData source types, including calculation software, laboratory, IoT devices, public data, and so on
3SourceNameThe specific name of the data source
4ProviderThe provider of the data source
5CollectTimeStart time of data collection
6CollectURLUnified Resource Identification for data collection
Table 2. The metadata table of the master data.
Table 2. The metadata table of the master data.
No.ComponentsExplanation
1DataIDID of data, unique identifier of materials data
2DataNameName of materials data
3DataTypeThe format of the master data file, such as text, tables, images, etc.
4UnitUnit of measurement for materials data, such as KB, MB, GB, etc.
5SizeMaterials data size
6OwnerOwner of materials data
7CreatTimeData creation time
8MaterialNameName of the material
9MaterialTypeType of the material
10MaterialFeatureMaterial feature data, as shown in Table 3
11DescriptionRemarks on materials data
Table 3. Material feature data table.
Table 3. Material feature data table.
No.ComponentsExplanation
1DataIDID of data, unique identifier of materials data
2CompositionsChemical compositions data
3PropertiesMechanical properties, physical properties, application performance, etc.
4ProcessesMolding process, heat treatment process, etc.
5StructuresCrystal structure, phase structure, organization, etc.
6DataIDID of data, unique identifier of materials data
Table 4. The structure of the data value table.
Table 4. The structure of the data value table.
No.ProjectSymbol
1The number of exchangedE
2Number of referencedR
3Positive feedbackFp
4Negative feedbackFn
5Page viewP
6Data valueV
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, C.; Su, H.; Duan, L.; Li, H. InterMat: A Blockchain-Based Materials Data Discovery and Sharing Infrastructure. Processes 2023, 11, 3168. https://doi.org/10.3390/pr11113168

AMA Style

Wang C, Su H, Duan L, Li H. InterMat: A Blockchain-Based Materials Data Discovery and Sharing Infrastructure. Processes. 2023; 11(11):3168. https://doi.org/10.3390/pr11113168

Chicago/Turabian Style

Wang, Changchang, Hang Su, Linna Duan, and Hao Li. 2023. "InterMat: A Blockchain-Based Materials Data Discovery and Sharing Infrastructure" Processes 11, no. 11: 3168. https://doi.org/10.3390/pr11113168

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop