ML Model Access and Collaborative Training through Decentralized Data Contribution

Show simple item record

dc.contributor.author Islam, Md Samin Yasar
dc.contributor.author Khan, Arafat Kabir
dc.contributor.author Noor, Agni Otulonio
dc.date.accessioned 2025-03-10T08:39:54Z
dc.date.available 2025-03-10T08:39:54Z
dc.date.issued 2024-06-05
dc.identifier.citation [1] U. Agarwal, V. Rishiwal, S. Tanwar, R. Chaudhary, G. Sharma, P. N. Bokoro, and R. Sharma, “Blockchain technology for secure supply chain management: A comprehensive review,” IEEE Access, vol. 10, pp. 85 493–85 517, 2022. [2] H. Al Housani, J. Baek, and C. Y. Yeun, “Survey on certificateless public key cryptography,” in 2011 International Conference for Internet Technology and Se cured Transactions, 2011, pp. 53–58. [3] M. S. Ali, M. Vecchio, M. Pincheira, K. Dolui, F. Antonelli, and M. H. Rehmani, “Applications of blockchains in the internet of things: A comprehensive sur vey,” IEEE Communications Surveys Tutorials, vol. 21, no. 2, pp. 1676–1717, 2019. [4] C. Aristidou and E. Marcou, “Blockchain standards and government applica tions,” Journal of ICT Standardization, vol. 7, no. 3, pp. 287–312, 2019. [5] M. N. M. Bhutta, A. A. Khwaja, A. Nadeem, H. F. Ahmad, M. K. Khan, M. A. Hanif, H. Song, M. Alshamari, and Y. Cao, “A survey on blockchain technology: Evolution, architecture and security,” IEEE Access, vol. 9, pp. 61 048–61 073, 2021. [6] R. Boutaba, M. A. Salahuddin, N. Limam, S. Ayoubi, N. Shahriar, F. Estrada Solano, and O. M. Caicedo, “A comprehensive survey on machine learning for networking: Evolution, applications, and research opportunities,” Journal of In ternet Services and Applications, vol. 9, no. 1, p. 16, Jun. 2018. [7] V. Buterin, “A next-generation smart contract and decentralized application platform,” white paper, vol. 3, no. 37, pp. 2–1, 2014. [8] W. Choi and J. W.-K. Hong, “Performance evaluation of ethereum private and testnet networks using hyperledger caliper,” in Proceedings of the 22nd Asia Pacific Network Operations and Management Symposium (APNOMS), IEEE, Tainan, Taiwan, 2021, pp. 325–329. [9] K. Christidis and M. Devetsikiotis, “Blockchains and smart contracts for the internet of things,” IEEE Access, vol. 4, pp. 2292–2303, 2016. 51 [10] L. W. Cong and Z. He, “Blockchain disruption and smart contracts,” Review of Financial Studies, vol. 32, no. 5, pp. 1754–1797, 2019. [11] M. S. Farooq, Z. Kalim, J. N. Qureshi, S. Rasheed, and A. Abid, “A blockchain based framework for distributed agile software development,” IEEE Access, vol. 10, pp. 17 977–17 995, 2022. [12] J. Feng, F. R. Yu, Q. Pei, X. Chu, J. Du, and L. Zhu, “Cooperative computation of floading and resource allocation for blockchain-enabled mobile-edge comput ing: A deep reinforcement learning approach,” IEEE Internet of Things Journal, vol. 7, no. 7, pp. 6214–6228, 2019. [13] M. Goldblum, D. Tsipras, C. Xie, X. Chen, A. Schwarzschild, D. Song, A. Mądry, B. Li, and T. Goldstein, “Dataset security for machine learning: Data poison ing, backdoor attacks, and defenses,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 2, pp. 1563–1580, 2022. [14] W. Gu, J. Li, and Z. Tang, “A survey on consensus mechanisms for blockchain technology,” in 2021 International Conference on Artificial Intelligence, Big Data and Algorithms (CAIBDA), 2021, pp. 46–49. [15] E. Guerra, F. Wilhelmi, M. Miozzo, and P. Dini, The cost of training machine learning models over distributed data sources, 2023. arXiv: 2209.07124 [cs.LG]. [16] J. D. Harris and B. Waggoner, “Decentralized and collaborative ai on blockchain,” in Proceedings of the IEEE International Conference on Blockchain (Blockchain), Atlanta, USA, 2019, pp. 368–375. [17] L. Junaid, K. Bilal, J. Shuja, A. O. Balogun, and J. J. P. C. Rodrigues, “Blockchain enabled framework for transparent land lease and mortgage management,” IEEE Access, vol. 12, pp. 54 005–54 018, 2024. [18] V. Y. Kemmoe, W. Stone, J. Kim, D. Kim, and J. Son, “Recent advances in smart contracts: A technical overview and state of the art,” IEEE Access, vol. 8, pp. 117 782– 117 801, 2020. [19] A. B. Kurtulmus and K. Daniel, “Trustless machine learning contracts; evalu ating and exchanging machine learning models on the ethereum blockchain,” arXiv preprint arXiv:1802.10185, 2018. [20] A. B. Kurtulmus and K. Daniel, Trustless machine learning contracts; evaluat ing and exchanging machine learning models on the ethereum blockchain, 2018. arXiv: 1802.10185 [cs.CR]. [21] P. Lawhale and S. Kale, “A survey on secure architectures using hash function based on fpga for block chain enabled iot devices,” in 2023 11th International 52 Conference on Emerging Trends in Engineering Technology - Signal and Infor mation Processing (ICETET - SIP), 2023, pp. 1–6. [22] V.-D. Le, T.-C. Bui, and W.-S. Li, “Efficient ml lifecycle transferring for large scale and high-dimensional data via core set-based dataset similarity,” IEEE Access, vol. 11, pp. 73 823–73 838, 2023. [23] H. Liu, X. Luo, H. Liu, and X. Xia, “Merkle tree: A fundamental component of blockchains,” in 2021 International Conference on Electronic Information Engi neering and Computer Science (EIECS), 2021, pp. 556–561. [24] A. L. Maas, R. E. Daly, P. T. Pham, D. Huang, A. Y. Ng, and C. Potts, “Learning word vectors for sentiment analysis,” in Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technolo gies, D. Lin, Y. Matsumoto, and R. Mihalcea, Eds., Portland, Oregon, USA: As sociation for Computational Linguistics, 2011, pp. 142–150. [25] A. Marathe, K. Narayanan, A. Gupta, and M. P.R., “Dinemmo: Decentralized incentivization for enterprise marketplace models,” in Proceedings of the IEEE 25th International Conference on High Performance ComputingWorkshops (HiPCW), Bengaluru, India, 2018, pp. 95–100. [26] M. M. Merlec and H. P. In, “Sc-caac: A smart-contract-based context-aware ac cess control scheme for blockchain-enabled iot systems,” IEEE Internet of Things Journal, vol. 11, no. 11, pp. 19 866–19 881, 2024. [27] D. Naidu, B. Wanjari, R. Bhojwani, S. Suchak, R. Baser, and N. K. Ray, “Effi cient smart contract for privacy preserving authentication in blockchain using zero knowledge proof,” in 2023 OITS International Conference on Information Technology (OCIT), 2023, pp. 969–974. [28] S. Nakamoto, “Bitcoin: A peer-to-peer electronic cash system,” Self-published, 2008, https://bitcoin.org/bitcoin.pdf. [29] A. Qayyum, J. Qadir, M. Bilal, and A. Al-Fuqaha, “Secure and robust machine learning for healthcare: A survey,” IEEE Reviews in Biomedical Engineering, vol. 14, pp. 156–180, 2021. [30] S. M. Sajjadi Mohammadabadi, L. Yang, F. Yan, and J. Zhang, “Communication efficient training workload balancing for decentralized multi-agent learning,” in 2024 IEEE 44th International Conference on Distributed Computing Systems (ICDCS), 2024, pp. 680–691. [31] P. Sakul-Ung, H. Ketmaneechairat, and M. Maliyaem, “Overmind, a collabora tive decentralized machine learning framework, the interpretation of network 53 behaviour,” in 2021 Research, Invention, and Innovation Congress: Innovation Electricals and Electronics (RI2C), 2021, pp. 288–292. [32] M. K. Sharma, G. K, and A. Kumari, “Studying the performance of digital signa ture algorithms in information security models for networking applications,” in 2024 2nd International Conference on Artificial Intelligence and Machine Learn ing Applications Theme: Healthcare and Internet of Things (AIMLA), 2024, pp. 1– 6. [33] S. M. Tahsien, H. Karimipour, and P. Spachos, “Machine learning based solu tions for security of internet of things (iot): A survey,” Journal of Network and Computer Applications, vol. 161, p. 102 630, 2020. [34] D. Tapscott and A. Tapscott, Blockchain Revolution: How the Technology Behind Bitcoin is Changing Money, Business, and the World. Baltimore, MD, USA: Pen guin, 2016. [35] O. Ural and K. Yoshigoe, “Survey on blockchain-enhanced machine learning,” IEEE Access, vol. 11, pp. 145 331–145 362, 2023. [36] Q. Wang, Y. Guo, X. Wang, T. Ji, L. Yu, and P. Li, “Ai at the edge: Blockchain empowered secure multiparty learning with heterogeneous models,” IEEE In ternet of Things Journal, vol. 7, no. 10, pp. 9600–9610, 2020. [37] D. Winkler, M. Sabou, S. Petrovic, G. Carneiro, M. Kalinowski, and S. Biffl, “Im proving model inspection with crowdsourcing,” in 2017 IEEE/ACM 4th Inter national Workshop on CrowdSourcing in Software Engineering (CSI-SE), 2017, pp. 30–34. [38] W. Yang, E. Aghasian, S. Garg, D. Herbert, L. Disiuta, and B. Kang, “A survey on blockchain-based internet service architecture: Requirements, challenges, trends, and future,” IEEE Access, vol. 7, pp. 75 845–75 872, 2019. [39] D. Yu, Z. Xie, Y. Yuan, S. Chen, J. Qiao, Y. Wang, Y. Yu, Y. Zou, and X. Zhang, “Trustworthy decentralized collaborative learning for edge intelligence: A sur vey,” High-Confidence Computing, vol. 3, no. 3, p. 100 150, 2023 en_US
dc.identifier.uri http://hdl.handle.net/123456789/2377
dc.description Supervised by Dr. Md. Azam Hossain, Associate Professor, Department of Computer Science and Engineering (CSE) Islamic University of Technology (IUT) Board Bazar, Gazipur, Bangladesh This thesis is submitted in partial fulfillment of the requirement for the degree of Bachelor of Science in Software Engineering, 2024 en_US
dc.description.abstract Machine learning has been at the heart of various industries by enabling systems to learn from data and improve their performance over time. Its applications span a wide range of fields, from healthcare to finance, entertainment to transportation and into more specialized domains like Natural Language Processing, sentiment analysis and so on. However, conventional machine learning (ML) approaches suffer few fundamental problems, such as the requirement for frequent retraining of the machine learning model to keep it updated, the extensive use of private datasets and financial concerns to use the model for inference. The need for frequent training is one of the major financial burden for preparing the ML model. Cost associated with frequently constructing the dataset and then training the model using it poses quite the finan- cial challenge. Besides that, private datasets raise questions about how the model was trained. In addition to this, third parties interested to study or improve the existing model cannot do so due to the lack of access to the dataset. Finally, companies or the end users of the model also have to bear expenses in order to actually use the model for inference. Blockchain technology can come in useful to tackle these issues. Blockchain is a distributed ledger system that maintains a serial of transactions that are unchangeable or more accurately, immutable. After every transaction occurs, it is validated by "miners" or network participants and recorded as a "block" of data within the blockchain. These blocks document the precise time and sequence of transactions. They are securely linked together, ensuring that no block can be altered or inserted between existing blocks. As new blocks are added, they reinforce the validity of the preceding blocks, thereby strengthening the integrity of the entire blockchain. Blockchain is therefore highly secured and trustable due to its robust design and the distributed nature of the technology also makes it accessible and transparent to everyone. The objective of this study is to enhance trust, transparency, and community engagement in the development of machine learning models through the utilisation of decentralised computing and collaborative learning and this is where blockchain aids in this research. This study evaluates a Naive Bayes classifier using a decentralised approach and compares its outcomes to those of a Sparse Perceptron model from previous research. Using Hyperledger Calliper shows that the Naive Bayes model outperforms the Sparse Perceptron model in terms of throughput, average latency, and peak latency. The study's findings demonstrate that the area of machine learning could be enhanced by adopting a decentralised, shared, and collaborative approach. en_US
dc.language.iso en en_US
dc.publisher Department of Computer Science and Engineering(CSE), Islamic University of Technology(IUT), Board Bazar, Gazipur-1704, Bangladesh en_US
dc.subject Blockchain, Sparse Perceptron, Naive Bayes, Hyperledger Caliper en_US
dc.title ML Model Access and Collaborative Training through Decentralized Data Contribution en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search IUT Repository


Advanced Search

Browse

My Account

Statistics