Inference Speed Comparison Using Convolutions in Neural Networks on Various Soc Hardware Platforms Using Micropython

Inference speed comparison using convolutions in neural networks on various SoC hardware platforms using MicroPython Kristian Dokic, PhDa, Hrvoje Mikolcevicb and Bojan Radisica a Polytechnic in Pozega, Vukovarska 17, Pozega, Croatia b Tehnical school, Ratarnicka 1, Pozega, Croatia Abstract In recent years we have witnessed the rapid development of machine learning algorithms, and the same can be said for IoT. Developments in both fields have also influenced the growth of machine learning algorithms in IoT devices. The authors of a series of papers cite several reasons to argue this trend. This paper explores the possibility of using the Python programming language in different versions to create, train, and implement convolutional neural networks on two SoCs based on different architectures (ARM and RISC-V). The influence of the number of filters in the convolutional layer on the inference speed is also investigated. The number of filters has a different effect on inference speed depending on the existence of additional components that accelerate individual operations of convolutional neural networks (convolution, batch normalization, activation, and pooling operations). Keywords 1 MicroPython, SoC, CNN, Convolution Neural Network, RISC-V 1. Introduction these are energy savings, network traffic reduction, privacy problems and delays in data transmission [3]. Sakr et al. also cites similar In the last few years, the terms Internet of benefits of moving computation toward the Things and Machine learning have been edge, such as lower energy consumption, lower increasingly mentioned in the field of response latency, higher security, lower information and communication technologies. bandwidth occupancy and expected privacy [4]. Definitions of both terms are given below. Since Python is a programming language Internet of things is “a system of interrelated most often used for machine learning and very computing devices, mechanical and digital rarely for the development of IoT devices, in machines, objects, animals or people that are this paper, the aim was to compare two different provided with unique identifiers (UIDs) and the SoCs' performance using Python and ability to transfer data over a network without MicroPython in the development of machine requiring human-to-human or human-to- learning models and IoT devices. The C/C++ computer interaction” [1]. Machine learning “is programming language is most often used to the study of computer algorithms that improve develop IoT devices, but if a version of Python automatically through experience” [2]. were used for this task, all phases of a machine Powerful computers are most often required learning model development for IoT devices to implement machine learning, and IoT would be significantly accelerated and devices are often low in processing power. simplified. Several authors have stated that there are few Two development boards that hit the market advantages of transferring part of machine a year ago based on different architectures were learning data processing from large and cloud used in this paper, and their inference computers to IoT devices. Zhang et al. state that Proccedings of RTA-CSIT 2021, May 2021, Tirana, Albania EMAIL: [email protected] (Kristian Dokic); [email protected] (Hrvoje Mikolcevic); [email protected] (Bojan Radisic) © 2021 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings (CEUR-WS.org) performance was tested with a different number low-cost ambient monitor device based on of filters in the first layer of the convolutional ESP8266 SoC. They used a DHT22 neural network used to detect handwritten temperature and humidity sensor and OLED digits. Easy use of convolutional neural 128x64 display [14]. networks on microcontrollers has been possible Khamphroo et al. introduced a mobile in the last year or two, so there is not much educational robot based on MicroPython. The information about microcontrollers' development's primary goal is to create a robot performance on these tasks. A convolution that will be easy to use for beginners. neural network is a deep neural network class STM32L432KC ARM Cortex-M4 168 MHz usually used for image processing, was used in the presented project [15] [16]. classification, and other similar tasks. Tariq et al. used MicroPython for noise filtering on an STM32F10RB platform to implement an 2. Literature review early warning seismic event detection algorithm [17]. Zhang et al. used MicroPython in their quadruped robot project that deals with The first part of this section provides an the twisting trunk's effect on the tumble overview of papers in which the authors used stability from an energy viewpoint. Their MicroPython to solve various problems. The development board was based on STM32F405 second part provides an analysis of the used [18]. Wie et al. used MicroPython in their hardware platforms. wearable bio-signal processing system. Devices were based on the STM32F722 microcontroller 2.1. MicroPython [19]. Ibba et al. developed an impedance analyzer for fruit quality monitoring based on MicroPython is a programming language the STM32L486 microcontroller. They also partially compatible with Python 3. It is used MicroPython for microcontroller optimized to run on microcontrollers, and it programming [20]. Crepaldi et al. developed includes various modules for low-level the body channel communication system used hardware access. It was released in 2014. and it for landmark identification. They used is ported on various platforms (ARM Cortex- MicroPython and STM32L486 microcontroller M, RISC-V, ESP32, ESP8266, PIC, STM32) [21]. [5] [6]. It is also available on BBC micro:bit Bahmanian et al. used MicroPython in their development board [7] [8]. Finally, a new wide-band frequency synthesizer that can lock Raspberry product Raspberry Pi Pico can be on any of the optical reference harmonics used with MicroPython [9]. It is evident that between 2 GHz and 20 GHz. They used the MicroPython has become quite popular in a FE310 microcontroller based on RISC-V to short time, and it has been ported to various control the DA converters [22]. platforms without the significant influence of At the end of the MicroPython part, we large companies. should mention the Arduino IDE, which is MicroPython is used in various IoT projects. undoubtedly more popular than MicroPython, Regnath et al. used MicroPython on ESP32 but it also has some drawbacks. Kodali et al. platform to test their approach to verify data compared these two platforms by a series of integrity and consensus on a blockchain used on features. They state that the difference is in the IoT platforms [10]. Cardenas et al. used language type since MicroPython is a scripting MicroPython in Low-Cost and Low-Power language, while Arduino C code needs to be Messaging System. Their solution was based on compiled. According to them, MicroPython is ESP32 and LoRa Wireless Technology [11]. simpler, and the development is 5-10 times Tzounis et al., in their review of the IoT faster because there is no compiling. The syntax platform in agriculture, also mention is cleaner, and the code is more readable in MicroPython as a programming language for MicroPython [14]. Tanganelli et al. have the LoPy development platform based on reviewed the available platforms for rapid ESP32 SoC [12]. LoPy development board and prototyping of IoT solutions from a developer MicroPython have been used by Sayed et al. in perspective. In addition to the Arduino IDE and their multi-sensing platform for the ORCA Hub MicroPython, they listed several other integrated with Robot Operating System [13]. platforms and development tools: FreeRTOS, Kodali et al. used MicroPython to develop a mbedOS, Zephyr, Contiki OS, RIOT OS and Zerynth. They also cite perhaps the biggest Microprocessor’s characteristics drawback of the MicroPython, which is the STM32H743I Kendryte significantly smaller number of development I K210 boards that support it. The Arduino can be used Producer STM Canaan Inc. on over 1000 development boards, while the Bit Width 32 bit 64 bit MicroPython is supported by twenty. Their CPU clock 480 MHz 400 MHz paper is from 2019, and the situation may have RAM 1+32 MB 6 MB changed somewhat [23]. FLASH 2+32 MB 16 MB 2.2. MicroPython hardware on edge In the previously presented paper review, MicroPython-enabled platforms are grouped into three individual sections. The first section contains papers that use ESP32 and ESP8266 SoC. These are Espressif Inc. products based on Tensilica LX106 and LX6 processors. The second section lists papers that use different SoCs from STMicroelectronics, while the third section includes paper that uses SoCs based on the RISC-V architecture. The official GitHub page of the Figure 1: OpenMV Cam H7 Plus MicroPython project lists development boards that come with MicroPython installed. There are currently 35 development boards on the list, primarily based on the STM32F family of microcontrollers [24]. 3. Method The first part of this section gives the characteristics of the used development boards and microcontrollers with MicroPython installed. The second part of the section describes creating a model of a convolutional neural network and converting the model into Figure 2: Sipeed Maixduino formats that can be used on SoC with MicroPython. The Development board in Figure 2 is a Sipeed Maixduino Kit for RISC-V AI + IoT, and it is based on the Kendryte K210 based on 3.1. Used hardware supported by RISC-V instruction set architecture. The MicroPython development board's price was $ 23.90, but only one K210 costs $ 8.64. The paper analyzes two development boards based on different architectures. The development board in Figure 1 is an OpenMV 3.2. CNN model development and Cam H7 Plus and is based on the STM32H743II deployment ARM Cortex M7 processor. The development board's price was € 99.00, but only one STM32H743II processor costs € 11.61. Their The MNIST database of handwritten digits characteristics are listed in Table 1.

Load more