Multilayer perceptron architecture optimization using parallel computing techniques

Wilson Castro; Jimy Oblitas; Roberto Santa-Cruz; Himer Avila-George

doi:10.1371/journal.pone.0189369

Multilayer perceptron architecture optimization using parallel computing techniques

PLoS One. 2017 Dec 13;12(12):e0189369. doi: 10.1371/journal.pone.0189369. eCollection 2017.

Authors

Wilson Castro¹, Jimy Oblitas^{2

3}, Roberto Santa-Cruz⁴, Himer Avila-George⁵

Affiliations

¹ Facultad de Ingeniería, Universidad Privada del Norte, Cajamarca, Peru.
² Centro de Investigaciones e Innovaciones de la Agroindustria Peruana, Amazonas, Peru.
³ Escuela de Doctorado, Departamento de Tecnología de Alimentos, Universidad de Lleida, Lleida, Spain.
⁴ Facultad de Ingeniería de Sistemas y Mecánica Eléctrica, Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Peru.
⁵ Unidad de Transferencia Tecnológica Tepic, CONACYT-CICESE, Tepic, Nayarit, Mexico.

Abstract

The objective of this research was to develop a methodology for optimizing multilayer-perceptron-type neural networks by evaluating the effects of three neural architecture parameters, namely, number of hidden layers (HL), neurons per hidden layer (NHL), and activation function type (AF), on the sum of squares error (SSE). The data for the study were obtained from quality parameters (physicochemical and microbiological) of milk samples. Architectures or combinations were organized in groups (G1, G2, and G3) generated upon interspersing one, two, and three layers. Within each group, the networks had three neurons in the input layer, six neurons in the output layer, three to twenty-seven NHL, and three AF (tan-sig, log-sig, and linear) types. The number of architectures was determined using three factorial-type experimental designs, which reached 63, 2 187, and 50 049 combinations for G1, G2 and G3, respectively. Using MATLAB 2015a, a logical sequence was designed and implemented for constructing, training, and evaluating multilayer-perceptron-type neural networks using parallel computing techniques. The results show that HL and NHL have a statistically relevant effect on SSE, and from two hidden layers, AF also has a significant effect; thus, both AF and NHL can be evaluated to determine the optimal combination per group. Moreover, in the three study groups, it is observed that there is an inverse relationship between the number of processors and the total optimization time.

Publication types

Validation Study

MeSH terms

Algorithms
Animals
Cattle
Computers
Milk
Models, Neurological*
Neural Networks, Computer*

Grants and funding

The author(s) received no specific funding for this work.