Package ‘ruta’

March 18, 2019

Title Implementation of Unsupervised Neural Architectures
Version 1.1.0
Description Implementation of several unsupervised neural networks, from building their architecture to their training and evaluation. Available networks are auto-encoders including their main variants: sparse, contractive, denoising, robust and variational, as described in Charte et al. (2018).
License GPL (>= 3) | file LICENSE

URL https://github.com/fdavidcl/ruta

BugReports https://github.com/fdavidcl/ruta/issues
Depends R (>= 3.2)
Imports graphics (>= 3.2.3), keras (>= 2.2.4), purrr (>= 0.2.4), R.utils (>= 2.7.0), stats (>= 3.2.3), utils
Suggests knitr, magrittr (>= 1.5), rmarkdown, testthat (>= 2.0.0)
VignetteBuilder knitr
Encoding UTF-8
LazyData true
RoxygenNote 6.1.1
SystemRequirements Python (>= 2.7); keras (>= 2.1)
NeedsCompilation no
Author David Charte [aut, cre], Francisco Charte [aut], Francisco Herrera [aut]
Maintainer David Charte
Repository CRAN
Date/Publication 2019-03-18 13:10:02 UTC


R topics documented:

+.ruta_network
add_weight_decay
apply_filter.ruta_noise_zeros
as_loss
as_network
autoencode
autoencoder
autoencoder_contractive
autoencoder_denoising
autoencoder_robust
autoencoder_sparse
autoencoder_variational
contraction
conv
correntropy
decode
dense
dropout
encode
encoding_index
evaluate_mean_squared_error
evaluation_metric
generate.ruta_autoencoder_variational
input
is_contractive
is_denoising
is_robust
is_sparse
is_trained
is_variational
layer_keras
loss_variational
make_contractive
make_denoising
make_robust
make_sparse
new_autoencoder
new_layer
new_network
noise
noise_cauchy
noise_gaussian
noise_ones
noise_saltpepper
noise_zeros
output

plot.ruta_network
print.ruta_autoencoder
reconstruct
save_as
sparsity
to_keras
to_keras.ruta_autoencoder
to_keras.ruta_filter
to_keras.ruta_layer_input
to_keras.ruta_layer_variational
to_keras.ruta_loss_contraction
to_keras.ruta_network
to_keras.ruta_sparsity
to_keras.ruta_weight_decay
train.ruta_autoencoder
variational_block
weight_decay
[.ruta_network


+.ruta_network Add layers to a network/Join networks

Description Add layers to a network/Join networks

Usage ## S3 method for class 'ruta_network' e1 + e2

## S3 method for class 'ruta_network' c(...)

Arguments e1 First network e2 Second network ... networks or layers to be concatenated

Value Network combination

Examples
network <- input() + dense(30) + output("sigmoid")
another <- c(input(), dense(30), dense(3), dense(30), output())

add_weight_decay Add weight decay to any autoencoder

Description Adds a weight decay regularization to the encoding of a given autoencoder

Usage add_weight_decay(learner, decay = 0.02)

Arguments learner The "ruta_autoencoder" object decay Numeric value indicating the amount of decay

Value An autoencoder object which contains the weight decay
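
Examples
# A minimal sketch; the decay value is illustrative
learner <- add_weight_decay(autoencoder(c(36)), decay = 0.01)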

apply_filter.ruta_noise_zeros Apply filters

Description Apply a filter to input data, generally a noise filter in order to train a denoising autoencoder. Users won’t generally need to use these functions.

Usage ## S3 method for class 'ruta_noise_zeros' apply_filter(filter, data, ...)

## S3 method for class 'ruta_noise_ones' apply_filter(filter, data, ...)

## S3 method for class 'ruta_noise_saltpepper' apply_filter(filter, data, ...)

## S3 method for class 'ruta_noise_gaussian'

apply_filter(filter, data, ...)

## S3 method for class 'ruta_noise_cauchy' apply_filter(filter, data, ...)

apply_filter(filter, data, ...)

Arguments

filter Filter object to be applied data Input data to be filtered ... Other parameters

See Also autoencoder_denoising
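
Examples
# Sketch: manually apply a zero-masking filter to a data matrix
# (normally done internally when training a denoising autoencoder)
x <- as.matrix(iris[, 1:4])
noisy_x <- apply_filter(noise_zeros(0.1), x)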

as_loss Coercion to ruta_loss

Description

Generic function to coerce objects into loss objects.

Usage as_loss(x)

## S3 method for class 'character' as_loss(x)

## S3 method for class 'ruta_loss' as_loss(x)

Arguments

x Object to be converted into a loss

Value

A "ruta_loss" construct 6 autoencode

as_network Coercion to ruta_network

Description Generic function to coerce objects into networks.

Usage as_network(x)

## S3 method for class 'ruta_layer' as_network(x)

## S3 method for class 'ruta_network' as_network(x)

## S3 method for class 'numeric' as_network(x)

## S3 method for class 'integer' as_network(x)

Arguments x Object to be converted into a network

Value A "ruta_network" construct

Examples net <- as_network(c(784, 1000, 32))

autoencode Automatically compute an encoding of a data matrix

Description Trains an autoencoder adapted to the data and extracts its encoding for the same data matrix.

Usage autoencode(data, dim, type = "basic", activation = "linear", epochs = 20)

Arguments data Numeric matrix to be encoded dim Number of variables to be used in the encoding type Type of autoencoder to use: "basic", "sparse", "contractive", "denoising", "robust" or "variational" activation Activation type to be used in the encoding layer. Some available activations are "tanh", "sigmoid", "relu", "elu" and "selu" epochs Number of times the data will traverse the autoencoder to update its weights

Value Matrix containing the encodings

See Also autoencoder

Examples inputs <- as.matrix(iris[, 1:4])

# Train a basic autoencoder and generate a 2-variable encoding
encoded <- autoencode(inputs, 2)

# Train a contractive autoencoder with tanh activation
encoded <- autoencode(inputs, 2, type = "contractive", activation = "tanh")

autoencoder Create an autoencoder learner

Description Represents a generic autoencoder network.

Usage autoencoder(network, loss = "mean_squared_error")

Arguments network Layer construct of class "ruta_network" or coercible loss A "ruta_loss" object or a character string specifying a loss function

Value A construct of class "ruta_autoencoder"

References • A practical tutorial on autoencoders for nonlinear feature fusion

See Also train.ruta_autoencoder Other autoencoder variants: autoencoder_contractive, autoencoder_denoising, autoencoder_robust, autoencoder_sparse, autoencoder_variational

Examples

# Basic autoencoder with a network of [input]-256-36-256-[input] and
# no nonlinearities
autoencoder(c(256, 36), loss = "binary_crossentropy")

# Customizing the activation functions in the same network
network <- input() + dense(256, "relu") + dense(36, "tanh") + dense(256, "relu") + output("sigmoid")

learner <- autoencoder( network, loss = "binary_crossentropy" )

autoencoder_contractive Create a contractive autoencoder

Description A contractive autoencoder adds a penalty term to the loss function of a basic autoencoder which attempts to induce a contraction of data in the latent space.

Usage autoencoder_contractive(network, loss = "mean_squared_error", weight = 2e-04)

Arguments network Layer construct of class "ruta_network" loss Character string specifying the reconstruction error part of the loss function weight Weight assigned to the contractive loss

Value A construct of class "ruta_autoencoder"

References • A practical tutorial on autoencoders for nonlinear feature fusion

See Also Other autoencoder variants: autoencoder_denoising, autoencoder_robust, autoencoder_sparse, autoencoder_variational, autoencoder
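
Examples
# Sketch: contractive autoencoder over an explicit network; the weight is illustrative
network <- input() + dense(36, "tanh") + output("sigmoid")
learner <- autoencoder_contractive(network, weight = 2e-04)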

autoencoder_denoising Create a denoising autoencoder

Description A denoising autoencoder trains with noisy data in order to create a model able to reduce noise in reconstructions from input data

Usage autoencoder_denoising(network, loss = "mean_squared_error", noise_type = "zeros", ...)

Arguments network Layer construct of class "ruta_network" loss Loss function to be optimized noise_type Type of data corruption which will be used to train the autoencoder, as a character string. Available types: • "zeros" Randomly set components to zero (noise_zeros) • "ones" Randomly set components to one (noise_ones) • "saltpepper" Randomly set components to zero or one (noise_saltpepper) • "gaussian" Randomly offset each component of an input as drawn from Gaussian distributions with the same variance (additive Gaussian noise, noise_gaussian) • "cauchy" Randomly offset each component of an input as drawn from Cauchy distributions with the same scale (additive Cauchy noise, noise_cauchy)

... Extra parameters to customize the noisy filter: • p The probability that each feature in the input data will be altered by random noise (for "zeros", "ones" and "saltpepper") • var or sd The variance or standard deviation of the Gaussian distribution from which additive noise will be drawn (for "gaussian", only one of these parameters is necessary) • scale Scale of the Cauchy distribution (for "cauchy")

Value A construct of class "ruta_autoencoder"

References • Extracting and composing robust features with denoising autoencoders

See Also Other autoencoder variants: autoencoder_contractive, autoencoder_robust, autoencoder_sparse, autoencoder_variational, autoencoder
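
Examples
# Sketch: denoising autoencoder trained against salt-and-pepper corruption
network <- input() + dense(36, "tanh") + output("sigmoid")
learner <- autoencoder_denoising(network, noise_type = "saltpepper", p = 0.05)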

autoencoder_robust Create a robust autoencoder

Description A robust autoencoder uses a special objective function, correntropy, a localized similarity measure which makes it less sensitive to noise in data. Correntropy specifically measures the probability density that two events are equal, and is less affected by outliers than the mean squared error.

Usage autoencoder_robust(network, sigma = 0.2)

Arguments network Layer construct of class "ruta_network" sigma Sigma parameter in the kernel used for correntropy

Value A construct of class "ruta_autoencoder"

References • Robust feature learning by stacked autoencoder with maximum correntropy criterion

See Also

Other autoencoder variants: autoencoder_contractive, autoencoder_denoising, autoencoder_sparse, autoencoder_variational, autoencoder
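
Examples
# Sketch: robust autoencoder; sigma is the correntropy kernel parameter
network <- input() + dense(36, "tanh") + output("sigmoid")
learner <- autoencoder_robust(network, sigma = 0.2)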

autoencoder_sparse Sparse autoencoder

Description

Creates a representation of a sparse autoencoder.

Usage autoencoder_sparse(network, loss = "mean_squared_error", high_probability = 0.1, weight = 0.2)

Arguments

network Layer construct of class "ruta_network" loss Character string specifying a loss function high_probability Expected probability of the high value of the encoding layer. Set this to a value near zero in order to minimize activations in that layer. weight The weight of the sparsity regularization

Value

A construct of class "ruta_autoencoder"

References

• Sparse deep belief net model for visual area V2 • Andrew Ng, Sparse Autoencoder. CS294A Lecture Notes

See Also

sparsity, make_sparse, is_sparse Other autoencoder variants: autoencoder_contractive, autoencoder_denoising, autoencoder_robust, autoencoder_variational, autoencoder
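
Examples
# Sketch: sparse autoencoder with a low expected activation in the encoding layer
network <- input() + dense(36, "sigmoid") + output("sigmoid")
learner <- autoencoder_sparse(network, high_probability = 0.1, weight = 0.2)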

autoencoder_variational Build a variational autoencoder

Description A variational autoencoder assumes that a latent, unobserved random variable produces the observed data and attempts to approximate its distribution. This function constructs a wrapper for a variational autoencoder using a Gaussian distribution as the prior of the latent space.

Usage autoencoder_variational(network, loss = "binary_crossentropy", auto_transform_network = TRUE)

Arguments network Network architecture as a "ruta_network" object (or coercible) loss Reconstruction error to be combined with KL divergence in order to compute the variational loss auto_transform_network Boolean: convert the encoding layer into a variational block if none is found?

Value A construct of class "ruta_autoencoder"

References • Auto-Encoding Variational Bayes • Under the Hood of the Variational Autoencoder (in Prose and Code) • Keras example: Variational autoencoder

See Also Other autoencoder variants: autoencoder_contractive, autoencoder_denoising, autoencoder_robust, autoencoder_sparse, autoencoder

Examples network <- input() + dense(256, "elu") + variational_block(3) + dense(256, "elu") + output("sigmoid")

learner <- autoencoder_variational(network, loss = "binary_crossentropy")

contraction Contractive loss

Description

This is a wrapper for a loss which induces a contraction in the latent space.

Usage contraction(reconstruction_loss = "mean_squared_error", weight = 2e-04)

Arguments reconstruction_loss Original reconstruction error to be combined with the contractive loss (e.g. "binary_crossentropy") weight Weight assigned to the contractive loss

Value

A loss object which can be converted into a Keras loss

See Also autoencoder_contractive Other loss functions: correntropy, loss_variational

conv Create a convolutional layer

Description

Wrapper for a convolutional layer. The dimensions of the operation are inferred from the shape of the input data. This shape must follow the pattern (batch_shape, x, [y, [z, ]], channel) where dimensions y and z are optional, and channel will be either 1 for grayscale images or generally 3 for colored ones.

Usage conv(filters, kernel_size, padding = "same", max_pooling = NULL, average_pooling = NULL, upsampling = NULL, activation = "linear")

Arguments filters Number of filters learned by the layer kernel_size Integer or list of integers indicating the size of the weight matrices to be convolved with the image padding One of "valid" or "same" (case-insensitive). See layer_conv_2d for more details max_pooling NULL or an integer indicating the reduction ratio for a max pooling operation after the convolution average_pooling NULL or an integer indicating the reduction ratio for an average pooling operation after the convolution upsampling NULL or an integer indicating the augmentation ratio for an upsampling operation after the convolution activation Optional, string indicating activation function (linear by default)

Value A construct with class "ruta_network"

See Also Other neural layers: dense, dropout, input, layer_keras, output, variational_block

Examples
# Sample convolutional autoencoder
net <- input() + conv(16, 3, max_pooling = 2, activation = "relu") + conv(8, 3, max_pooling = 2, activation = "relu") + conv(8, 3, upsampling = 2, activation = "relu") + conv(16, 3, upsampling = 2, activation = "relu") + conv(1, 3, activation = "sigmoid")

correntropy Correntropy loss

Description A wrapper for the correntropy loss function

Usage correntropy(sigma = 0.2)

Arguments sigma Sigma parameter in the kernel

Value A "ruta_loss" object

See Also autoencoder_robust Other loss functions: contraction, loss_variational

decode Retrieve decoding of encoded data

Description Extracts the decoding calculated by a trained autoencoder for the specified data.

Usage decode(learner, data)

Arguments learner Trained autoencoder model data data.frame to be decoded

Value Matrix containing the decodings

See Also encode, reconstruct

dense Create a fully-connected neural layer

Description Wrapper for a dense/fully-connected layer.

Usage dense(units, activation = "linear")

Arguments

units Number of units activation Optional, string indicating activation function (linear by default)

Value

A construct with class "ruta_network"

See Also

Other neural layers: conv, dropout, input, layer_keras, output, variational_block

Examples

dense(30, "tanh")

dropout Dropout layer

Description

Randomly sets a fraction rate of input units to 0 at each update during training time, which helps prevent overfitting.

Usage

dropout(rate = 0.5)

Arguments

rate The fraction of affected units

Value

A construct of class "ruta_network"

See Also

Other neural layers: conv, dense, input, layer_keras, output, variational_block
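
Examples
# Sketch: a network with dropout after the first hidden layer
net <- input() + dense(100, "relu") + dropout(0.3) + output("sigmoid")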

encode Retrieve encoding of data

Description Extracts the encoding calculated by a trained autoencoder for the specified data.

Usage encode(learner, data)

Arguments learner Trained autoencoder model data data.frame to be encoded

Value Matrix containing the encodings

See Also decode, reconstruct

encoding_index Get the index of the encoding

Description Calculates the index of the middle layer of an encoder-decoder network.

Usage encoding_index(net)

Arguments net A network of class "ruta_network"

Value Index of the middle layer
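
Examples
# Sketch: the middle layer of this five-layer network has index 3
net <- input() + dense(128) + dense(16) + dense(128) + output()
encoding_index(net)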

evaluate_mean_squared_error Evaluation metrics

Description Performance evaluation metrics for autoencoders

Usage evaluate_mean_squared_error(learner, data, ...)

evaluate_mean_absolute_error(learner, data, ...)

evaluate_binary_crossentropy(learner, data, ...)

evaluate_binary_accuracy(learner, data, ...)

evaluate_kullback_leibler_divergence(learner, data, ...)

Arguments learner A trained learner object data Test data for evaluation ... Additional parameters passed to keras::evaluate.

Value A named list with the autoencoder training loss and evaluation metric for the given data

See Also evaluation_metric

Examples library(purrr)

x <- as.matrix(iris[, 1:4])
# shuffle the rows so both partitions cover all species
# (sample() on a data frame would permute its columns, not its rows)
x <- x[sample(nrow(x)), ]
x_train <- x[1:100, ]
x_test <- x[101:150, ]

autoencoder(2) %>% train(x_train) %>% evaluate_mean_squared_error(x_test)

evaluation_metric Custom evaluation metrics

Description Create a different evaluation metric from a valid Keras metric

Usage evaluation_metric(evaluate_f)

Arguments evaluate_f Must be either a metric function defined by Keras (e.g. keras::metric_binary_crossentropy) or a valid function for Keras to create a performance metric (see metric_binary_accuracy for details)

Value A function which can be called with parameters learner and data just like the ones defined in evaluate.

See Also evaluate

generate.ruta_autoencoder_variational Generate samples from a generative model

Description Generate samples from a generative model

Usage ## S3 method for class 'ruta_autoencoder_variational' generate(learner, dimensions = c(1, 2), from = 0.05, to = 0.95, side = 10, fixed_values = 0.5, ...)

generate(learner, ...)

Arguments

learner Trained learner object
dimensions Indices of the dimensions over which the model will be sampled
from Lower limit on the values which will be passed to the inverse CDF of the prior
to Upper limit on the values which will be passed to the inverse CDF of the prior
side Number of steps to take in each traversed dimension
fixed_values Value used as parameter for the inverse CDF of all non-traversed dimensions
... Unused

See Also

autoencoder_variational
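
Examples
# Sketch: train a small variational autoencoder and sample a grid over its
# 2-dimensional latent space (data scaled to [0, 1] for binary crossentropy)
x <- as.matrix(iris[, 1:4])
x <- apply(x, 2, function(col) (col - min(col)) / (max(col) - min(col)))
learner <- train(autoencoder_variational(c(16, 2)), x, epochs = 10)
samples <- generate(learner, dimensions = c(1, 2), side = 5)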

input Create an input layer

Description

This layer acts as a placeholder for input data. The number of units is not needed as it is deduced from the data during training.

Usage

input()

Value

A construct with class "ruta_network"

See Also

Other neural layers: conv, dense, dropout, layer_keras, output, variational_block

is_contractive Detect whether an autoencoder is contractive

Description Detect whether an autoencoder is contractive

Usage is_contractive(learner)

Arguments learner A "ruta_autoencoder" object

Value Logical value indicating if a contractive loss was found

See Also contraction, autoencoder_contractive, make_contractive

is_denoising Detect whether an autoencoder is denoising

Description Detect whether an autoencoder is denoising

Usage is_denoising(learner)

Arguments learner A "ruta_autoencoder" object

Value Logical value indicating if a noise generator was found

See Also noise, autoencoder_denoising, make_denoising 22 is_sparse

is_robust Detect whether an autoencoder is robust

Description Detect whether an autoencoder is robust

Usage is_robust(learner)

Arguments learner A "ruta_autoencoder" object

Value Logical value indicating if a correntropy loss was found

See Also correntropy, autoencoder_robust, make_robust

is_sparse Detect whether an autoencoder is sparse

Description Detect whether an autoencoder is sparse

Usage is_sparse(learner)

Arguments learner A "ruta_autoencoder" object

Value Logical value indicating if a sparsity regularization in the encoding layer was found

See Also sparsity, autoencoder_sparse, make_sparse is_trained 23

is_trained Detect trained models

Description Inspects a learner and figures out whether it has been trained

Usage is_trained(learner)

Arguments learner Learner object

Value A boolean

See Also train

is_variational Detect whether an autoencoder is variational

Description Detect whether an autoencoder is variational

Usage is_variational(learner)

Arguments learner A "ruta_autoencoder" object

Value Logical value indicating if a variational loss was found

See Also autoencoder_variational

layer_keras Custom layer from Keras

Description Gets any layer available in Keras with the specified parameters

Usage layer_keras(type, ...)

Arguments type The name of the layer, e.g. "activity_regularization" for a keras::layer_activity_regularization object ... Named parameters for the Keras layer constructor

Value A wrapper for the specified layer, which can be combined with other Ruta layers

See Also Other neural layers: conv, dense, dropout, input, output, variational_block

loss_variational Variational loss

Description Specifies an evaluation function adapted to the variational autoencoder. It combines a base reconstruction error and the Kullback-Leibler divergence between the learned distribution and the true latent posterior.

Usage loss_variational(reconstruction_loss)

Arguments reconstruction_loss Another loss to be used as reconstruction error (e.g. "binary_crossentropy")

Value A "ruta_loss" object

References

• Auto-Encoding Variational Bayes • Under the Hood of the Variational Autoencoder (in Prose and Code) • Keras example: Variational autoencoder

See Also

autoencoder_variational Other loss functions: contraction, correntropy
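
Examples
# Sketch: a variational loss over a binary crossentropy reconstruction error,
# as used internally by autoencoder_variational
loss <- loss_variational("binary_crossentropy")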

make_contractive Add contractive behavior to any autoencoder

Description

Converts an autoencoder into a contractive one by assigning a contractive loss to it

Usage

make_contractive(learner, weight = 2e-04)

Arguments

learner The "ruta_autoencoder" object weight Weight assigned to the contractive loss

Value

An autoencoder object which contains the contractive loss

See Also

autoencoder_contractive
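
Examples
# Sketch: convert a basic autoencoder into a contractive one; the weight is illustrative
learner <- make_contractive(autoencoder(c(36)), weight = 2e-04)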

make_denoising Add denoising behavior to any autoencoder

Description Converts an autoencoder into a denoising one by adding a filter for the input data

Usage make_denoising(learner, noise_type = "zeros", ...)

Arguments learner The "ruta_autoencoder" object noise_type Type of data corruption which will be used to train the autoencoder, as a character string. See autoencoder_denoising for details ... Extra parameters to customize the noisy filter. See autoencoder_denoising for details

Value An autoencoder object which contains the noisy filter

See Also autoencoder_denoising

make_robust Add robust behavior to any autoencoder

Description Converts an autoencoder into a robust one by assigning a correntropy loss to it. Notice that this will replace the previous loss function

Usage make_robust(learner, sigma = 0.2)

Arguments learner The "ruta_autoencoder" object sigma Sigma parameter in the kernel used for correntropy

Value

An autoencoder object which contains the correntropy loss

See Also

autoencoder_robust
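
Examples
# Sketch: replace the loss of a basic autoencoder with correntropy
learner <- make_robust(autoencoder(c(36)), sigma = 0.2)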

make_sparse Add sparsity regularization to an autoencoder

Description

Add sparsity regularization to an autoencoder

Usage

make_sparse(learner, high_probability = 0.1, weight = 0.2)

Arguments

learner A "ruta_autoencoder" object high_probability Expected probability of the high value of the encoding layer. Set this to a value near zero in order to minimize activations in that layer. weight The weight of the sparsity regularization

Value

The same autoencoder with the sparsity regularization applied

See Also

sparsity, autoencoder_sparse, is_sparse
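
Examples
# Sketch: add a sparsity regularization to a basic autoencoder
learner <- make_sparse(autoencoder(c(36)), high_probability = 0.1)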

new_autoencoder Create an autoencoder learner

Description Internal function to create autoencoder objects. Instead, consider using autoencoder.

Usage new_autoencoder(network, loss, extra_class = NULL)

Arguments network Layer construct of class "ruta_network" or coercible loss A "ruta_loss" object or a character string specifying a loss function extra_class Vector of classes in case this autoencoder needs to support custom methods (for to_keras, train, generate or others)

Value A construct of class "ruta_autoencoder"

new_layer Layer wrapper constructor

Description Constructor function for layers. You shouldn’t generally need to use this. Instead, consider using individual functions such as dense.

Usage new_layer(cl, ...)

Arguments cl Character string specifying class of layer (e.g. "ruta_layer_dense"), which will be used to call the corresponding methods ... Other parameters (usually units, activation)

Value A construct with class "ruta_layer" new_network 29

Examples

my_layer <- new_layer("dense", 30, "tanh")

# Equivalent: my_layer <- dense(30, "tanh")[[1]]

new_network Sequential network constructor

Description

Constructor function for networks composed of several sequentially placed layers. You shouldn’t generally need to use this. Instead, consider concatenating several layers with +.ruta_network.

Usage

new_network(...)

Arguments

... Zero or more objects of class "ruta_layer"

Value

A construct with class "ruta_network"

Examples

my_network <- new_network( new_layer("input", 784, "linear"), new_layer("dense", 32, "tanh"), new_layer("dense", 784, "sigmoid") )

# Instead, consider using
my_network <- input() + dense(32, "tanh") + output("sigmoid")

noise Noise generator

Description

Delegates to noise classes to generate noise of some type

Usage noise(type, ...)

Arguments

type Type of noise, as a character string ... Parameters for each noise class
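
Examples
# Sketch: these two calls should build equivalent noise filters
noise("zeros", p = 0.1)
noise_zeros(p = 0.1)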

noise_cauchy Additive Cauchy noise

Description

A data filter which adds noise from a Cauchy distribution to instances

Usage noise_cauchy(scale = 0.005)

Arguments

scale Scale for the Cauchy distribution

Value

Object which can be applied to data with apply_filter

See Also

Other noise generators: noise_gaussian, noise_ones, noise_saltpepper, noise_zeros

noise_gaussian Additive Gaussian noise

Description A data filter which adds Gaussian noise to instances

Usage noise_gaussian(sd = NULL, var = NULL)

Arguments sd Standard deviation for the Gaussian distribution var Variance of the Gaussian distribution (optional, only used if sd is not provided)

Value Object which can be applied to data with apply_filter

See Also Other noise generators: noise_cauchy, noise_ones, noise_saltpepper, noise_zeros

noise_ones Filter to add ones noise

Description A data filter which replaces some values with ones

Usage noise_ones(p = 0.05)

Arguments p Probability that a feature in an instance is set to one

Value Object which can be applied to data with apply_filter

See Also Other noise generators: noise_cauchy, noise_gaussian, noise_saltpepper, noise_zeros 32 noise_zeros

noise_saltpepper Filter to add salt-and-pepper noise

Description A data filter which replaces some values with zeros or ones

Usage noise_saltpepper(p = 0.05)

Arguments p Probability that a feature in an instance is set to zero or one

Value Object which can be applied to data with apply_filter

See Also Other noise generators: noise_cauchy, noise_gaussian, noise_ones, noise_zeros

noise_zeros Filter to add zero noise

Description A data filter which replaces some values with zeros

Usage noise_zeros(p = 0.05)

Arguments p Probability that a feature in an instance is set to zero

Value Object which can be applied to data with apply_filter

See Also Other noise generators: noise_cauchy, noise_gaussian, noise_ones, noise_saltpepper output 33

output Create an output layer

Description This layer acts as a placeholder for the output layer in an autoencoder. The number of units is not needed as it is deduced from the data during training.

Usage output(activation = "linear")

Arguments activation Optional, string indicating activation function (linear by default)

Value A construct with class "ruta_network"

See Also Other neural layers: conv, dense, dropout, input, layer_keras, variational_block

plot.ruta_network Draw a neural network

Description Draw a neural network

Usage ## S3 method for class 'ruta_network' plot(x, ...)

Arguments x A "ruta_network" object ... Additional parameters for style. Available parameters: • bg: Color for the text over layers • fg: Color for the background of layers • log: Use logarithmic scale 34 print.ruta_autoencoder

Examples

net <- input() + dense(1000, "relu") + dropout() + dense(100, "tanh") + dense(1000, "relu") + dropout() + output("sigmoid")
plot(net, log = TRUE, fg = "#30707a", bg = "#e0e6ea")

print.ruta_autoencoder Inspect Ruta objects

Description Inspect Ruta objects

Usage ## S3 method for class 'ruta_autoencoder' print(x, ...)

## S3 method for class 'ruta_loss_named' print(x, ...)

## S3 method for class 'ruta_loss' print(x, ...)

## S3 method for class 'ruta_network' print(x, ...)

Arguments x An object ... Unused

Value Invisibly returns the same object passed as parameter

Examples

print(autoencoder(c(256, 10), loss = correntropy()))

reconstruct Retrieve reconstructions for input data

Description Extracts the reconstructions calculated by a trained autoencoder for the specified input data after encoding and decoding. predict is an alias for reconstruct.

Usage reconstruct(learner, data)

## S3 method for class 'ruta_autoencoder' predict(object, ...)

Arguments learner Trained autoencoder model data data.frame to be passed through the network object Trained autoencoder model ... Rest of parameters, unused

Value Matrix containing the reconstructions

See Also encode, decode

save_as Save and load Ruta models

Description Functions to save a trained or untrained Ruta learner into a file and load it

Usage save_as(learner, file = paste0(substitute(learner), ".tar.gz"), dir, compression = "gzip")

load_from(file)

Arguments learner The "ruta_autoencoder" object to be saved file In save, filename with extension (usually .tar.gz) where the object will be saved. In load, path to the saved model dir Directory where to save the file. Use "." to save in the current working directory or tempdir() to use a temporary one compression Type of compression to be used, for R function tar

Value save_as returns the filename where the model has been saved, load_from returns the loaded model as a "ruta_autoencoder" object

Examples library(purrr)

x <- as.matrix(iris[, 1:4])

# Save a trained model
saved_file <- autoencoder(2) %>% train(x) %>% save_as("my_model.tar.gz", dir = tempdir())

# Load and use the model
encoded <- load_from(saved_file) %>% encode(x)

sparsity Sparsity regularization

Description Sparsity regularization

Usage sparsity(high_probability, weight)

Arguments high_probability Expected probability of the high value of the encoding layer. Set this to a value near zero in order to minimize activations in that layer. weight The weight of the sparsity regularization

Value A Ruta regularizer object for the sparsity, to be inserted in the encoding layer.

References • Sparse deep belief net model for visual area V2 • Andrew Ng, Sparse Autoencoder. CS294A Lecture Notes

See Also autoencoder_sparse, make_sparse, is_sparse

to_keras Convert a Ruta object into Keras objects and functions

Description Generic function which uses the Keras API to build objects out of Ruta wrappers

Usage to_keras(x, ...)

Arguments x Object to be converted ... Remaining parameters depending on the method

to_keras.ruta_autoencoder Extract Keras models from an autoencoder wrapper

Description Extract Keras models from an autoencoder wrapper

Usage ## S3 method for class 'ruta_autoencoder' to_keras(learner, encoder_end = "encoding", decoder_start = "encoding", weights_file = NULL)

## S3 method for class 'ruta_autoencoder_variational' to_keras(learner, ...)

Arguments learner Object of class "ruta_autoencoder". Needs to have a member input_shape indicating the number of attributes of the input data encoder_end Name of the Keras layer where the encoder ends decoder_start Name of the Keras layer where the decoder starts weights_file The name of a hdf5 weights file in order to load from a trained model ... Additional parameters for to_keras.ruta_autoencoder

Value A list with several Keras models:

• autoencoder: model from the input layer to the output layer • encoder: model from the input layer to the encoding layer • decoder: model from the encoding layer to the output layer

See Also autoencoder

to_keras.ruta_filter Get a Keras generator from a data filter

Description Noise filters can be applied during training (in denoising autoencoders); for this, a generator is used to get data batches.

Usage ## S3 method for class 'ruta_filter' to_keras(x, data, batch_size, ...)

Arguments x Filter object data Matrix where the filter will be applied batch_size Size of the sample (for the training stage) ... Additional parameters, currently unused

to_keras.ruta_layer_input Convert Ruta layers into Keras layers

Description Convert Ruta layers into Keras layers

Usage ## S3 method for class 'ruta_layer_input' to_keras(x, input_shape, ...)

## S3 method for class 'ruta_layer_dense' to_keras(x, input_shape, model = keras::keras_model_sequential(), ...)

## S3 method for class 'ruta_layer_conv' to_keras(x, input_shape, model = keras::keras_model_sequential(), ...)

## S3 method for class 'ruta_layer_custom' to_keras(x, input_shape, model = keras::keras_model_sequential(), ...)

Arguments x The layer object input_shape Number of features in training data ... Unused model Keras model where the layer will be added

Value A Layer object from Keras

to_keras.ruta_layer_variational Obtain a Keras block of layers for the variational autoencoder

Description This block contains two dense layers representing the mean and log variance of a Gaussian distribution and a sampling layer.

Usage ## S3 method for class 'ruta_layer_variational' to_keras(x, input_shape, model = keras::keras_model_sequential(), ...)

Arguments x The layer object input_shape Number of features in training data model Keras model where the layers will be added ... Unused

Value A Layer object from Keras

References • Auto-Encoding Variational Bayes • Under the Hood of the Variational Autoencoder (in Prose and Code) • Keras example: Variational autoencoder

to_keras.ruta_loss_contraction Obtain a Keras loss

Description Builds the Keras loss function corresponding to a name

Usage ## S3 method for class 'ruta_loss_contraction' to_keras(x, learner, ...)

## S3 method for class 'ruta_loss_correntropy' to_keras(x, ...)

## S3 method for class 'ruta_loss_variational' to_keras(x, learner, ...)

## S3 method for class 'ruta_loss_named' to_keras(x, ...)

Arguments x A "ruta_loss_named" object learner The learner object including the keras model which will use the loss function ... Rest of parameters, ignored

Value A function which returns the corresponding loss for given true and predicted values

References • Contractive loss: Deriving Contractive Autoencoder and Implementing it in Keras

• Correntropy loss: Robust feature learning by stacked autoencoder with maximum correntropy criterion

• Variational loss: – Auto-Encoding Variational Bayes – Under the Hood of the Variational Autoencoder (in Prose and Code) – Keras example: Variational autoencoder

to_keras.ruta_network Build a Keras network

Description Build a Keras network

Usage ## S3 method for class 'ruta_network' to_keras(x, input_shape)

Arguments x A "ruta_network" object input_shape The length of each input vector (number of input attributes)

Value A list of Keras Tensor objects with an attribute "encoding" indicating the index of the encoding layer

to_keras.ruta_sparsity Translate sparsity regularization to Keras regularizer

Description Translate sparsity regularization to Keras regularizer

Usage ## S3 method for class 'ruta_sparsity' to_keras(x, activation)

Arguments x Sparsity object activation Name of the activation function used in the encoding layer

Value Function which can be used as activity regularizer in a Keras layer

References • Sparse deep belief net model for visual area V2 • Andrew Ng, Sparse Autoencoder. CS294A Lecture Notes (2011)

to_keras.ruta_weight_decay Obtain a Keras weight decay

Description Builds the Keras regularizer corresponding to the weight decay

Usage ## S3 method for class 'ruta_weight_decay' to_keras(x, ...)

Arguments x A "ruta_weight_decay" object ... Rest of parameters, ignored

train.ruta_autoencoder Train a learner object with data

Description This function compiles the neural network described by the learner object and trains it with the input data.

Usage ## S3 method for class 'ruta_autoencoder' train(learner, data, validation_data = NULL, metrics = NULL, epochs = 20, optimizer = keras::optimizer_rmsprop(), ...)

train(learner, ...)

Arguments learner A "ruta_autoencoder" object data Training data: columns are attributes and rows are instances validation_data Additional numeric data matrix which will not be used for training but the loss measure and any metrics will be computed against it metrics Optional list of metrics which will evaluate the model but won’t be optimized. See keras::compile epochs The number of times data will pass through the network optimizer The optimizer to be used in order to train the model, can be any optimizer object defined by Keras (e.g. keras::optimizer_adam()) ... Additional parameters for keras::fit. Some useful parameters: • batch_size The number of examples to be grouped for each gradient update. Use a smaller batch size for more frequent weight updates or a larger one for faster optimization. • shuffle Whether to shuffle the training data before each epoch, defaults to TRUE

Value Same autoencoder passed as parameter, with trained internal models

See Also autoencoder

Examples

# Minimal example ======

iris_model <- train(autoencoder(2), as.matrix(iris[, 1:4]))

# Simple example with MNIST ======

library(keras)

# Load and normalize MNIST
mnist <- dataset_mnist()
x_train <- array_reshape(mnist$train$x, c(dim(mnist$train$x)[1], 784))
x_train <- x_train / 255.0
x_test <- array_reshape(mnist$test$x, c(dim(mnist$test$x)[1], 784))
x_test <- x_test / 255.0

# Autoencoder with layers: 784-256-36-256-784
learner <- autoencoder(c(256, 36), "binary_crossentropy")
train(
  learner,
  x_train,
  epochs = 1,
  optimizer = "rmsprop",
  batch_size = 64,
  validation_data = x_test,
  metrics = list("binary_accuracy")
)

variational_block Create a variational block of layers

Description

This variational block consists of two dense layers, which take the previous layer as input, and a sampling layer. More specifically, the dense layers aim to represent the mean and the log variance of the learned distribution in a variational autoencoder.

Usage

variational_block(units, epsilon_std = 1, seed = NULL)

Arguments

units Number of units epsilon_std Standard deviation for the normal distribution used for sampling seed A seed for the random number generator. Setting a seed is required if you want to save the model and be able to load it correctly

Value

A construct with class "ruta_layer"

See Also

autoencoder_variational Other neural layers: conv, dense, dropout, input, layer_keras, output

Examples

variational_block(3)

weight_decay Weight decay

Description

A wrapper that describes a weight decay regularization of the encoding layer

Usage

weight_decay(decay = 0.02)

Arguments

decay Numeric value indicating the amount of decay

Value

A regularizer object containing the set parameters

[.ruta_network Access subnetworks of a network

Description Access subnetworks of a network

Usage ## S3 method for class 'ruta_network' net[index]

Arguments net A "ruta_network" object index An integer vector of indices of layers to be extracted

Value A "ruta_network" object containing the specified layers.

Examples (input() + dense(30))[2] long <- input() + dense(1000) + dense(100) + dense(1000) + output() short <- long[c(1, 3, 5)] Index
