Example and usage

To keep things simple, the following rules were followed during development:

  • deel-lip follows the keras package structure.

  • All elements (layers, activations, initializers, …) are compatible with the standard keras elements.

  • When a k-Lipschitz layer overrides a standard keras layer, it uses the same interface and the same parameters. The only difference is a new parameter that controls the Lipschitz constant of the layer (see the sketch after this list).
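
For example, SpectralDense is used exactly like keras' Dense. The snippet below is a minimal sketch, not taken from the library's documentation; it only illustrates the extra k_coef_lip parameter mentioned above.

from tensorflow.keras.layers import Dense
from deel.lip.layers import SpectralDense

# Standard keras layer: no control over the Lipschitz constant.
dense = Dense(64, activation="relu")

# deel-lip drop-in replacement: same interface, plus the optional
# k_coef_lip parameter that sets the Lipschitz constant of the layer.
lip_dense = SpectralDense(64, activation="relu", k_coef_lip=1.0)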

Which layers are safe to use?

The following table indicates which layers are safe to use in a Lipschitz network, and which are not.

| layer | 1-Lipschitz? | deel-lip equivalent | comments |
|---|---|---|---|
| Dense | no | SpectralDense, FrobeniusDense | SpectralDense and FrobeniusDense are similar when there is a single output. |
| Conv2D | no | SpectralConv2D, FrobeniusConv2D | SpectralConv2D also implements Björck normalization. |
| MaxPooling, GlobalMaxPooling | yes | n/a | |
| AveragePooling2D, GlobalAveragePooling2D | no | ScaledAveragePooling2D, ScaledGlobalAveragePooling2D | The Lipschitz constant is bounded by sqrt(pool_h * pool_w); see the sketch after the table. |
| Flatten | yes | n/a | |
| Dropout | no | None | The Lipschitz constant is bounded by the dropout factor. |
| BatchNorm | no | None | We suspect that layer normalization already limits internal covariate shift. |
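
For instance, AveragePooling2D is simply replaced by its scaled counterpart. The snippet below is a minimal sketch; it assumes ScaledAveragePooling2D is exposed in deel.lip.layers like the other layers and accepts the same arguments as AveragePooling2D.

from deel.lip.layers import SpectralConv2D, ScaledAveragePooling2D
from deel.lip.model import Sequential
from tensorflow.keras.layers import Input

# ScaledAveragePooling2D takes the same arguments as AveragePooling2D,
# but rescales its output so the pooling stays 1-Lipschitz.
pooled = Sequential([
    Input(shape=(28, 28, 1)),
    SpectralConv2D(filters=16, kernel_size=(3, 3), padding='same'),
    ScaledAveragePooling2D(pool_size=(2, 2)),
])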

How to use it?

Here is a simple example showing how to build a 1-Lipschitz network:

from deel.lip.initializers import BjorckInitializer
from deel.lip.layers import SpectralDense, SpectralConv2D
from deel.lip.model import Sequential
from deel.lip.activations import PReLUlip
from tensorflow.keras.layers import Input, Lambda, Flatten, MaxPool2D
from tensorflow.keras import backend as K
from tensorflow.keras.optimizers import Adam

# Sequential (resp. Model) from deel.lip.model has the same properties as any Lipschitz
# layer (condense, setting of the Lipschitz factor, etc.). It acts only as a container.
model = Sequential(
    [
        Input(shape=(28, 28)),
        Lambda(lambda x: K.reshape(x, (-1, 28, 28, 1))),

        # Lipschitz layers preserve the API of their superclass (here Conv2D).
        # An optional parameter, k_coef_lip, controls the Lipschitz constant of the layer.
        SpectralConv2D(
            filters=32, kernel_size=(3, 3), padding='same',
            activation=PReLUlip(), data_format='channels_last',
            kernel_initializer=BjorckInitializer(15, 50)),
        SpectralConv2D(
            filters=32, kernel_size=(3, 3), padding='same',
            activation=PReLUlip(), data_format='channels_last',
            kernel_initializer=BjorckInitializer(15, 50)),
        MaxPool2D(pool_size=(2, 2), data_format='channels_last'),

        SpectralConv2D(
            filters=64, kernel_size=(3, 3), padding='same',
            activation=PReLUlip(), data_format='channels_last',
            kernel_initializer=BjorckInitializer(15, 50)),
        SpectralConv2D(
            filters=64, kernel_size=(3, 3), padding='same',
            activation=PReLUlip(), data_format='channels_last',
            kernel_initializer=BjorckInitializer(15, 50)),
        MaxPool2D(pool_size=(2, 2), data_format='channels_last'),

        Flatten(),
        SpectralDense(256, activation="relu", kernel_initializer=BjorckInitializer(15, 50)),
        SpectralDense(10, activation="softmax"),
    ],
    k_coef_lip=0.5,
    name='testing'
)

optimizer = Adam(learning_rate=0.001)
model.compile(loss='categorical_crossentropy',
              optimizer=optimizer,
              metrics=['accuracy'])
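
The compiled model can then be trained like any keras model. The snippet below is a hypothetical usage example; the MNIST loading and one-hot encoding come from standard tensorflow.keras utilities, not from deel-lip.

from tensorflow.keras.datasets import mnist
from tensorflow.keras.utils import to_categorical

# Load MNIST and one-hot encode the labels to match categorical_crossentropy.
(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0
y_train, y_test = to_categorical(y_train, 10), to_categorical(y_test, 10)

model.fit(x_train, y_train, batch_size=64, epochs=5,
          validation_data=(x_test, y_test))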

See the deel.lip package documentation for a complete API description.