Implementation of Model Free Training for End-to-End Communication Systems

Implemented model based and model free auto-encoder based end-to-end communication system for AWGN and Rayleigh Block Fading (RBF) channels as given in the paper, "Model-Free Training of End-to-End Communication Systems" - Fay¸cal Ait Aoudia and Jakob Hoydis,.

The authors in the paper “Model-Free Training of End-to-End Communication Systems” consider auto-encoder based architecture for the entire transmitter and receiver model with the channel between the encoder (transmitter) and decoder (receiver). The following is the brief explaination of the paper.

  • Here, model-free refers to the communication model where the channel is unknown or has non-differentiable components. Hence the encoder (transmitter) cann’t be trained through backpropagation.

  • The objective was to train the autoencoder such that the decoder is able to detect the transmitted messages that are transmitted through the channel.

  • To train the autoencoder, the authors propose an alternating algorithm where the decoder (receiver) and the encoder (transmitter) are trained separately.

  • Since the encoder cann’t be trained as the channel can be non-differentiable, approximate gradient of loss function is used to train the encoder (transmitter).

Neural Network Architecture:

  • Input: One-hot encoded messages that are transmitted through AWGN/ Rayleigh Block Fading (RBF) channel after encoding.

  • Encoder: 2 layers of fully connected network with elu as the activation function with normalization to satisfy the power constraints. The output of the encoder is the symbol that is transmitted.

  • Decoder: 2 layers of fully connected network with elu as the activation function for the $1^{st}$ layer and softmax as the activation function for the $2^{nd}$ layer. Input to the decoder is the noisy version of the transmitted symbol.

  • Output: Predict the transmitted symbol using the received symbol.

  • Loss Function: Categorical cross-entropy

  • Optimizer: Adam

  • For each training iteration, the receiver and the transmitter are trained alternatively.

My replication of certain results of the mentioned research paper can be found here.