An implementation of a variational autoencoder that runs on both SMILES and
SELFIES. Included is code that compares the SMILES and SELFIES representations
for a VAE using reconstruction quality, diversity, and latent space validity
as metrics of interest.
Dependencies are pytorch
, rdkit
, and pyyaml
, which can be installed
using Conda.
chemistry_vae.py
: the main file; contains the model definitions, settings.yml
: a file containing the hyperparameters of the data_loader.py
: contains helper methods that convert SMILES and SELFIESCPU and GPU supported
For comments, bug reports or feature ideas, please send an email to
mario.krenn@utoronto.ca and alan@aspuru.com