Skip to content

Latest commit

 

History

History

simple_chemical_space

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

Molecular Similarity in a Chemical Space

Script adapted and developed to plot the molecular similarity in a chemical space.

This script has been adapted from iwatobipen, according to the papper written by Atsushi Yoshimori et.al.

I strongly advise to check iwatobipen github and blog if you are interested on chemical space, chemical similarity and other cheminformatics topics.

Libraries used

  • pandas - a Python package that provides fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive.

  • NumPy - the fundamental package for array computing with Python

  • RDKit - Open source toolkit for cheminformatics

  • scikit-learn - Machine Learning in Python.

  • Matplotlib - a comprehensive library for creating static, animated, and interactive visualizations in Python.

  • seaborn - a Python data visualization library based on matplotlib.

Libraries were used in a Miniconda3 environment using python 3.6.13

Instalation

Miniconda3: Installation

pandas:

conda install -c anaconda pandas

numpy

conda install -c anaconda numpy

RDKit

conda install -c rdkit rdkit

scikit-learn

conda install -c anaconda scikit-learn

Matplotlib

conda install -c conda-forge matplotlib

seaborn

conda install -c anaconda seaborn

How to use

  • Download the code and unzip it on the desirable directory

To run use the following command:

python chemical_space.py

Observations

This script has been elaborated using as references the following articles and codes:

The dataset used as example was made publically available by the Government of India as a part of their Drug Discovery Hackathon and is found at the following link:

At fingeprints.txt is a list of key fingerprints to test on the data.

Authorship

Social preview original photo by Brenda Ferrari (brendaferrari)