MAYA

Multiple ActivitY Analyzer (MAYA) is an open-source Python tool designed to automate the generation of chemical multiverses for comprehensive analysis of structure-activity and structure-property relationships (or associations). MAYA integrates multiple molecular representations, including structural descriptors (e.g., MACCS keys, ECFP4, ECFP6, and MAP4), physicochemical molecular descriptors, and biological descriptors, to construct diverse chemical spaces. These spaces are visualized through interactive 2D plots, enabling deeper insights into molecular characteristics and their relationships with activities or properties.

MAYA is user-friendly and supports various input file formats (CSV, TSV, XLSX, JSON, and XML), requiring only a dataset with SMILES notation, molecular identifiers, and associated activity/property values. Users can customize the analysis by selecting specific descriptors and dimensionality reduction techniques (e.g., PCA) through simple parameter settings.

The script consists of a function that automates the following key processes:

Data curation: Ensures high-quality data by validating and preprocessing molecular datasets.
Descriptor calculation: Computes structural, physicochemical, and biological descriptors for molecular characterization.
Tanimoto similarity calculation: Quantifies molecular similarity to support chemical space analysis.
Dimensionality reduction: Applies techniques like PCA and t-SNE to reduce complexity while preserving meaningful patterns.
2D interactive visualization: Generates customizable, interactive plots displaying molecular structures, PCA-derived variability, SMILES notation, and user-defined visual attributes (e.g., point size, shape, color palette, and transparency).
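
To make the Tanimoto similarity step above concrete, here is a minimal standard-library sketch. It is not MAYA's implementation: the fingerprints are represented as hypothetical sets of on-bit indices rather than real MACCS or ECFP vectors, and the function names `tanimoto` and `similarity_matrix` are illustrative only.

```python
from itertools import combinations


def tanimoto(bits_a: set[int], bits_b: set[int]) -> float:
    """Tanimoto coefficient between two fingerprints given as sets of on-bit indices."""
    if not bits_a and not bits_b:
        return 1.0  # convention chosen here for two empty fingerprints
    shared = len(bits_a & bits_b)
    return shared / (len(bits_a) + len(bits_b) - shared)


def similarity_matrix(fingerprints: list[set[int]]) -> list[list[float]]:
    """Symmetric pairwise Tanimoto matrix with ones on the diagonal."""
    n = len(fingerprints)
    matrix = [[1.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        matrix[i][j] = matrix[j][i] = tanimoto(fingerprints[i], fingerprints[j])
    return matrix


# Hypothetical on-bit sets standing in for real fingerprints.
toy_fps = [{1, 4, 7, 9}, {1, 4, 8}, {2, 3}]
sim = similarity_matrix(toy_fps)
print(sim[0][1])  # 2 shared bits / 5 bits in the union = 0.4
```

A matrix of this kind is what a dimensionality reduction method would then project into two dimensions for plotting.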

MAYA's visualizations are designed to enhance interpretability, offering researchers a clear and interactive way to explore chemical spaces. The tool is particularly suited for chemoinformatics applications, such as drug discovery, where understanding complex structure-activity relationships is critical.

Here you can find more detailed information about how MAYA works.

How to use MAYA?

Important

It is essential to ensure your dataset contains the following information:

SMILES notation: Molecular representation
Identifier: Unique molecule identifiers
Activity or property values: Quantitative or categorical data for analysis
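
Before running the tool, it can help to confirm that a file actually carries these three pieces of information. The following is a minimal sketch for CSV input using only the standard library. It is not part of MAYA: the helper names are hypothetical, and the column names are simply the ones used in the usage example of this README, so adjust them to your own dataset.

```python
import csv
import io

# Column names borrowed from the usage example in this README (an assumption
# about your data; change them to match your own file).
REQUIRED_COLUMNS = ("molregno", "canonical_smiles", "standard_value")


def missing_columns(csv_text: str, required=REQUIRED_COLUMNS) -> list[str]:
    """Return the required column names that are absent from the CSV header."""
    reader = csv.DictReader(io.StringIO(csv_text))
    header = reader.fieldnames or []
    return [name for name in required if name not in header]


def rows_with_empty_fields(csv_text: str, required=REQUIRED_COLUMNS) -> list[int]:
    """Return 1-based data-row numbers where any required field is blank."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        row_number
        for row_number, row in enumerate(reader, start=1)
        if any(not (row.get(name) or "").strip() for name in required)
    ]


sample = (
    "molregno,canonical_smiles,standard_value\n"
    "1,CCO,12.5\n"
    "2,,3.1\n"
)
print(missing_columns(sample))         # []
print(rows_with_empty_fields(sample))  # [2]
```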

Users can customize the analysis by enabling or disabling specific descriptors and dimensionality reduction techniques via Boolean parameters (e.g., True or False). Detailed instructions and examples are available in the documentation.

Example of usage

# Example: run the MAYA pipeline on a spreadsheet of annotated molecules
from maya_chem import MayaAnalyzer, MayaConfig

# Create the configuration
config = MayaConfig(data_path="TEST.xlsx")

# Update the relevant keys
config.data.update({
    "id_col": "molregno",
    "smiles_col": "canonical_smiles",
    "activities": ["standard_value"],
})
config.analysis["fingerprint"] = ["morgan", "maccs"]
config.analysis["reduction_method"] = ["pca"]
config.viz["output_dir"] = "/content/MAYA/colab_results"

# Run the pipeline
analyzer = MayaAnalyzer(config)
results = analyzer.run(color_by="standard_value")
figs = analyzer.visualize(interactive_mode=True, color_by="standard_value")

print("✅ Pipeline completed. Check '/content/MAYA/colab_results' for outputs.")

See this notebook for more detailed usage.

Why use MAYA?

MAYA performs an automated analysis of a database annotated with any activity, property, or score by constructing a chemical multiverse, giving a deeper understanding of multiple structure-activity relationships.

You can customize the descriptors and techniques used depending on the required focus. You can select which descriptors you want to use, and you can also input a similarity matrix of any desired descriptor, allowing its integration into the generated visualizations.
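
If you plan to supply your own similarity matrix, a quick sanity check can catch problems before plotting. The sketch below assumes the matrix behaves like a Tanimoto-style matrix (square, symmetric, ones on the diagonal, values in [0, 1]). That is an assumption about typical similarity matrices, not a documented MAYA requirement, and `check_similarity_matrix` is a hypothetical helper.

```python
import math


def check_similarity_matrix(matrix: list[list[float]], tol: float = 1e-9) -> list[str]:
    """Return a list of problems found; an empty list means no problem was detected."""
    n = len(matrix)
    if any(len(row) != n for row in matrix):
        return ["matrix is not square"]

    problems = []
    # Every entry should lie in the closed interval [0, 1].
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            if not 0.0 <= value <= 1.0:
                problems.append(f"entry ({i},{j}) is outside [0, 1]")

    # Diagonal should be 1 and the matrix should be symmetric.
    for i in range(n):
        if not math.isclose(matrix[i][i], 1.0, abs_tol=tol):
            problems.append(f"diagonal entry {i} is not 1")
        for j in range(i + 1, n):
            if not math.isclose(matrix[i][j], matrix[j][i], abs_tol=tol):
                problems.append(f"entries ({i},{j}) and ({j},{i}) differ")
    return problems


good = [[1.0, 0.4], [0.4, 1.0]]
bad = [[1.0, 0.4], [0.3, 1.0]]
print(check_similarity_matrix(good))  # []
print(check_similarity_matrix(bad))   # ['entries (0,1) and (1,0) differ']
```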

Access to well-documented code is provided, covering database curation processes, similarity calculations, and dimensionality reduction techniques.

Usage

Google Colaboratory
The easiest way to use the script is to open it in Google Colaboratory. All you need is a Google account.
Local installation
You can also set up your own local environment if you prefer not to run the script through a Google service.

Additional Information

MAYA currently supports Python 3.10.

rdkit (2022.09.05)

matplotlib (3.7.1)

pandas (2.1.4)

seaborn (0.13.1)

sklearn (1.3.2)
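
For a local setup, the following sketch reports which of the packages listed above are installed and at what version. It only uses the standard library. The distribution names are assumptions: `scikit-learn` is the package that provides the `sklearn` import, and RDKit may be registered under a different name depending on how it was installed. The helper name `installed_versions` is illustrative.

```python
import sys
from importlib import metadata
from typing import Optional

# Versions as listed above, keyed by assumed distribution name.
PINNED = {
    "rdkit": "2022.09.05",
    "matplotlib": "3.7.1",
    "pandas": "2.1.4",
    "seaborn": "0.13.1",
    "scikit-learn": "1.3.2",
}


def installed_versions(packages=PINNED) -> dict[str, Optional[str]]:
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


print("Python", ".".join(map(str, sys.version_info[:3])))
for name, version in installed_versions().items():
    print(f"{name}: listed {PINNED[name]}, installed {version or 'not found'}")
```

The script only reports versions and does not compare them, since version strings can be normalized differently by installers.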

Funding

Research contained in this package was supported by the Consejo Nacional de Humanidades, Ciencia y Tecnología (CONAHCYT) through scholarship No. CVU 1340927.
