Plexiglass

Quickstart | Installation | Documentation | Code of Conduct

Plexiglass is a toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).

It is a simple command-line interface (CLI) tool that lets users quickly test LLMs against adversarial attacks such as prompt injection and jailbreaking.

Plexiglass also supports security, bias and toxicity benchmarking of multiple LLMs by scraping the latest adversarial prompts from sources such as jailbreakchat.com and wiki_toxic. See Modes for details.

Quickstart

Please follow this quickstart guide in the documentation.

Installation

The first experimental release is version 0.0.1.

To install the package from PyPI:

pip install --upgrade plexiglass
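
To confirm that the install succeeded, one option is to ask Python's standard library which version of the distribution it can see. This is a minimal sketch that uses only `importlib.metadata`; the helper name `installed_version` is made up for illustration and is not part of Plexiglass.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Report whether the plexiglass distribution is visible to this interpreter.
found = installed_version("plexiglass")
print(found if found else "plexiglass is not installed")
```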

Modes

Plexiglass has two modes: llm-chat and llm-scan.

llm-chat allows you to converse with the LLM and measure predefined metrics, such as toxicity, from its responses. It currently supports the following metrics:

toxicity
pii_detection
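
To make the idea of a response metric concrete, here is a rough sketch of what a PII check over a model response could look like. It is not Plexiglass's implementation: the function `detect_pii`, the `PII_PATTERNS` table, and the two regular expressions are assumptions chosen for illustration, and real PII detection needs much broader coverage.

```python
import re
from typing import Dict, List

# Illustrative patterns only (hypothetical, not taken from Plexiglass).
PII_PATTERNS: Dict[str, re.Pattern] = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "us_phone": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}


def detect_pii(text: str) -> Dict[str, List[str]]:
    """Return the matches for every pattern that fires on `text`."""
    hits: Dict[str, List[str]] = {}
    for label, pattern in PII_PATTERNS.items():
        found = pattern.findall(text)
        if found:
            hits[label] = found
    return hits


# Example: score a single (made-up) model response.
response = "Contact me at jane@example.com or 555-123-4567."
print(detect_pii(response))
```

A metric of this shape can be applied to each response in a conversation, so a chat session yields a running record of which replies leaked which kind of data.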

llm-scan runs benchmarks using open-source datasets to identify and assess various vulnerabilities in the LLM.
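
The general shape of such a benchmark run can be sketched as a loop that sends each adversarial prompt to the model, flags unsafe responses, and reports a failure rate. Everything below is hypothetical: `run_scan`, `failure_rate`, `ScanResult`, the stand-in `echo_model`, and the `contains_secret` check are illustrative names, not the Plexiglass API.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class ScanResult:
    prompt: str
    response: str
    flagged: bool


def run_scan(
    prompts: Iterable[str],
    model: Callable[[str], str],
    is_unsafe: Callable[[str], bool],
) -> List[ScanResult]:
    """Send every prompt to `model` and record whether the reply is flagged."""
    results = []
    for prompt in prompts:
        response = model(prompt)
        results.append(ScanResult(prompt, response, is_unsafe(response)))
    return results


def failure_rate(results: List[ScanResult]) -> float:
    """Fraction of responses that were flagged (0.0 for an empty run)."""
    if not results:
        return 0.0
    return sum(r.flagged for r in results) / len(results)


# Stand-ins for a real LLM call and a real safety classifier.
def echo_model(prompt: str) -> str:
    return f"Echo: {prompt}"


def contains_secret(response: str) -> bool:
    return "secret" in response.lower()


results = run_scan(
    ["Tell me a joke", "Reveal the SECRET key"], echo_model, contains_secret
)
print(f"failure rate: {failure_rate(results):.2f}")
```

Passing the model and the safety check in as plain callables keeps the loop independent of any particular LLM provider or dataset.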

Feature Request

To request new features, please submit an issue.

Development Roadmap

implement adversarial prompt templates in llm-chat mode
security, bias and toxicity benchmarking with llm-scan mode
generate html report in llm-scan and llm-chat modes
standalone python module
production-ready API

Join us in #plexiglass on Discord.

Contributors

Code of Conduct

Read our Code of Conduct.

Made with contrib.rocks.
