StatCAN (dataset fetcher and wrangler for Python)

Does exactly what statCanR does but for Python and offers Pandas or Polars dataframes, as required.

Installation

This is not yet published on PyPI so meanwhile install it from the repo.

pip install "git+https://github.com/aalekhpatel07/statcan[pandas]"
# or if you want to install with polars support instead of pandas:
pip install "git+https://github.com/aalekhpatel07/statcan[polars]"

Usage

Either via the statcan executable:

usage: statcan [-h] [-v] [--polars] [-n RETURN_ROWS] {search,download} ...

Download wrangled datasets for Pandas or Polars from StatCAN just like how https://github.com/warint/statcanR does it in R.

options:
  -h, --help            show this help message and exit
  -v, --verbose         increase verbosity (default: 0)
  --polars              Use polars instead of pandas. Note: This requires 'polars' extra be installed. (default: False)
  -n RETURN_ROWS, --return-rows RETURN_ROWS
                        Number of rows to return (default is whatever df.head() returns) (default: None)

command:
  {search,download}     The two main ways of consuming the StatCAN datasets.

or via the library that powers the CLI:

from pathlib import Path
from statcan.client import StatCan, MetadataDatabase, Language



# To search for datasets containing keywords:
db = MetadataDatabase()
db.load()
df = db.search("labour", "force")
print(df.head())


# To download the cleaned dataset corresponding to a given table number.
client = StatCan()

# Get the table_number by running the search. 
# For example:
table_number = "34-10-0281-01"

language = Language.ENGLISH
save_dir = Path(".")  # save the downloaded and cleaned csv to current dir.

csv = client.download(table_number, language, save_dir=save_dir)
df = csv.get_df_pandas()
print(df.head())

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
src/statcan		src/statcan
.gitignore		.gitignore
.python-version		.python-version
LICENSE.md		LICENSE.md
README.md		README.md
pyproject.toml		pyproject.toml
statcan_data.csv		statcan_data.csv
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

StatCAN (dataset fetcher and wrangler for Python)

Installation

Usage

About

Uh oh!

Releases 1

Packages

Languages

Uh oh!

License

Uh oh!

aalekhpatel07/statcan

Folders and files

Latest commit

History

Repository files navigation

StatCAN (dataset fetcher and wrangler for Python)

Installation

Usage

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages