Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Conversation

@pull
Copy link

@pull pull bot commented May 30, 2025

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.1)

Can you help keep this open source service alive? 💖 Please sponsor : )

sh-rp and others added 6 commits May 30, 2025 13:18
* remove some duplicate test utils

* use dataset to get table counts

* add exception for sftp but use dataset otherwise for loading table counts and contents

* update checking of empty tables in filesystem tests

* support filesystemsqlclient for tables that have prefixes rather than folders

* fix table location resolution for internal tables

* make sftp check raise same errors as filesystemsqlclient

* more cleanup

* fix replace disposition tests

* simplify table count code in many places

* small cleanup

* fix tables to dicts function

* disable databricks and synapse ibis backend tests

(cherry picked from commit aba8de4)

* simplify table assertions

* add tests for tests :)

* fix two tests

* fix dbt tests

* makes open table locations to work in windows fs

* review comments

* adds docstrings plus linting to pipeline utils

* fix docstring linting on utils class

* bump adlfs in lockfile

* test loading abfss first

* test getting tables one by one for azure

* fix resolving of sql_client

* change folder detection

* add comment for abfss fix
fix iceberg

* move abfss fallback into utils method

* normalizes trailing separator in paths in filesystem

* fixes two tests

* fix glob resolution for tables that have nested folders

* removes globs from duckdb filesystem sql client, adds tests for edge cases

* disables globbing for iceberg, adds optional autorefresh flag for view, fixes tests and docs

---------

Co-authored-by: Marcin Rudolf <[email protected]>
…uery (#2676)

* Feat: Add test for creating table with custom integer range partition

* Refactor: Remove unnecessary imports in custom integer range partition test

* Feat: Include bigquery_partition in adapters  __all__ exports

* Fix: Rename partition argument

* Refactor: Rename test for custom integer range partition to range bucket

* Feat: Add PartitionTransformation and bigquery_partition helper

* Fix: Update partition validation to accept PartitionTransformation in bigquery_adapter

* Feat: Implement partition clause generation for BigQuery SQL

* Refactor: reformat code

* Fix: Apply mypy fixes

* Fix: Update docstring

* reformat code
* start marimo app

* some more work

* a few small additional changes

* move marimo to dlt helpers and some small changes

* a bunch of improvements

* ui improvments and start fixing types

* clean up imports and make app more typesafe

* nicer tables

* start data page with row counts

* first version of query explorer

* make db browser nicer and dataset faster

* add pipeline quickstart links
add query cache and fast query execution

* add studio extra

* add first very simple test

* add studio command

* add more first tests

* fix dropdown

* rename helpers to utils
fix linter

* incomplete work on e2e tests

* tmp

* move e2e tests

* add tests to common file

* fallback when getting pipelines

* add poetry context to marimo start command

* fix folder

* add basic page checking for all e2e test pipelines

* small change

* add python caching (marimo caching does not work properly) and make dlt_pipeline a top level object

* start adding load info tab

* add ibis to e2e dependencies

* add loads page and data browser query history

* update basic e2e tests

* basic grammar fixes

* start adding trace view

* clean up imports

* start reworking tabs / switches

* finish conversion into grid friendly version

* fix types

* clean up strings and cell names

* a bit of styling

* make schema page one cell

* some style  updates

* changes to schema browser

* stg

* some text improvements

* fix unit tests

* fixes tests

* fix load id based row counting

* small css improvements

* add more info to trace section

* fix tests and small changes to trace page

* small string change

* fix warnings in edit mode

* extract all strings

* fix strings

* comments and some formatting

* remove incorrect info

* add config and make tests work again

* us string refs in e2e tests

* update test file

* add better timestamp rendering for loads and update tests

* fix rest api tests

* disable marimo tests on python 3.13

* use marimo state for some caching

* slightly re-organize utils

* add generated version of utils tests

* exclude python 3.9 for marimo e2e tests

* run e2e tests headless

* disable marimo e2e tests on windows

* remove marimo extra and create dependency groups for marimo and streamlit

* add marimo dependencies to linter

(cherry picked from commit e4235a9)

* streamlit should be present for linting

* re-enable relevant fixtures for e2e tests
remove unused imports

* move marimo tests first for debugging purposes

* print html from test to see what is going on

* another test

* do not set duckdb credentials and move marimo tests back to end

* fix marimo app dependencies
* triggers devel tests

* fixed malformed docstring

* use native sqlglot type annotation

* pass hints via SQLGlot metadata

* fix linter errors and tests

* fix a few more tests and edge cases

* fix bug in lineage

* enable columns schema for both ReadableRelation Types

* add more tests and make lineage tests independent from loading

* add lineage tests for all sql destinations

* enable tests on ci and disable column schema for sqlalchemy for now

* fix some more tests

* add sqlalchemy hack

* first fix for snowflake and some smaller chnages and clarifications

* fix sqlglot schema creation, makes clickhouse work

* re-add transformations tests folder

* fix lineage datatype

* disable databricks and synapse ibis backend tests

* move transformation code from prototype excluding old lineage and including updates so that linter passes, no real code changes yet.

* fix some of the python extractor based transformations

* fix most tests

* make basic transformation tests run on all destinations

* enable all current transformation tests for all destinations
run some duckdb transformations on all OSes

* a little bit of cleanup

* move common transactions and mark all destination transaction tests as essential for now

* Add improvements from review in prototype PR and some cleanup

* exclude dremio

* fix some transformations tests

* fix row_counts for snowflake and add some comments

* converts SupportsReadableRelation to an ABC

* add scalar access to SupportsReadableRelation

* simplify transformation signature

* add top level dlt objects and some small changes

* second part of removing transformation extra args

* add clickhouse tests

* add config based transformation source

* add better transformation examples

* use fruitshop template for testing

* remove custom row_counts method in favor of "global" test one

* first draft of transformations doc

* some work on the docs page

* feat: 2540 lineage `allow_unknown_columns` and `allow_anonymous_columns` (#2577)

* test compute_columns_schema() and exception handling

* convert transformation code examples to snippets

* finish first round of transformation docs

* Quite a few PR fixes

* fixes some tests

* add support and docs for dataframe and arrow operations

* add config and fallback if destination not reachable

* fix scalar method
fallback to models if pipeline destination is not available

* hopefully fix one test

* Docs: addition of normalizer behaviour to transformations docs (#2639)

* Normalizer info added

* Unnecessary paragraph removed, regular normalization linked

* feat: 2540 - SQLGlot type mapping (#2587)

* fixes some tests

* post rebase cleanup

* renamed kwarg

* type handling done; WIP

* sqlglot-dlt type mapping completed

* added docstrings to tests

* removed unused test file

* attach metadata to DataType

* refactored test to parameterized form

* refactor function names

* bug fix .to_py()

* rename compute_columns_schema() kwargs

* refactor type conversion branches

* fixes some tests

* add support and docs for dataframe and arrow operations

* add config and fallback if destination not reachable

* fix scalar method
fallback to models if pipeline destination is not available

* fix: update return type in athena_adapter docstring to reflect correct destination (#2599)

* list secrets in vault config provider to avoid calls to backend (#2597)

* fixes bug where configuration section was not propagated when embedded configuration is resolved

* splits vault provider settings per vault type

* adds option to list secrets to vault and google secrets provider

* uses google secrets provider with global cache for tests

* documents vault provider

* test and docs fixes

* slightly clarify clickhouse docs (#2594)

* slightly clarify clickhouse docs

* Update clickhouse.md

* Extract dataset code snippets into tests snippets system (#2598)

* extracts dataset code blocks into tested snippets and uses fruitshop pipeline as base dataset for demonstration purposes

* add ibis group

* Enabling 'model' loader_file_format for athena, synapse and dremio (#2556)

* Athena model loader format initial support

* test_verify_capabilities_data_types adjusted for athena

* Synapse enabled

* The offset logic for tsql made unreachable

* Athena test config without iceberg removed, dremio added

* Unnecessary synapse workaround removed

* fix some typos in cursor-restapi docs (#2608)

* fix some typos in cursor-restapi docs

* fix typo

* refactor init-command for use in dlt project (#2568)

* refactor init-command for use in dlt project

* remove config.toml from project docs

* fix ibis mypy error

---------

Co-authored-by: dave <[email protected]>

* docs: Fix incorrect nesting in secrets.toml (#2614)

* fixes parquet data writer settings docs & rewrites configuration docs (#2583)

* fixes parquet data writer settings docs

* adds section to dlt resource decorator

* fixes and tests how config sections are created when single resource is extracted

* fixes config sections for parallel doc example

* exports postgres adapter

* rewrites configuration docs, moves a few docs sections in sidebar

* snippet fixes

* accepts docs changes from review

Co-authored-by: Violetta Mishechkina <[email protected]>

* adds tip how to eject core source

* linter fixes

---------

Co-authored-by: Violetta Mishechkina <[email protected]>

* enables fsspec per-thread instance cache and updates documentation (#2621)

* bumps pendulum and docs (#2624)

* fixes sql database docstrings and docs

* bumps poetry to 3.0.1 and drop dlt poetry

* Added dedup sort example (#2235)

* Added dedup sort example

* Updated formatting

* Updated

* Updated

* Update docs/website/docs/general-usage/incremental-loading.md

---------

Co-authored-by: Alena Astrakhantseva <[email protected]>
Co-authored-by: Marcin Rudolf <[email protected]>

* Docs: add advanced project tutorial (#2338)

* hopefully fix one test

* trigger ci

* improve tests, lint

---------

Co-authored-by: David Scharf <[email protected]>
Co-authored-by: Anton Burnashev <[email protected]>
Co-authored-by: rudolfix <[email protected]>
Co-authored-by: anuunchin <[email protected]>
Co-authored-by: hsm207 <[email protected]>
Co-authored-by: djudjuu <[email protected]>
Co-authored-by: Alexander Grueneberg <[email protected]>
Co-authored-by: Violetta Mishechkina <[email protected]>
Co-authored-by: dat-a-man <[email protected]>
Co-authored-by: Alena Astrakhantseva <[email protected]>

* qualify all queries that come into the transformations

* fix lineage for snowflake and clickhouse lineage

* apply schema fix for sqlglot and remove special treatment of snowflake

* align datasets interfaces with ibis implementation ["col"] selects column and not table with one column

* disable incremental on transformations decorator and warn if incremental args are discovered

* fixes one more test

* fixes snowflake tests after sqlglot schema fix

* removes standalone resources, fixes transformation function wrapping (#2684)

* changes contrib and README (#2666)

* changes contrib and README

* Apply suggestions from code review

Co-authored-by: Anton Burnashev <[email protected]>

---------

Co-authored-by: Anton Burnashev <[email protected]>

* raises if resolving dataclass without configspec

* adds function type inspect that follows wrappers

* removes make fun, uses wraps

* adds conftest to transformations

* (1) fixes tranformation overloads (2) passes TransformationConfiguration as base spec so buffer is always injected (3) wraps tranformation_function (4) makes str SQL a model (5) tests configurations and parametrized transformations

* (1) removes resources returning resources (2) allows resources to be also functions (3) allows base spec to be passed to resource function (4) makes DltResource and SourceFactory to wrap decorated function and fixes signatures (5) allows inner resources to be injectable, warns for transformers (6) normalizes and tests how functions are wrapped and unwrapped so signatures and configs are available

* normalizes config resolve behavior: default values can be overriden from providers but explicit cannot. if those were instances of base configurations, behavior was inconsistent (explicit values were treated like defaults). also if native value is found for a config and it does not accept native values, config resolution will fail, previously it was ignored

* do not use config specs cached in module when creating autospecs

* fixes venv tests when uv is present

* if incremental parses from another incremental as native value, it copies origina type correctly

* merges standalone resources with regular resources: (1) all are DltResources (2) we generate the correct types for __call__! (3) all resources can be configured including inner resources and including default params, previously only standalone could. that unifies behavior for resources and sources re. config injection (4) resources can return another resources if have DltResource in type annotation (5) resources can be renamed with lambda names also sections can be renamed

* fixes transformation decorators so they generate correct typing

* binds params to resource function instead of using defaults to avoid generating config injection in rest_api

* removes remaining full_refresh flags

* fixes Makefile commands to run common and local destination tests

* fixes xdg home test

* fixes venv tests for uv

* linter and docsstring fixes

---------

Co-authored-by: Anton Burnashev <[email protected]>

* allows for initial values that are configurations also in case no native initial values are supported

* fixes docs linting

* Outer select quotes columns (#2694)

* fix normalizer tests

* fix a few small tests

* remove dependency on ibis for common tests (not supported on python 3.13)

* fixes for python 3.9

* fix sqlglot schema propagation and retrieval

* fixes leaking sqlalchemy credentials into other test

* skip not materialized columns in sqlglot schema generation

---------

Co-authored-by: Marcin Rudolf <[email protected]>
Co-authored-by: zilto <[email protected]>
Co-authored-by: Thierry Jean <[email protected]>
Co-authored-by: anuunchin <[email protected]>
Co-authored-by: Anton Burnashev <[email protected]>
Co-authored-by: hsm207 <[email protected]>
Co-authored-by: djudjuu <[email protected]>
Co-authored-by: Alexander Grueneberg <[email protected]>
Co-authored-by: Violetta Mishechkina <[email protected]>
Co-authored-by: dat-a-man <[email protected]>
Co-authored-by: Alena Astrakhantseva <[email protected]>
@pull pull bot added the ⤵️ pull label May 30, 2025
@pull pull bot merged commit fd88bb0 into kp-forks:devel May 30, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants