-
Notifications
You must be signed in to change notification settings - Fork 1k
ARROW-12493: Add support for writing dictionary arrays to CSV and JSON #16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Codecov Report
@@ Coverage Diff @@
## master #16 +/- ##
==========================================
+ Coverage 82.47% 82.48% +0.01%
==========================================
Files 162 162
Lines 43414 43447 +33
==========================================
+ Hits 35806 35838 +32
- Misses 7608 7609 +1
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me. Thank you @tustvold
|
||
assert_eq!( | ||
String::from_utf8(buf).unwrap(), | ||
r#"{"c1":"cupcakes","c2":"sdsd"} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🧁
We had to perform a small re-write of master. The commits may look a bit odd, but it should not cause conflicts. Could you kindly rebase this against the latest master to make it easier to review? |
4fa2a1c
to
7f425c7
Compare
partition_validity
to use fast path for bit map scan (up to 30% faster)
#7962
Provide support for serializing dictionary arrays to CSV and JSON by hydrating them to their underlying representation. This is not the most efficient way to do this, but was the simplest way I could think of to cover all bases.
It may be worthwhile special-casing StringDictionaries with a more efficient implementation in a subsequent PR, as I imagine they're the most common form of DictionaryArray.