Releases: goose3/goose3
Releases · goose3/goose3
Version 3.1.20
- Remove suppot for python
3.7, and3.8 - Handle
Noneresult for schema author; see issue #197 and PR #198; Thanks adonisnafeh - Hanlde
@graphas a dictionary or list from metadata extraction; PR #203; Thanks adonisnafeh - Ensure that the
final_urlis not a list, but a single entry; see issue #199 - Add Japanese tokenizer; PR #213; Thanks Vasniktel
- Pin
lxmlto less than 6.0.0
Version 3.1.19
- Clean
smalltags by removing them; see issue #190 - Update stopword files with additional languages; see issue #194
Version 3.1.18
Version 3.1.17
- Fix issue with multiple site names in the metadata; see issue #179
- Fix key error in author extraction; PR #177 and PR #183; Thanks anchitshrivastavaand erikvullings
Version 3.1.16
- Fix issue with author parsing; see issue #175
Version 3.1.15
- Add
article.schemasupport for@graphdata; PR #168; Thanks julianofnascimento - Retrieve
article.authorsfrom linked data; see issue #170 - Fix
article.titleextraction whenopengraphreturns a list instead of a single string; see issue #172
Version 3.1.14
- Fix
article.schemafailing when returns a list instead of a dictionary; PR #160; Thanks @catdingding - Add
article.top_node_raw_htmlwhich keeps the base html of the top node; see issue #112 and PR #162; Thanks @markowanga - Handle opengraph arrays if multiple tags are present; see issue #91
Version 3.1.13
- Deprecation Notice:
camelCasefunctions have been deprecated; see issue #151
- Fix for building Goose3 into an executable; see issue #144
- Fix
schema.orgcheck; see PR #154; Thanks @mromanelli9 - Fix
TypeErrorissues in publisher and title extractions; see PR #157 and PR #156; Thanks @Amaimersion - Fix
Configuration.strictnot always being enforced; see issue #158 - Update build system to be PEP 517 compliant
- Code Standardization
Version 3.1.12
- Fix for Korean stop words see PR #138; Thanks galaxytemple
- Allow for extra dependencies see issue #141
- Fix leading and trailing charset characters see issue #139; Thanks @nnick14
- Added basic logging and typing
Version 3.1.11
- Replace
md5with a pure pythonfnv_1anon-cryptographic hash see issue #133; Thanks @openbrian