Umlauts and special characters in URLs #127

@nockenfell

Description

If a page contains links with umlauts (ä, ü, ö) or other special characters such as ß, those links are misrecognized because the URLs are not saved correctly.

Which charset is used for crawling? In the JSON/CSV output, umlauts are displayed incorrectly.

Example for crawling: ihr-anwalt-hamburg.de
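
For illustration, here is a minimal Python sketch of the behavior I would expect. It assumes the page and its links are UTF-8; the function name `encode_url` and the `example.com` URL are made up for this example and are not part of the crawler. Non-ASCII characters in the path and query are percent-encoded as UTF-8, and the JSON output keeps umlauts readable instead of escaping them.

```python
import json
from urllib.parse import quote, unquote, urlsplit, urlunsplit


def encode_url(url: str) -> str:
    """Percent-encode non-ASCII characters in path and query as UTF-8.

    Hypothetical helper for illustration only. "%" is kept as a safe
    character so that already-encoded URLs are not encoded twice.
    """
    parts = urlsplit(url)
    path = quote(parts.path, safe="/%")
    query = quote(parts.query, safe="=&%")
    return urlunsplit((parts.scheme, parts.netloc, path, query, parts.fragment))


# Hypothetical link containing umlauts and ß.
raw = "https://example.com/über-uns/straße"
encoded = encode_url(raw)

# Decoding the stored form gives back the original link.
restored = unquote(encoded)

# ensure_ascii=False writes umlauts as-is instead of \uXXXX escapes.
as_json = json.dumps({"url": restored}, ensure_ascii=False)
```

With this approach, the stored URL is plain ASCII (`/%C3%BCber-uns/stra%C3%9Fe`), and the human-readable form can be recovered for the JSON or CSV export as long as the output file is also written as UTF-8.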
