Thanks to visit codestin.com
Credit goes to github.com

Skip to content

标题抽取因为 | 字符少了一截 #124

@pgshow

Description

@pgshow

bug的现象

在抽取该文章的标题时 https://baijiahao.baidu.com/s?id=1733755048466991904
原标题为 操盘必读|央行、银保监会部署加大信贷投放;国泰君安拟协议受让华安基金8%股权;美股涨跌不一,但是只返回了 “操盘必读” 这四个字

如何复现

  1. 目标网址: https://baijiahao.baidu.com/s?id=1733755048466991904
  2. 你怎么调用GNE的

gne_info = extractor.extract(response.text)
title = gne_info['title']

使用环境:

  • OS: [e.g. Ubuntu 19.04/Windows 10/macOS ]
  • Python版本 [e.g. 3.8]

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions