CRC check fails after downloading words.vector.gz #149

@Lyndon-wong

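After purchasing a license, the call fails. Deleting words.vector.gz and downloading it again produces the same problem and also uses up one of the license's downloads. Please fix this as soon as possible.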
(rag_dataset) b220@ps:~/nas_nfs/user/wld/project/rag_dataset$ /home/b220/miniconda3/envs/rag_dataset/bin/python /home/b220/nas_nfs/user/wld/project/rag_dataset/EDA_NLP_for_Chinese/code/eda.py
SYNONYMS_DL_LICENSE= L5PDIT1185
set LOG_LEVEL WARNING

Synonyms: v3.23.6, Project home: https://github.com/chatopera/Synonyms/

Project Sponsored by Chatopera

deliver your chatbots with Chatopera Cloud Services --> https://bot.chatopera.com

Module file path: /home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/synonyms.py

************ NOTICE ************
Require license to download model package, purchase from https://store.chatopera.com/product/syns001


Synonyms load wordseg dict [/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/data/vocab.txt] ...
Building prefix dict from /home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/data/vocab.txt ...
DEBUG:jieba:Building prefix dict from /home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/data/vocab.txt ...
Loading model from cache /tmp/jieba.u8e9a57485619493e8d764e8744ee4f25.cache
DEBUG:jieba:Loading model from cache /tmp/jieba.u8e9a57485619493e8d764e8744ee4f25.cache
Loading model cost 1.225 seconds.
DEBUG:jieba:Loading model cost 1.225 seconds.
Prefix dict has been built successfully.
DEBUG:jieba:Prefix dict has been built successfully.
Synonyms on loading stopwords [/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/data/stopwords.txt] ...
Synonyms on loading vectors [/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/data/words.vector.gz] ...

Synonyms downloading data with licenseId L5PDIT1185, save to /home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/data/words.vector.gz ...
this only happens if Synonyms initialization for the first time.
It would take minutes that depends on network.
[chatopera] store licensed file downloading is started, it takes minutes depending on your network ...

100% [......................................................................] 165919480 / 165919480
[chatopera] store file download done.

Synonyms downloaded

Traceback (most recent call last):
  File "/home/b220/nas_nfs/user/wld/project/rag_dataset/EDA_NLP_for_Chinese/code/eda.py", line 15, in <module>
    import synonyms
  File "/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/__init__.py", line 14, in <module>
    from .synonyms import *
  File "/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/synonyms.py", line 176, in <module>
    _vectors = _load_w2v(model_file=_f_model)
  File "/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/synonyms.py", line 173, in _load_w2v
    return KeyedVectors.load_word2vec_format(
  File "/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/site-packages/synonyms/word2vec.py", line 163, in load_word2vec_format
    ch = fin.read(1)
  File "/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/gzip.py", line 301, in read
    return self._buffer.read(size)
  File "/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/_compression.py", line 68, in readinto
    data = self.read(len(byte_view))
  File "/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/gzip.py", line 479, in read
    self._read_eof()
  File "/home/b220/miniconda3/envs/rag_dataset/lib/python3.10/gzip.py", line 525, in _read_eof
    raise BadGzipFile("CRC check failed %s != %s" % (hex(crc32),
gzip.BadGzipFile: CRC check failed 0x1752f967 != 0xce28e14d
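
The traceback shows the error being raised from `_read_eof`, which is where Python's `gzip` module compares the CRC stored in the file's trailer against the CRC of the data it decompressed. To check a downloaded file independently of Synonyms, a minimal sketch like the following may help. It uses only the standard library; `gzip_is_intact` is a hypothetical helper name, not part of the Synonyms API, and it only detects a damaged file, it does not repair one.

```python
import gzip
import zlib


def gzip_is_intact(path, chunk_size=1 << 20):
    """Return True if the gzip file at `path` decompresses fully and passes its CRC check.

    Hypothetical helper for diagnosis only; not part of Synonyms.
    """
    try:
        with gzip.open(path, "rb") as fin:
            # Read to EOF: the CRC-32 / length trailer is only verified
            # once the end of the compressed stream is reached.
            while fin.read(chunk_size):
                pass
    except (gzip.BadGzipFile, EOFError, zlib.error):
        # BadGzipFile: bad header or CRC/length mismatch.
        # EOFError: file truncated before the end-of-stream marker.
        # zlib.error: corrupted deflate data.
        return False
    return True
```

Running it against the path printed in the log above (`.../site-packages/synonyms/data/words.vector.gz`) before `import synonyms` would show whether the file on disk is already damaged. Copying the file to a local disk and checking again is one way to see whether the result changes with the storage location.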
