Encoding problems #98

Open
yaroshkin1979 opened this issue Apr 2, 2021 · 0 comments
Comments

@yaroshkin1979

I wrote a program based on the examples in the documentation, and it failed with:
UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-6: ordinal not in range(128)
I worked around this with encode('utf8').
After debugging the script, I ran it from the console, passing the text as an argument:
python3 test.py "Премьер Грузии Ираклий Гарибашвили"
Now the script produces empty output.
Please help me debug it.

```python
#!/usr/bin/env python3
# -*- coding: utf-8 -*-

from sys import argv
from natasha import (
    Segmenter,
    MorphVocab,

    NewsEmbedding,
    NewsMorphTagger,
    NewsSyntaxParser,
    NewsNERTagger,

    PER,
    NamesExtractor,

    Doc
)
import pymorphy2

# The text to analyze is passed as the single command-line argument.
scriptname, text = argv
segmenter = Segmenter()
morph_vocab = MorphVocab()

emb = NewsEmbedding()
morph_tagger = NewsMorphTagger(emb)
syntax_parser = NewsSyntaxParser(emb)
ner_tagger = NewsNERTagger(emb)

names_extractor = NamesExtractor(morph_vocab)

doc = Doc(text)
doc.segment(segmenter)

doc.tag_ner(ner_tagger)
# Workaround from the report: encode each span text to UTF-8 bytes before printing.
for span in doc.spans:
    print(str(span.text.encode('utf8')))
```
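For reference, the `'ascii' codec can't encode` error usually means that `sys.stdout` was opened with an ASCII encoding (for example under a `C`/`POSIX` locale), and `str(text.encode('utf8'))` only hides it by printing the bytes repr (`b'\xd0...'`) instead of the text. Below is a minimal sketch of printing `str` values through a UTF-8 stream instead. The helper names `utf8_text_stream`, `print_spans`, and `force_utf8_stdout` are hypothetical, the span texts are hard-coded sample strings rather than real `doc.spans` output, and `sys.stdout.reconfigure` assumes Python 3.7 or newer. Whether this also explains the empty result is not something the report confirms.

```python
import io
import sys


def utf8_text_stream(binary_stream):
    """Wrap a binary stream so that str written to it is encoded as UTF-8."""
    return io.TextIOWrapper(
        binary_stream, encoding="utf-8", newline="\n", line_buffering=True
    )


def print_spans(texts, stream):
    """Print each text as str; no manual .encode() is needed."""
    for text in texts:
        print(text, file=stream)


def force_utf8_stdout():
    """Switch the real stdout to UTF-8 regardless of the locale (Python 3.7+)."""
    if hasattr(sys.stdout, "reconfigure"):
        sys.stdout.reconfigure(encoding="utf-8")


if __name__ == "__main__":
    force_utf8_stdout()
    # Sample strings standing in for [span.text for span in doc.spans].
    print_spans(["Ираклий Гарибашвили", "Грузии"], sys.stdout)
```

Setting the environment variable `PYTHONIOENCODING=utf-8` before running the script is another way to get a UTF-8 stdout without touching the code. Printing `repr(text)` and `sys.stdout.encoding` at the top of the script may also help show whether the command-line argument arrived intact.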
