ckipnlp.ws package¶
-
class
ckipnlp.ws.
CkipWs
(*, logger=False, ini_file=None, lex_list=None, **kwargs)[source]¶ Bases:
object
The CKIP word segmentation driver.
- Parameters
logger (bool) – enable logger.
lex_list (Iterable) – passed to
ckipnlp.util.ini.create_ws_lex()
overridden lex_file forckipnlp.util.ini.create_ws_ini()
.ini_file (str) – the path to the INI file.
- Other Parameters
** – the configs for CKIPWS, passed to
ckipnlp.util.ini.create_ws_ini()
, ignored if ini_file is set.
Danger
Never instance more than one object of this class!
-
static
normalize_text
(text)[source]¶ Text normalization output.
Replacing keywords
()+-:|&#
by by full-width ones.
-
apply
(text, *, normalize=True)[source]¶ Parse a sentence.
- Parameters
text (str) – the input sentence.
normalize (bool) – do text normalization (please refer
normalize_text()
).
- Returns
str – the output sentence.
Hint
One may also call this method as
__call__()
.
-
apply_list
(ilist, *, normalize=True)[source]¶ Parse a list of sentences.
- Parameters
ilist (List[str]) – the list of input sentences.
normalize (bool) – do text normalization (please refer
normalize_text()
).
- Returns
List[str] – the list of output sentences.