ckipnlp.util.ws module

class ckipnlp.util.ws.WsWord[source]

Bases: tuple

A word-segmented word.

property word

str – the word.

property pos

str – the post-tag.

classmethod from_text(data)[source]

Construct an instance from ckipnlp.ws.CkipWs output.

Parameters

data (str) – text such as '中文字(Na)'.

Notes

  • '中文字(Na)' -> word = '中文字', pos = 'Na'

  • '中文字' -> word = '中文字', pos = None

to_text()[source]

Transform to plain text.

Returns

str

classmethod from_dict(data)[source]

Construct an instance from python built-in containers.

Parameters

data (dict) – dictionary such as { 'word': '中文字', 'pos': 'Na' }

to_dict()[source]

Transform to python built-in containers.

Returns

dict

classmethod from_json(data, **kwargs)[source]

Construct an instance from JSON format.

Parameters

data (str) – please refer from_dict() for format details.

to_json(**kwargs)[source]

Transform to JSON format.

Returns

str

class ckipnlp.util.ws.WsSentence(initlist=None)[source]

Bases: collections.UserList

A word-segmented sentence.

item_class

alias of WsWord

classmethod from_text(data)[source]

Construct an instance from ckipnlp.ws.CkipWs output.

Parameters

data (str) – text such as '中文字(Na)\u3000喔(T)'.

to_text()[source]

Transform to plain text.

Returns

str

classmethod from_dict(data)[source]

Construct an instance a from python built-in containers.

Parameters

data (Sequence[dict]) – list of objects as WsWord.from_dict() input.

to_dict()[source]

Transform to python built-in containers.

Returns

List[dict]

classmethod from_json(data, **kwargs)[source]

Construct an instance from JSON format.

Parameters

data (str) – please refer from_dict() for format details.

to_json(**kwargs)[source]

Transform to JSON format.

Returns

str

class ckipnlp.util.ws.WsSentenceList(initlist=None)[source]

Bases: collections.UserList

A list of word-segmented sentence.

item_class

alias of WsSentence

classmethod from_text(data)[source]

Construct an instance from ckipnlp.ws.CkipWs output.

Parameters

data (Sequence[str]) – list of texts as WsSentence.from_text() input.

to_text()[source]

Transform to plain text.

Returns

List[str]

classmethod from_dict(data)[source]

Construct an instance a from python built-in containers.

Parameters

data (Sequence[Sequence[dict]]) – list of objects as WsSentence.from_dict() input.

to_dict()[source]

Transform to python built-in containers.

Returns

List[List[dict]]

classmethod from_json(data, **kwargs)[source]

Construct an instance from JSON format.

Parameters

data (str) – please refer from_dict() for format details.

to_json(**kwargs)[source]

Transform to JSON format.

Returns

str