ckipnlp.container.util.wspos module¶
This module provides containers for word-segmented sentences with part-of-speech-tags.
- class ckipnlp.container.util.wspos.WsPosToken(word: Optional[str] = None, pos: Optional[str] = None)[source]¶
Bases:
BaseTuple,_WsPosTokenA word with POS-tag.
- Variables
word (str) – the word.
pos (str) – the POS-tag.
Note
This class is an subclass of tuple. To change the attribute, please create a new instance instead.
Data Structure Examples
- Text format
Used for
from_text()andto_text().'中文字(Na)' # word / POS-tag
- List format
Used for
from_list()andto_list().[ '中文字', # word 'Na', # POS-tag ]
- Dict format
Used for
from_dict()andto_dict().{ 'word': '中文字', # word 'pos': 'Na', # POS-tag }
- class ckipnlp.container.util.wspos.WsPosSentence[source]¶
Bases:
objectA helper class for data conversion of word-segmented and part-of-speech sentences.
- classmethod from_text(data)[source]¶
Convert text format to word-segmented and part-of-speech sentences.
- Parameters
data (str) – text such as
'中文字(Na)\u3000耶(T)'.- Returns
SegSentence– the word sentenceSegSentence– the POS-tag sentence.
- static to_text(word, pos)[source]¶
Convert text format to word-segmented and part-of-speech sentences.
- Parameters
word (
SegSentence) – the word sentencepos (
SegSentence) – the POS-tag sentence.
- Returns
str – text such as
'中文字(Na)\u3000耶(T)'.
- class ckipnlp.container.util.wspos.WsPosParagraph[source]¶
Bases:
objectA helper class for data conversion of word-segmented and part-of-speech sentence lists.
- classmethod from_text(data)[source]¶
Convert text format to word-segmented and part-of-speech sentence lists.
- Parameters
data (Sequence[str]) – list of sentences such as
'中文字(Na)\u3000耶(T)'.- Returns
SegParagraph– the word sentence listSegParagraph– the POS-tag sentence list.
- static to_text(word, pos)[source]¶
Convert text format to word-segmented and part-of-speech sentence lists.
- Parameters
word (
SegParagraph) – the word sentence listpos (
SegParagraph) – the POS-tag sentence list.
- Returns
List[str] – list of sentences such as
'中文字(Na)\u3000耶(T)'.