JapaneseTextParser
in package
Japanese text parsing using MeCab.
Handles splitting, previewing, and database insertion for Japanese text.
Tags
Table of Contents
Methods
- displayJapanesePreview() : void
- Display preview HTML for Japanese text.
- parseJapaneseToDatabase() : void
- Parse Japanese text with MeCab and insert into temp_word_occurrences.
- splitJapaneseSentences() : array<string|int, string>
- Split Japanese text into sentences (split-only mode).
Methods
displayJapanesePreview()
Display preview HTML for Japanese text.
public
static displayJapanesePreview(string $text) : void
Parameters
- $text : string
-
Preprocessed text
parseJapaneseToDatabase()
Parse Japanese text with MeCab and insert into temp_word_occurrences.
public
static parseJapaneseToDatabase(string $text, bool $useMaxSeID) : void
Parameters
- $text : string
-
Preprocessed text
- $useMaxSeID : bool
-
Whether to query for max sentence ID (true for existing texts)
splitJapaneseSentences()
Split Japanese text into sentences (split-only mode).
public
static splitJapaneseSentences(string $text) : array<string|int, string>
Parameters
- $text : string
-
Preprocessed text
Tags
Return values
array<string|int, string> —Array of sentences