search for: katakana

Displaying 12 results from an estimated 12 matches for "katakana".

2024 Jan 08
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...latter is valid, just removing this block (or removing the > parts of it which are Lu or Ll) should fix the problem as then > tokenisation will switch mode - I tried this and it fixes your case at > least: Removing the whole block will cause word-breaker to not correctly handle halfwidth Katakana, such as "??????????" which it would treat as a single term, whereas it should be two: ??????and ????). My pull request causes word-breaker to only handle halfwidth Katakana and Hangul codepoints as unbroken script and treats Latin characters, numbers, symbols and punctuation as broken...
2007 Jun 29
3
[PATCH] Fix keymap for Japanese keyboard
Hi All, We tested with a Japanese keyboard and found local keys that could not be input. This patch adds those keys to the keymap. The keys that cannot be input are as follows: ・Katakana ・Eisu_Toggle Signed-off-by: Takanori Kasai <kasai.takanori@jp.fujitsu.com> Signed-off-by: Junko Ichino <ichino.junko@jp.fujitsu.com> Best Regards, -- Takanori Kasai ...
2024 Jan 07
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Thu, Jan 04, 2024 at 05:50:22PM +0100, Robert Stepanek wrote: > Since I am undecided yet if and how to fix this in Xapian I haven't > come up with a pull request. Because trac currently is offline, I > could not file a bug. I hope it's OK to post my analysis here first, > I'll be happy to follow up reporting that bug proper later (should we > conclude that it actually
2024 Jan 09
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Mon, Jan 08, 2024 at 02:01:46PM +0100, Robert Stepanek wrote: > Removing the whole block will cause word-breaker to not correctly > handle halfwidth Katakana, such as "??????????" which it would treat > as a single term, whereas it should be two: ??????and ????). > > My pull request causes word-breaker to only handle halfwidth Katakana > and Hangul codepoints as unbroken script and treats Latin characters, > numbers, symbols a...
2011 Apr 21
2
Chinese segmentation
Hello, I have finished reading the papers, and I think it is time to design my project. The first step will be to determine whether the input characters are Chinese. I saw in a past post that cjk-tokenizer only deals with UTF-8 and Unicode, but I see other encodings such as GBK and Big5. Should I deal only with UTF-8 and Unicode?
2007 Sep 13
1
chartr better
For example, the following changes are necessary when I convert Japanese hiragana into katakana with chartr. R code: > chartr("\u3041-\u3093","\u30a1-\u30f3","\u3084\u3063\u305f\u30fc") --- R-alpha.orig/src/main/character.c 2007-09-05 07:13:27.000000000 +0900 +++ R-alpha/src/main/character.c 2007-09-13 16:10:21.000000000 +0900 @@ -2041,6 +2041,16 @@...
2011 Aug 13
3
Japanese and Korean Fonts inside Wine.
I installed Ubuntu in English. How can I now force Wine to show Korean (Hangeul) and Japanese (Hiragana/Katakana/kanji) fonts?
2015 Mar 20
0
Wine release 1.7.39
...ryInformationFile 38209 WshScriptExec's Terminate method is missing 38220 Chemcraft crashes 38221 Wrong parameter assignment in get_family_names_from_stream() 38223 SoManyEuros program doesnt run (needs IFile::get_Path() implementation) 38226 EBWin(Unicode) crashes when searching katakana words. (_mbctohira is a stub) 38251 Assign value to function parameter ---------------------------------------------------------------- Changes since 1.7.38: Aaryaman Vasishta (5): d3drm/tests: Add tests for querying between IDirect3DRM interfaces. d3drm: Fix IDirect3DRM::QueryInt...
2007 Jun 05
7
Chinese, Japanese, Korean Tokenizer.
Hi, I am looking for a Chinese, Japanese and Korean tokenizer that can be used to tokenize terms for CJK languages. I am not very familiar with these languages; however, I think that they can contain one or more words in a single symbol, which makes it more difficult to tokenize text into searchable terms. Lucene has CJK Tokenizer ... and I am looking around if there is some open source that we
2018 Sep 13
0
Wine release 3.0.3
...moryExceptionCallback 45403 dwarf2_parse_variable(): Assertion `subpgm->func' failed 45428 Crash Bandicoot N. Sane Trilogy (Steam) crashes while entering main menu in ntdll.NtGetContextThread() with NULL context 45439 Visual Basic setup toolkit fails to run if path contains a voiced Katakana character. 45445 1C:Enterprise 8 platform doesn't install ---------------------------------------------------------------- Changes since 3.0.2: Akihiro Sagawa (2): kernel32/tests: Add more LCMapString tests. kernel32: Fix LCMapString(LCMAP_HALFWIDTH) in some cases. Alex Henri...
2017 Jan 24
0
Wine release 2.0
...ck supports the IP_DONTFRAGMENT socket option. - Long URLs are handled correctly. - JavaScript performance is improved. JSON is also supported. *** Internationalization - There is proper support for the Persian locale. - Uniscribe better supports right-to-left languages. - Japanese Hiragana, Katakana and full-width/half-width character mapping is implemented. *** 64-bit support - 64-bit applications are supported on macOS. - The Mono engine has 64-bit support. - C++ exceptions are handled correctly on x86-64, including nested exceptions and collided unwinds. - The debug registers are...
2018 Jul 20
0
Wine release 3.13
...S Desktop 10.6 calls unimplemented function msvcr120.dll._SetWinRTOutOfMemoryExceptionCallback 45428 Crash Bandicoot N. Sane Trilogy (Steam) crashes while entering main menu in ntdll.NtGetContextThread() with NULL context 45439 Visual Basic setup toolkit fails to run if path contains a voiced Katakana character. 45445 1C:Enterprise 8 platform doesn't install 45448 vSphere 5.5 client 'hcmon.sys' driver crashes on unimplemented function ntoskrnl.exe.KeFlushQueuedDpcs ---------------------------------------------------------------- Changes since 3.12: Akihiro Sagawa (8):...