Displaying 12 results from an estimated 12 matches for "katakana".
Did you mean:
atakan
2024 Jan 08
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
...latter is valid, just removing this block (or removing the
> parts of it which are Lu or Ll) should fix the problem as then
> tokenisation will switch mode - I tried this and it fixes your case at
> least:
Removing the whole block will cause word-breaker to not correctly handle halfwidth Katakana, such as "??????????" which it would treat as a single term, whereas it should be two: ??????and ????).
My pull request causes word-breaker to only handle halfwidth Katakana and Hangul codepoints as unbroken script and treats Latin characters, numbers, symbols and punctuation as broken...
2007 Jun 29
3
[PATCH] Fix keymap for Japanese keyboard
Hi All,
We tested with Japanese keyboard.
Then, the local keys that was not able to be input with a Japanese keyboard was
found.
This patch added the key that was not able to be input to the keymap.
The key that cannot be input is as follows.
・Katakana
・Eisu_Toggle
Signed-off-by: Takanori Kasai <kasai.takanori@jp.fujitsu.com>
Signed-off-by: Junko Ichino <ichino.junko@jp.fujitsu.com>
Best Regards,
--
Takanori Kasai
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xe...
2024 Jan 07
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Thu, Jan 04, 2024 at 05:50:22PM +0100, Robert Stepanek wrote:
> Since I am undecided yet if and how to fix this in Xapian I haven't
> come up with a pull request. Because trac currently is offline, I
> could not file a bug. I hope it's OK to post my analysis here first,
> I'll be happy to follow up reporting that bug proper later (should we
> conclude that it actually
2024 Jan 09
1
Possible bug using FLAG_WORD_BREAKS with fullwidth Unicode codepoints
On Mon, Jan 08, 2024 at 02:01:46PM +0100, Robert Stepanek wrote:
> Removing the whole block will cause word-breaker to not correctly
> handle halfwidth Katakana, such as "??????????" which it would treat
> as a single term, whereas it should be two: ??????and ????).
>
> My pull request causes word-breaker to only handle halfwidth Katakana
> and Hangul codepoints as unbroken script and treats Latin characters,
> numbers, symbols a...
2011 Apr 21
2
Chinese segmentation
hello, I have finished reading the papers, and i think it is time to design
my project.
First step will be determine the input characters are Chinese. i see the
past post that cjk-tokenizer is just dealing with UTF-8 and unicode, but i
see some other code system such as gbk and big5. i am wondering that should
i just deal with UTF-8 and unicode?
2007 Sep 13
1
chartr better
For example, the following changes are necessary when i convert a
Japanese hiragana into katakana in chattr.
R code:
> chartr("\u3041-\u3093","\u30a1-\u30f3","\u3084\u3063\u305f\u30fc")
--- R-alpha.orig/src/main/character.c 2007-09-05 07:13:27.000000000 +0900
+++ R-alpha/src/main/character.c 2007-09-13 16:10:21.000000000 +0900
@@ -2041,6 +2041,16 @@...
2011 Aug 13
3
Japanese and Korean Fonts inside Wine.
I installed Ubuntu in English.
How now, "Wine" to force to show the Korean (Hangeul) and Japanese (Hiragana/Katakana/kanji) fonts?.
2015 Mar 20
0
Wine release 1.7.39
...ryInformationFile
38209 WshScriptExec's Terminate method is missing
38220 Chemcraft crashes
38221 Wrong parameter assignment in get_family_names_from_stream()
38223 SoManyEuros program doesnt run (needs IFile::get_Path() implementation)
38226 EBWin(Unicode) crashes when searching katakana words. (_mbctohira is a stub)
38251 Assign value to function parameter
----------------------------------------------------------------
Changes since 1.7.38:
Aaryaman Vasishta (5):
d3drm/tests: Add tests for querying between IDirect3DRM interfaces.
d3drm: Fix IDirect3DRM::QueryInt...
2007 Jun 05
7
Chinese, Japanese, Korean Tokenizer.
Hi,
I am looking for Chinese Japanese and Korean tokenizer that could can
be use to tokenize terms for CJK languages. I am not very familiar
with these languages however I think that these languages contains one
or more words in one symbol which it make more difficult to tokenize
into searchable terms.
Lucene has CJK Tokenizer ... and I am looking around if there is some
open source that we
2018 Sep 13
0
Wine release 3.0.3
...moryExceptionCallback
45403 dwarf2_parse_variable(): Assertion `subpgm->func' failed
45428 Crash Bandicoot N. Sane Trilogy (Steam) crashes while entering main menu in ntdll.NtGetContextThread() with NULL context
45439 Visual Basic setup toolkit fails to run if path contains a voiced Katakana character.
45445 1C:Enterprise 8 platform doesn't install
----------------------------------------------------------------
Changes since 3.0.2:
Akihiro Sagawa (2):
kernel32/tests: Add more LCMapString tests.
kernel32: Fix LCMapString(LCMAP_HALFWIDTH) in some cases.
Alex Henri...
2017 Jan 24
0
Wine release 2.0
...ck supports the IP_DONTFRAGMENT socket option.
- Long URLs are handled correctly.
- JavaScript performance is improved. JSON is also supported.
*** Internationalization
- There is proper support for the Persian locale.
- Uniscribe better supports right-to-left languages.
- Japanese Hiragana, Katakana and full-width/half-width character
mapping is implemented.
*** 64-bit support
- 64-bit applications are supported on macOS.
- The Mono engine has 64-bit support.
- C++ exceptions are handled correctly on x86-64, including nested
exceptions and collided unwinds.
- The debug registers are...
2018 Jul 20
0
Wine release 3.13
...S Desktop 10.6 calls unimplemented function msvcr120.dll._SetWinRTOutOfMemoryExceptionCallback
45428 Crash Bandicoot N. Sane Trilogy (Steam) crashes while entering main menu in ntdll.NtGetContextThread() with NULL context
45439 Visual Basic setup toolkit fails to run if path contains a voiced Katakana character.
45445 1C:Enterprise 8 platform doesn't install
45448 vSphere 5.5 client 'hcmon.sys' driver crashes on unimplemented function ntoskrnl.exe.KeFlushQueuedDpcs
----------------------------------------------------------------
Changes since 3.12:
Akihiro Sagawa (8):...