search for: translit

Displaying 20 results from an estimated 38 matches for "translit".

2017 Jun 27
3
Windows iconv() "failure" in certain locales
..._CTYPE", "Chinese") [1] "Chinese (Simplified)_People's Republic of China.936" > > iconv(x1, "latin1", "") # NA NA NA [1] NA NA NA > iconv(xU, "UTF-8", "") # NA NA NA [1] NA NA NA > iconv(xU, "UTF-8", "//TRANSLIT") [1] "Ekstr?m" "J?reskog" "bi?chen Z?rcher" > iconv(xU, "UTF-8", "", sub = "byte") [1] "Ekstr<c3><b8>m" "J<c3><b6>reskog" "bi<c3><9f>chen Z...
2017 Jun 28
0
Windows iconv() "failure" in certain locales
...") > [1] "Chinese (Simplified)_People's Republic of China.936" >> >> iconv(x1, "latin1", "") # NA NA NA > [1] NA NA NA >> iconv(xU, "UTF-8", "") # NA NA NA > [1] NA NA NA >> iconv(xU, "UTF-8", "//TRANSLIT") > [1] "Ekstr?m" "J?reskog" "bi?chen Z?rcher" Interesting, I get Chinese characters here. Besides the comments from Duncan Murdoch: iconv(x1, "latin1", "", sub="?") etc. would be an alternative in case some charact...
2014 Dec 15
2
Making iconv portable?
...take to make "iconv" portable? > > >> I ask, because I want to convert accented characters to >> vanilla ASCII, thereby converting, e.g., "Raúl" to "Raul", and >> Milan Bouchet-Valet suggested on R-help that I use 'iconv(x, >> "", "ASCII//TRANSLIT")'. This worked under Windows but failed >> on Linux and Mac. It's part of the "subNonStandardCharacters" >> function in the Ecfun package. The development version on >> R-Forge uses this and returns "Raul" under Windows and NA >> under Mac OS X (and presumably...
2014 Dec 15
1
Making iconv portable?
...>>> >>>> I ask, because I want to convert accented characters to >>>> vanilla ASCII, thereby converting, e.g., "Raúl" to "Raul", and >>>> Milan Bouchet-Valet suggested on R-help that I use 'iconv(x, >>>> "", "ASCII//TRANSLIT")'. This worked under Windows but failed >>>> on Linux and Mac. It's part of the "subNonStandardCharacters" >>>> function in the Ecfun package. The development version on >>>> R-Forge uses this and returns "Raul" under Windows and NA >>>>...
2014 Dec 15
3
Making iconv portable?
Hello, All: What would it take to make "iconv" portable? I ask, because I want to convert accented characters to vanilla ASCII, thereby converting, e.g., "Raúl" to "Raul", and Milan Bouchet-Valet suggested on R-help that I use 'iconv(x, "", "ASCII//TRANSLIT")'. This worked under Windows but failed on Linux and Mac. It's part of the "subNonStandardCharacters" function in the Ecfun package. The development version on R-Forge uses this and returns "Raul" under Windows and NA under Mac OS X (and presumably also Linux). The "iconv" R code merely...
2008 Jan 03
1
Iconv translit working in every situation… but the running app!
Hi all, I scoured the web, including this ML's archives, and did not find an answer for this. I have the following code in my Rails app: require 'iconv' ... @@converter = Iconv.new('ASCII//TRANSLIT', 'UTF-8') ... result = @@converter.iconv(text.to_s).downcase ... Now, the facts: * Inbound text is indeed valid UTF-8 * Transliteration works fine in the following situations: * Same box, irb, my user or the Rails app's user * Same box, rub...
2014 Dec 15
0
Making iconv portable?
...rtable? >> >> >>> I ask, because I want to convert accented characters to >>> vanilla ASCII, thereby converting, e.g., "Raúl" to "Raul", and >>> Milan Bouchet-Valet suggested on R-help that I use 'iconv(x, >>> "", "ASCII//TRANSLIT")'. This worked under Windows but failed >>> on Linux and Mac. It's part of the "subNonStandardCharacters" >>> function in the Ecfun package. The development version on >>> R-Forge uses this and returns "Raul" under Windows and NA >>> under Mac OS X...
2017 Jun 29
1
Windows iconv() "failure" in certain locales
...'s Republic of China.936" >>> >>> iconv(x1, "latin1", "") # NA NA NA >> [1] NA NA NA >>> iconv(xU, "UTF-8", "") # NA NA NA >> [1] NA NA NA >>> iconv(xU, "UTF-8", "//TRANSLIT") >> [1] "Ekstr?m" "J?reskog" "bi?chen Z?rcher" > Interesting, I get Chinese characters here. For which one of the above cases; can you show them (it may survive E-mail servers; we had other Chinese R strings on R-help / R-deve...
2012 Mar 14
5
Does Ruby 1.9 support Unicode normalization yet?
In the process of upgrading from 1.8 to 1.9 we are getting a lot of warnings about "Ruby 1.9 doesn't support Unicode normalization yet". However the commit that added those lines is from 2008 and just mentions "Ruby 1.9 compat: no Unicode normalization support yet" without any references. Does anyone know whether this is still true for ruby 1.9 and for which minor
2012 Dec 13
2
Term matrix size and memory. TM package
...library(wordcloud) library(Rstem) library(Snowball) # reads the UTF-8 document and converts it to ASCII txt <- readLines("D:/Publico/Documents/texto1.txt",encoding="UTF-8") txt = iconv(txt, to="ASCII//TRANSLIT") # builds a corpus corpus <- Corpus(VectorSource(txt)) # converts to lowercase corpus <- tm_map(corpus, tolower) # strips whitespace corpus <- tm_map(corpus, stripWhitespace) # removes punctuation corpus <...
2007 Jan 19
3
How to have 'o' == 'ö'
... retrieves the document. But I would like searching for "Mongrel" to also retrieve the document. Which it does not currently. Anyone have any good solutions to this problem? I suppose I could filter the documents and queries first with something like: (Iconv.new('US-ASCII//TRANSLIT', 'utf-8').iconv "M?ngrel ?Horsemen?").gsub(/[^a-zA-Z0-9]/im,"") But perhaps there is a better, or built-in solution. Thanks -- Posted via http://www.ruby-forum.com/.
2017 Jun 26
3
suggestion to fix packageDescription() for Windows users
...& Sys.getlocale("LC_CTYPE") == > "C") | > unname(Sys.info()['sysname']) == "Windows") > #--------------- > > If I understand correctly, that will force > ASCII//TRANSLIT encoding when > DESCRIPTION files are read by packageDescription() on > Windows machines. > The upside is that Windows users will get the authors in > the package > citation, unlike the current situation....
2013 Jun 28
3
Removing accent marks
Hello. Right now I can't figure out how to remove the accent marks from a text. I tried v1 = sub("([[:punct:]])","",v1) but it doesn't work. Any ideas? Thanks
2011 Feb 18
1
non-ascii characters in R output
...s associated with summary.lm are multi-byte characters on my platform. This particular problem is solved by options(useFancyQuotes=FALSE). But now I'm concerned about other non-ASCII characters. As an overkill maybe, my current solution involves capture.output and iconv(..., to="ASCII//TRANSLIT"). Are there other sources of non-ASCII characters? Is there a better or general solution? Best, Matt > sessionInfo() R version 2.12.1 (2010-12-16) Platform: x86_64-pc-linux-gnu (64-bit) locale: [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en...
2007 Mar 28
2
iconv does not work with french characters
...ing to convert French characters into Latin ones using iconv but I face some problems. Here is what I am doing in ruby console: >> $KCODE = 'u' => "u" >> @x = 'x éèçà x' => "x éèçà x" >> Iconv.new('US-ASCII//TRANSLIT', 'utf-8').iconv @x => "x ???? x" And I expect "x eeca x" as output. ruby --version returns: ruby 1.8.2 (2005-04-11) [i386-linux] iconv --version returns: iconv (GNU libiconv 1.11) Copyright (C) 2000-2006 Free Software Foundation, Inc. This is fr...
2010 Dec 16
2
Samba printer management commands
...======================================== #!/bin/bash { ########################################################################## # Parameters ########################################################################## # Printer name LPNAME=$(echo "$1" | iconv -t ascii//translit | sed --regexp-extended 's/\s+/_/g') # Share name SHARENAME=$2 # Port name PORTNAME=$3 # Driver name DRIVERNAME=$4 # Location LOCATION_OLD=$5 # Location in Windows 95 LOCATIONWIN9X=$6 # Check the syntax of the location...
2012 Oct 25
2
Text mining
...orpus, function(x) removeWords(x, c(stopwords("spanish"),"rt"))) tw.corpus = tm_map(tw.corpus, removeWords, my.stopwords) tw.corpus = tm_map(tw.corpus, stripWhitespace) sw <- readLines("stopwords.es.txt",encoding="UTF-8") sw = iconv(sw, to="ASCII//TRANSLIT") tw.corpus = tm_map(tw.corpus, removeWords, sw) doc.m = TermDocumentMatrix(tw.corpus, control = list(minWordLength = 2)) dm = as.matrix(doc.m) # calculate the frequency of words v = sort(rowSums(dm), decreasing=TRUE) d = data.frame(word=names(v), freq=v) #Generate the wordcloud pal2 ...
2017 Jun 17
2
suggestion to fix packageDescription() for Windows users
..."LC_CTYPE") == "C") #--------------- to: #--------------- if ((missing(encoding) && Sys.getlocale("LC_CTYPE") == "C") | unname(Sys.info()['sysname']) == "Windows") #--------------- If I understand correctly, that will force ASCII//TRANSLIT encoding when DESCRIPTION files are read by packageDescription() on Windows machines. The upside is that Windows users will get the authors in the package citation, unlike the current situation. The downside is that the exotic symbols in the authors' names are replaced with common ones that...
2017 Jun 17
3
suggestion to fix packageDescription() for Windows users
...>> >> #--------------- >> if ((missing(encoding) && Sys.getlocale("LC_CTYPE") == "C") | >> unname(Sys.info()['sysname']) == "Windows") >> #--------------- >> >> If I understand correctly, that will force ASCII//TRANSLIT encoding when >> DESCRIPTION files are read by packageDescription() on Windows machines. >> The upside is that Windows users will get the authors in the package >> citation, unlike the current situation. The downside is that the exotic >> symbols in the authors' names ar...
2017 Nov 27
1
AD: password change bug?
...user on win7/win10 changes password and uses non-ascii characters the mail obviously does not work, but the rest of the domain is ok. so we thought to use the "check password script". something like: #!/bin/bash read -t 2 PWD ASCII_PWD=$(echo "$PWD"|iconv -f utf-8 -t ascii//TRANSLIT) if [ "${PWD}" != "${ASCII_PWD}" ]; then exit 1 fi exit 0 Simple, and it **seems** to work. During testing and debugging I had the script output the input and all possible conversions from all iconv-supported charsets to utf-8, because I was seeing strange things. Nothi...
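
Several of the threads listed above report that `ASCII//TRANSLIT` gives different results depending on platform and locale ("Raul" on one system, NA or "?" on another). As a point of comparison only, and not something proposed in any of these threads, here is a minimal Python sketch of an accent-stripping approach that does not depend on the system iconv: it decomposes each character with Unicode normalization and drops the combining marks. The function names `strip_accents` and `to_ascii` and the `placeholder` parameter are hypothetical, chosen for illustration.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Remove combining marks after NFKD decomposition (e.g. 'ú' -> 'u')."""
    decomposed = unicodedata.normalize("NFKD", text)
    # unicodedata.combining() returns a non-zero class for combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def to_ascii(text: str, placeholder: str = "?") -> str:
    """Strip accents, then replace any remaining non-ASCII character.

    Characters with no decomposition (such as 'ø' or 'ß') are not
    transliterated; they become the placeholder instead.
    """
    stripped = strip_accents(text)
    return "".join(ch if ord(ch) < 128 else placeholder for ch in stripped)


if __name__ == "__main__":
    for sample in ["Raúl", "x éèçà x", "Jöreskog", "Ekstrøm"]:
        print(sample, "->", to_ascii(sample))
```

Unlike iconv transliteration tables, this sketch only handles characters that decompose into a base letter plus marks, so letters such as "ø" and "ß" fall through to the placeholder instead of becoming "o" or "ss".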