search for: html2text

Displaying 15 results from an estimated 15 matches for "html2text".

2006 Jun 29
2
html2text in php
Hi there! I recently ported Aaron Swartz's html2text.py to PHP and would like to know what you think about it. Any suggestions and bug reports are much appreciated. Check it out: http://milianw.de/projects/html2text/ Note: Michel Fortin's PHP Markdown Extra is supported (that is: tables and definition lists are supported in some way) There are s...
2007 Mar 22
4
converting html with \xa9 to Markdown and using iconv?
The HTML documents contain various characters such as \xa0 and \xa9 (copyright symbol), among others. I tried using html2text.py but it didn't like these characters. Any ideas on how I can use iconv or another tool to convert documents like this so I can then convert to Markdown? I don't want to do this manually as I have around 500+ documents. Jeremy C. Reed
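A minimal sketch of the kind of batch transcoding asked about above, under a loud assumption: the bytes \xa0 and \xa9 are read as ISO-8859-1 (Latin-1), where they are the non-breaking space and the copyright sign. The post does not state the source encoding, and the function names and directory layout here are hypothetical. The rough command-line counterpart would be `iconv -f ISO-8859-1 -t UTF-8`.

```python
from pathlib import Path


def transcode_to_utf8(raw: bytes, source_encoding: str = "iso-8859-1") -> bytes:
    """Decode bytes from the ASSUMED source encoding and re-encode them as UTF-8."""
    return raw.decode(source_encoding).encode("utf-8")


def transcode_tree(src_dir: str, dst_dir: str, source_encoding: str = "iso-8859-1") -> int:
    """Transcode every *.html file under src_dir into dst_dir; return the file count.

    The directory layout is hypothetical; the originals are left untouched.
    """
    src, dst = Path(src_dir), Path(dst_dir)
    count = 0
    # sorted() materialises the file list before anything is written.
    for path in sorted(src.rglob("*.html")):
        target = dst / path.relative_to(src)
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_bytes(transcode_to_utf8(path.read_bytes(), source_encoding))
        count += 1
    return count
```

The UTF-8 copies could then be passed to whichever HTML-to-Markdown converter is in use. If the files turn out to be in another encoding (Windows-1252, for instance), only the `source_encoding` argument would need to change.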
2006 Oct 17
2
back-translation and round-tripping
...regenerate the output again and compare it with the "original" output, repeating if necessary until there exists a pair of files that give perfect agreement. has anyone done such experiments already? is aaron's back-translation tool the best one? > http://www.aaronsw.com/2002/html2text/ anyone know if/when/where it has problems? thanks in advance... -bowerbird p.s. anyone here have reaction to the analysis of markdown from ivan krstic for the one-laptop-per-child project? > http://dev.laptop.org/git.do?p=users/krstic/docformat;a=blob;h=d191a5a9f8beb6a052b5e8d30f7a4d...
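The round-tripping experiment described in this snippet can be sketched as a fixed-point loop. This is only an illustration under assumptions: the two converters are passed in as plain callables (hypothetical stand-ins for a Markdown renderer and a back-translation tool such as html2text), and `max_rounds` is an arbitrary cutoff that is not part of the original post.

```python
from typing import Callable, Tuple


def round_trip_until_stable(
    html: str,
    to_markdown: Callable[[str], str],  # hypothetical HTML -> Markdown converter
    to_html: Callable[[str], str],      # hypothetical Markdown -> HTML converter
    max_rounds: int = 10,               # arbitrary cutoff, an assumption
) -> Tuple[str, str, int]:
    """Repeat HTML -> Markdown -> HTML until the regenerated HTML matches its input.

    Returns (markdown, html, rounds) for the first pair in perfect agreement.
    Raises RuntimeError if no such pair appears within max_rounds.
    """
    for rounds in range(1, max_rounds + 1):
        markdown = to_markdown(html)
        regenerated = to_html(markdown)
        if regenerated == html:
            return markdown, html, rounds
        # No agreement yet: treat the regenerated output as the new "original".
        html = regenerated
    raise RuntimeError(f"no stable pair after {max_rounds} rounds")
```

Passing the converters in as arguments keeps the loop independent of any particular tool, so the same harness could compare several back-translators on the same input.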
2007 Oct 08
1
First Markdownify Beta released
Hi folks! I finally released the first Beta of Markdownify - the HTML to Markdown converter for PHP. It is the successor to the old `html2text.php` and is much more stable. It's pretty much a complete rewrite and supports all Markdown and PHP Markdown Extra syntax features. It's beta since I still have to add a word wrapping feature and "Markdown inside block elements" is not yet working for PHP Markdown Extra. If y...
2003 Aug 08
2
Comprehensive Package/Library list?
Is there a list of all contributed R libraries available through CRAN? Ideally it would include a one or two line description. I am looking for a packages() command similar to library() but that would access the CRAN repository and provide a listing of the current libraries, and version. example: http://www.cpan.org/modules/01modules.index.html
2007 Sep 30
1
two pandoc web apps
I've put together two small web apps to demonstrate pandoc: 1. [html2x](http://johnmacfarlane.net/pandoc/html2x.html) can convert most web pages to markdown, reStructuredText, DocBook, LaTeX, ConTeXt, RTF, or groff man. Bookmarklets are provided. html2x is modeled on Aaron Swartz's [html2text](http://www.aaronsw.com/2002/html2text/), but it's faster, supports multiple output formats, and behaves a bit differently (for example, it wraps text, puts all the links at the end instead of interspersing them, and uses text instead of numbers for link identifiers). 2. [Try Pandoc](http://j...
2007 Jul 13
0
Markdown rendering question
Hello, I'm trying to render the following page in markdown: http://lua-users.org/wiki/RiciLake This page was converted into markdown using Aaron Swartz's html2text [1]. And then rendered to HTML using Niklas Frykholm's markdown.lua [2]. Everything works quite nicely... except that under 'Lua Enhancements'... the very first header is not rendered as a header, but as plain text: ### Lua Enhancements **Lua 5.1:** <<<< renders...
2007 Aug 08
2
Backtick Hiccup
I'm using Michel Fortin's MDTest cases to rewrite my html2text.php script. Just now I stumbled upon this bug (in PHP Markdown at least): # Input: Backtick: ``\``` # Output: <p>Backtick: ``&#96;``</p> # Should-Be Output: <p>Backtick: <code>`</code></p> If you (Michel) are already aware of this, just ignore me...
2015 Oct 14
3
Strange indexing behavior on HTML emails ..
Hi, In continuation of the issue I posted about a while back: http://www.dovecot.org/list/dovecot/2014-August/097362.html I did further testing today on a fresh new Debian & latest Dovecot and observed an undesired behavior. I am using fts_lucene & the following sequence of commands on an empty test account me at myself.com: doveadm expunge -u 'my at myself.com' mailbox
2011 Oct 28
0
the next mountain for markdown to climb up
...have variants of a program that can do it... and yes, probably 237 of 'em produce different results, but the point is that markdown-to-html just isn't that difficult... so let's move ourselves forward to back-translation, shall we? and, of course, the prior art for markdown is "html2text": > http://www.aaronsw.com/2002/html2text/ by aaron. and it dates to 2002. but it probably still works ok, because -- as the page shows -- it's constantly being updated. indeed, the latest version appears to be hosted on github, with the latest file with a mod-date of just two d...
2015 Oct 14
0
Strange indexing behavior on HTML emails ..
...ly. The problem with single quotes in HTML is still there. The revision: http://hg.dovecot.org/dovecot-2.2/rev/ad028a950248 should have solved it but the relevant code no longer exists in src/plugins/fts/fts-parser-html.c. Seems like it has been moved into lib-mail. The file src/lib-mail/mail-html2text.c does contain something about single quotes but to no avail. Can someone at least confirm the existence of this issue?
2015 Oct 15
3
Strange indexing behavior on HTML emails ..
...HTML is still there. > > The revision: > > http://hg.dovecot.org/dovecot-2.2/rev/ad028a950248 > > should have solved it but the relevant code no longer exists in > src/plugins/fts/fts-parser-html.c. Seems like it has been moved into > lib-mail. The file src/lib-mail/mail-html2text.c does contain something > about single quotes but to no avail. Can someone at-least confirm existence > of this issue? Thanks for the report. Bug found. My bad. A patch is working its way through the internal process, and will be in the public tree soon. Cheers, Phil
2012 Dec 13
1
standalone markdown
Hi, is it possible to run the Markdown app standalone, that is, not as part of a web server? So the model would be to write a file in Markdown and then use the Markdown app to convert the file into (x)html for local browsing in a browser. I want to learn Markdown before I have to go through the pain of setting up a web server. I have been trying to get jetty working on Fedora 16 but keep
2015 Mar 22
0
2.2.16 link failure on FreeBSD 10.1, with patch
...c -std=gnu99 -I/usr/local/include -Wall -W -Wmissing-prototypes -Wmissing-declarations -Wpointer-arith -Wchar-subscripts -Wformat=2 -Wbad-function-cast -Wno-duplicate-decl-specifier -Wstrict-aliasing=2 -Wl,--as-needed -o test-message-snippet test-message-snippet.o .libs/message-snippet.o .libs/mail-html2text.o .libs/message-decoder.o .libs/quoted-printable.o .libs/rfc822-parser.o .libs/rfc2231-parser.o /usr/local/lib/libiconv.so -Wl,-rpath -Wl,/usr/local/lib .libs/message-parser.o .libs/message-header-parser.o .libs/message-header-decode.o .libs/message-size.o -L/usr/local/lib ../lib-charset/.libs/lib...
2005 Mar 14
1
colinux fresh install, zaptel does not compile, size_t error
...ep and fgrep ii groff-base 1.18.1-15 GNU troff text-formatting system (base system components) ii gzip 1.3.5-8 The GNU compression utility ii hostname 2.13 A utility to set/show the host name or domain name ii html2text 1.3.2a-1 An advanced HTML to text converter ii ifupdown 0.6.4-4.8 High level tools to configure network interfaces ii info 4.6-1 Standalone GNU Info documentation browser ii initscripts...