Displaying 15 results from an estimated 15 matches for "html2text".
2006 Jun 29
2
html2text in php
Hi there!
I recently ported Aaron Swartz' html2text.py to PHP and would like to know
what you think about it.
Any suggestions and bug reports are much appreciated.
Check it out: http://milianw.de/projects/html2text/
Note: Michel Fortin's PHP Markdown Extra is supported (that is, tables and
definition lists are supported in some way)
There are s...
2007 Mar 22
4
converting html with \xa9 to Markdown and using iconv?
The html document contains various characters like
? \xa0
? \xa9 (Copyright symbol)
(and others).
I tried using html2text.py but it didn't like these characters.
Any ideas on how I can use iconv or another tool to convert documents like
this so I can then convert to Markdown?
I don't want to do this manually, as I have 500+ documents.
Jeremy C. Reed
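
One possible approach to the question above, as a minimal sketch: if the documents are ISO-8859-1 (Latin-1) encoded, which is only an assumption based on the \xa0 and \xa9 bytes mentioned, they can be re-encoded as UTF-8 before conversion. This is roughly what `iconv -f ISO-8859-1 -t UTF-8` does. The function names `latin1_to_utf8` and `convert_file` are hypothetical and not part of html2text.py.

```python
# Sketch only: assumes the input bytes are ISO-8859-1 (Latin-1). That encoding
# is a guess based on the \xa0 and \xa9 bytes mentioned in the post.


def latin1_to_utf8(raw: bytes, source_encoding: str = "iso-8859-1") -> bytes:
    """Decode raw bytes from source_encoding and re-encode them as UTF-8."""
    return raw.decode(source_encoding).encode("utf-8")


def convert_file(src_path: str, dst_path: str,
                 source_encoding: str = "iso-8859-1") -> None:
    """Rewrite one file as UTF-8 (hypothetical helper for batch use)."""
    with open(src_path, "rb") as src:
        data = src.read()
    with open(dst_path, "wb") as dst:
        dst.write(latin1_to_utf8(data, source_encoding))
```

Calling `convert_file` in a loop over the 500+ documents would avoid manual work. If the files are actually in another encoding (for example Windows-1252), the `source_encoding` argument would need to change accordingly.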
2006 Oct 17
2
back-translation and round-tripping
...regenerate the output again and
compare it with the "original" output,
repeating if necessary until there exists
a pair of files that give perfect agreement.
has anyone done such experiments already?
is aaron's back-translation tool the best one?
> http://www.aaronsw.com/2002/html2text/
anyone know if/when/where it has problems?
thanks in advance...
-bowerbird
p.s. anyone here have reaction to the
analysis of markdown from ivan krstic
for the one-laptop-per-child project?
>
http://dev.laptop.org/git.do?p=users/krstic/docformat;a=blob;h=d191a5a9f8beb6a052b5e8d30f7a4d...
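
The round-tripping loop described in this post (regenerate the output, compare it with the previous output, repeat until two outputs agree) could be sketched as follows. This is an illustration under stated assumptions, not the poster's method: `forward` and `backward` stand in for any markdown-to-HTML converter and any back-translation tool such as html2text, and `round_trip_until_stable` and `max_rounds` are hypothetical names.

```python
from typing import Callable, Optional, Tuple


def round_trip_until_stable(
    source: str,
    forward: Callable[[str], str],    # e.g. markdown -> HTML (assumed)
    backward: Callable[[str], str],   # e.g. HTML -> markdown (assumed)
    max_rounds: int = 10,
) -> Optional[Tuple[int, str]]:
    """Return (rounds, output) once two successive outputs agree.

    Returns None if no agreement is reached within max_rounds.
    """
    previous = forward(source)
    for rounds in range(1, max_rounds + 1):
        # Back-translate the output, then regenerate the output from it.
        current = forward(backward(previous))
        if current == previous:
            return rounds, current
        previous = current
    return None


# Toy stand-ins for real converters, for demonstration only.
def toy_forward(text: str) -> str:
    return "<p>" + " ".join(text.split()) + "</p>"


def toy_backward(html: str) -> str:
    if html.startswith("<p>") and html.endswith("</p>"):
        return html[3:-4]
    return html
```

The `max_rounds` cap is a design choice: nothing guarantees that a given pair of converters ever settles on a fixed point, so the loop gives up rather than running forever.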
2007 Oct 08
1
First Markdownify Beta released
Hi folks!
I finally released the first Beta of Markdownify - the HTML to Markdown
converter for PHP. It is the successor to the old `html2text.php` and is much
more stable. It's pretty much a complete rewrite and supports all Markdown
and PHP Markdown Extra syntax features.
It's beta since I still have to add a word wrapping feature and "Markdown
inside block elements" is not yet working for PHP Markdown Extra.
If y...
2003 Aug 08
2
Comprehensive Package/Library list?
Is there a list of all contributed R libraries available through CRAN?
Ideally it would include a one or two line description.
I am looking for a packages() command similar to library()
but that would access the CRAN repository and provide a
listing of the current libraries, and version.
example:
http://www.cpan.org/modules/01modules.index.html
2007 Sep 30
1
two pandoc web apps
I've put together two small web apps to demonstrate pandoc:
1. [html2x](http://johnmacfarlane.net/pandoc/html2x.html) can convert
most web pages to markdown, reStructuredText, DocBook, LaTeX, ConTeXt,
RTF, or groff man. Bookmarklets are provided. html2x is modeled
on Aaron Swartz's [html2text](http://www.aaronsw.com/2002/html2text/),
but it's faster, supports multiple output formats, and behaves a bit
differently (for example, it wraps text, puts all the links at the
end instead of interspersing them, and uses text instead of numbers
for link identifiers).
2. [Try Pandoc](http://j...
2007 Jul 13
0
Markdown rendering question
Hello,
I'm trying to render the following page in markdown:
http://lua-users.org/wiki/RiciLake
This page was converted into markdown using Aaron Swartz's html2text
[1]. And then rendered to HTML using Niklas Frykholm's markdown.lua
[2].
Everything works quite nicely... except that under 'Lua
Enhancements'... the very first header is not rendered as a header, but
as plain text:
### Lua Enhancements
**Lua 5.1:** <<<< renders...
2007 Aug 08
2
Backtick Hiccup
I'm using Michel Fortin's MDTest cases to rewrite my html2text.php script.
Just now I stumbled upon this bug (in PHP Markdown at least):
# Input:
Backtick: ``\```
# Output:
<p>Backtick: `````</p>
# Should-Be Output:
<p>Backtick: <code>`</code></p>
If you (Michel) are already aware of this, just ignore me...
2015 Oct 14
3
Strange indexing behavior on HTML emails ..
Hi,
In continuation of the issue I posted about a while back:
http://www.dovecot.org/list/dovecot/2014-August/097362.html
I did further testing today on a fresh new Debian & latest Dovecot and
observed an undesired behavior. I am using fts_lucene & the following
sequence of commands on an empty test account me at myself.com:
doveadm expunge -u 'my at myself.com' mailbox
2011 Oct 28
0
the next mountain for markdown to climb up
...have variants of a program that can do it...
and yes, probably 237 of 'em produce different results, but
the point is that markdown-to-html just isn't that difficult...
so let's move ourselves forward to back-translation, shall we?
and, of course, the prior art for markdown is "html2text":
> http://www.aaronsw.com/2002/html2text/
by aaron. and it dates to 2002. but it probably still works ok,
because -- as the page shows -- it's constantly being updated.
indeed, the latest version appears to be hosted on github, with
the latest file with a mod-date of just two d...
2015 Oct 14
0
Strange indexing behavior on HTML emails ..
...ly. The problem with single quotes in HTML is still
there.
The revision:
http://hg.dovecot.org/dovecot-2.2/rev/ad028a950248
should have solved it but the relevant code no longer exists in
src/plugins/fts/fts-parser-html.c. Seems like it has been moved into
lib-mail. The file src/lib-mail/mail-html2text.c does contain something
about single quotes but to no avail. Can someone at least confirm
the existence of this issue?
2015 Oct 15
3
Strange indexing behavior on HTML emails ..
...HTML is still there.
>
> The revision:
>
> http://hg.dovecot.org/dovecot-2.2/rev/ad028a950248
>
> should have solved it but the relevant code no longer exists in
> src/plugins/fts/fts-parser-html.c. Seems like it has been moved into
> lib-mail. The file src/lib-mail/mail-html2text.c does contain something
> about single quotes but to no avail. Can someone at least confirm the
> existence of this issue?
Thanks for the report. Bug found. My bad. A patch is working its way
through the internal process, and will be in the public tree soon.
Cheers,
Phil
2012 Dec 13
1
standalone markdown
Hi
Is it possible to run the Markdown app standalone, that is, not as part
of a webserver? The model would be to write a file in Markdown and then
use the Markdown app to convert the file into (x)html for local browsing
in a browser.
I want to learn Markdown before I have to go through the pain of setting
up a webserver. I have been trying to get jetty working on Fedora 16
but keep
2015 Mar 22
0
2.2.16 link failure on FreeBSD 10.1, with patch
...c -std=gnu99 -I/usr/local/include -Wall -W -Wmissing-prototypes -Wmissing-declarations -Wpointer-arith -Wchar-subscripts -Wformat=2 -Wbad-function-cast -Wno-duplicate-decl-specifier -Wstrict-aliasing=2 -Wl,--as-needed -o test-message-snippet test-message-snippet.o .libs/message-snippet.o .libs/mail-html2text.o .libs/message-decoder.o .libs/quoted-printable.o .libs/rfc822-parser.o .libs/rfc2231-parser.o /usr/local/lib/libiconv.so -Wl,-rpath -Wl,/usr/local/lib .libs/message-parser.o .libs/message-header-parser.o .libs/message-header-decode.o .libs/message-size.o -L/usr/local/lib ../lib-charset/.libs/lib...
2005 Mar 14
1
colinux fresh install, zaptel does not compile, size_t error
...ep and fgrep
ii groff-base 1.18.1-15
GNU troff text-formatting system (base system components)
ii gzip 1.3.5-8
The GNU compression utility
ii hostname 2.13
A utility to set/show the host name or domain name
ii html2text 1.3.2a-1
An advanced HTML to text converter
ii ifupdown 0.6.4-4.8
High level tools to configure network interfaces
ii info 4.6-1
Standalone GNU Info documentation browser
ii initscripts...