Hi all, After a long hiatus, I''ve started again to look at webgen internals. However, this time my goal has totally changed (for those not in the know-how, I planned to use webgen as my main website, with advanced generationf eatures (like grabbing isbn infos for books I wrote about). Now, with the rise of life aggregators and the such, and the fact I now publish all my content on dedicated sites, i''ve started thinking about using webgen as a life aggregator. For that, I''ve written a plugin which, given an OPML file, downloads all entries and generates files for these entries, with supposed unique ids. All that is working. Starting testing sitre generation, i was a little in trouble with maruku going in my way (but rather fast found a solution to change default_processing_pipeline:). However, i now have another issue. My process involves adding identification metadata in meta block, like in the following extract : =======================================--- title: [BienBienBien] Les cochons dans l''open space rss_link: http://riduidel.posterous.com/bienbienbien-les-cochons-dans guid_link: http://riduidel.posterous.com/bienbienbien-les-cochons-dans filemtime: Thu, 25 Sep 2008 13:03:00 +0200 --- <p> <p>Le bouquin dont ils parlent au début, je n''ai pas vraiment besoin de le lire tant ça ressemble à une caricature (et parfois, même pas) de mes différents boulots. <br />En revanche, l''analyse du vocabulaire d''entreprise est totallement criante de vérité.</p> <p>Tenez, allez donc voir <a href="http://feeds.bienbienbien.net/~r/BienBienBien/~3/402620145 =======================================>From the error message I get when launching webgen on this file :C:\Documents and Settings\Maison\Mes documents\webgen\lifestream>webgen Starting webgen... Generating tree... An error has occurred: Error reading source path </lifestream/blogs/riduidel.posterous.com/bienbienbien-les-cochons-dans.page>: syntax error on line 2, col -1 : `rss_link: http://riduidel.posterous.com/bienbienbien-les-cochons-dans guid_link: http://riduidel.posterous.com/bienbienbien-les-cochons-dans filemtime: Thu, 25 Sep 2008 13:03:00 +0200 I can see something is quite wrong. But what ? And how can I correct that ? Thanks in advance for your reply -- Nicolas Delsaux N''imprimez ce mail que si vous ne savez pas le lire sur l''?cran : les ?lectrons se recyclent bien, le papier, beaucoup moins bien.
> My process involves adding identification metadata in meta block, like > in the following extract : > > =======================================> --- > title: [BienBienBien] Les cochons dans l''open space > rss_link: http://riduidel.posterous.com/bienbienbien-les-cochons-dans > guid_link: http://riduidel.posterous.com/bienbienbien-les-cochons-dans > filemtime: Thu, 25 Sep 2008 13:03:00 +0200 > --- > > <p> > <p>Le bouquin dont ils parlent au début, je n''ai pas > vraiment besoin de le lire tant ça ressemble à une > caricature (et parfois, même pas) de mes différents > boulots. <br />En revanche, l''analyse du vocabulaire d''entreprise est > totallement criante de vérité.</p> > <p>Tenez, allez donc voir <a > href="http://feeds.bienbienbien.net/~r/BienBienBien/~3/402620145 > =======================================> > >>From the error message I get when launching webgen on this file : > > C:\Documents and Settings\Maison\Mes > documents\webgen\lifestream>webgen Starting webgen... > Generating tree... > An error has occurred: Error reading source path > </lifestream/blogs/riduidel.posterous.com/bienbienbien-les-cochons-dans.page>: > syntax error on line 2, col -1 : `rss_link: > http://riduidel.posterous.com/bienbienbien-les-cochons-dans > guid_link: http://riduidel.posterous.com/bienbienbien-les-cochons-dans > filemtime: Thu, 25 Sep 2008 13:03:00 +0200 > > I can see something is quite wrong. But what ? And how can I correct > that ?Recall that the meta information part needs to be a valid YAML hash. However, when YAML parses [BienBienBien] it assumes that the value for the title is an array and not a string. So you need to escape your string by wrapping it in double quotes (single quotes won''t work since you are already using one in the title), like this: --- title: "[BienBienBien] Les cochons dans l''open space" rss_link: http://riduidel.posterous.com/bienbienbien-les-cochons-dans guid_link: http://riduidel.posterous.com/bienbienbien-les-cochons-dans filemtime: Thu, 25 Sep 2008 13:03:00 +0200 --- <SNIP REST> After this change, the meta information block loaded fine in my tests. -- Thomas
On Sat, Oct 4, 2008 at 11:38 PM, Thomas Leitner <t_leitner at gmx.at> wrote:> > Recall that the meta information part needs to be a valid YAML hash.Stupid of me :-O> However, when YAML parses [BienBienBien] it assumes that the value for > the title is an array and not a string. So you need to escape your > > After this change, the meta information block loaded fine in my tests.Yup, it now works fine. Thanks. And all seems fine ... Except there are failures with generation of pages from some feeds (like linuxfr or snippets.dzone, which create numerical only entries that webgen parses as only one page). But I will correct that soon. -- Nicolas Delsaux N''imprimez ce mail que si vous ne savez pas le lire sur l''?cran : les ?lectrons se recyclent bien, le papier, beaucoup moins bien.