philwilson.org

Feed on Feeds and dc:subject (again)

15 August, 2005

Well, after making my last brave post, I realised that FeedOnFeeds natively supports the dc:subject element and stores it in its database already. Hurray.

Or not, really. Because what it actually supports is a single dc:subject per feed item, which it just drops straight into a text field. If you have multiple subjects per item, say:

<dc:subject>Semantic Web</dc:subject>
<dc:subject>Programming</dc:subject>

Then what you end up with in your database is this:

Semantic WebProgramming

Which isn’t terribly useful at all. So, I can change the existing functionality to iterate and insert spaces and stuff, but what do I do with the existing 5,500 items in my database which already have tags associated with them? (crikey, that’ll teach me to subscribe my del.icio.us inbox) Clearly, I could write a script which looks for capital letters in the middle of words and inserts a space before it, but then you fall into the problem where things are tagged with acronyms like FOAF (which I still have trouble pronouncing in conversation, by the way), and plain ol’ typos.

“Bah” I say.

See other posts tagged with general and other posts made in August 2005.

Comments

Christoph Görn
16 August, 2005 at 12:29

… like FOAF (which I still have trouble pronouncing in conversation, by the way) …

Just try it the german-bavarian way… foaf foaf… ahm, we are missing audio capabilities here. There should be a localized acronym pronounciation podcast!

Pip
16 August, 2005 at 13:12

haha, that would be great! 🙂