News
The indexed XML website as a commodity Reviewing a few long-term, continuing multi-publishing projects I have been involved in recently, I am struck that several are morphing in a particular direction. The projects might have started as publishing paper or webpages, and moved to publishing high-level XML, but increasingly the commodity that needs to be packaged and distributed (for re-skinning and re-use by third parties) is the whole indexed dataset: in effect the website (without the implication of HTML pages.) The client-person doesn't GET a webpage, they get a whole website (this is for B2B not B2C.)… read more Rick Jelliffe
What are useful Software Engineering approaches for legislated requirements? More projects seem to be coming across my desk that ultimately involve building information systems whose primary requirements come from legislation or regulations. And sometimes even the detailed requirements. Legislation is sometimes quite a nice Requirement Specification: it is expressed...… read more Rick Jelliffe
Linking a public government dataset into the semantic web with RDF A few months ago, a client wanted to dip their toes in the semantic web. So I took a fresh look at the status quo, and where the current sweet spot is. Here are my conclusions, and how things panned out for this particular job.… read more Rick Jelliffe
Programming languages available in-house determines architecture? A solid refactoring, the kind that you don't do every year, also needs to involve a tooling up, but scoped to making the new desired architecture something that programmers won't subvert but find natural. In a way, the programming languages become the interfaces that provides the boundaries for the layers of the system.… read more Rick Jelliffe
The Grammar of Schematron A lot of Schematron can be implemented directly in a mildly enhanced version of RELAX NG without (I think) explosions, before it all runs out of steam.… read more Rick Jelliffe
XProc and SMIL: Orchestrating Pipelines Although the W3C's XML Pipeline Language (XProc) hasn't even left the stable yet, people are already looking beyond its original purpose. XProc was designed to solve the problem of how to describe the joining together of multiple XML processing steps. So, the question is, how do you extend XProc to handle new features like explicit concurrency...… read more Philip Fennell
HTML 5 comics CSS quirrel is an online comic that is good for a few laughs. You can tell it would be funny if you knew what on earth they all were talking about. Actually, most of the comics are really paired with blog items giving the back story. It is a really cute format. Read on for a few of my favorites.… read more Rick Jelliffe
Do we need lazy loading XML parsers to make XHTML scalable? The W3C Systeam's blog has a hilarious item W3C's Excessive DTD Traffic. Apparently, generic XML systems are trying to download the DTD using the DOCTYPE declaration system identifier (i.e. what it is for) on XHTML files, or downloading the schemas from the namespace URI (i.e. not what it is for) for documents with XHTML fragments. And it is a lot of bogus traffic. W3C does not want to cop having to serve dumb XHTML requests for DTDs and schemas. A different DOCTYPE and a lazy loading parser policy would help. But I think all the ISO/MathML special character public entity sets should be built into XML.… read more Rick Jelliffe
Weak validation using hash codes High performance gateways are a potential use case for efficient weak validation systems.… read more Rick Jelliffe
Jotting on parsers for SGML-family document languages: SGML, HTML, XML #4 We seem to be getting to the stage of finally having several credible candidates for language class that can cope with SGML-family systems.… read more Rick Jelliffe
Jotting on parsers for SGML-family document languages: SGML, HTML, XML #3 Now by now you may be saying Rick, are you really saying that SGML can only be described by some kind of seven-level grammar? Zut alors! And HTML and XML too?… read more Rick Jelliffe
Jotting on parsers for SGML-family document languages: SGML, HTML, XML #2 Here is Melvin Conway's foundation point from his 1963 paper defining coroutines:
"That property of the design which makes it amenable to many segment configurations is its separability."… read more Rick Jelliffe
Rikipedia: stuff deleted from Wikipedia - Ken Krechmer on OOXML Standardization I found that that an interesting section Ken Krechemer had contributed to the Wikipedia article on the Standardization of OOXML had been deleted for being an editorial. Anyway, I hope Ken doesn't mind me taking the liberty of reprinting it here.… read more Rick Jelliffe
Jotting on parsers for SGML-family document languages: SGML, HTML, XML But it is no use me sitting here complaining that people are saying "drop SGML" without even knowing what it is they are dropping. So I thought I'd make some little diagrams roughly scoping a basic machine for SGML family parsers.… read more Rick Jelliffe
USPTO gets its prongs in order: but what about schema patents? A counter-reformation rather than a reformation? But welcome none-the-less.… read more Rick Jelliffe
OSCON for FREE! I am offering a novel idea about Open Source. Ric Johnson
Grouping in XQuery One of the really convenient features introduced in XSLT 2.0 is Grouping. It is a typical second-generation change in a programming language: Not essential for the language itself (grouping can be done by hand using techniques such as the Muenchian… read more Erik Wilde
XML makes you stoopid! Everyone is missing the forest for the trees on Google Protcol Buffers not using XML. Ric Johnson
Google hates XML Goolge does not know how to use XML - in fact it seems the HATE it. Ric Johnson
Why M. David Peterson is WRONG The truth in blogging: follow the money to know where your favorite posting really are saying. Ric Johnson
Microsoft credible as blushing debutante at the standards ball? Effective participation in standards bodies involves quite specific commitment and development of expertise, it is not a generic capability that can be instantly redeployed, Rumsfield-style, to trouble spots. For example, while knowledge of OASIS procedures may help you understand some… read more Rick Jelliffe
Using SwiXML and Substance 5 SwiXML is Wolf Paulus' XML User Interface languge (XUI or XUL) which uses the regularity of the Java Swing GUI libraries to allow very lightweight implementation: XML elements are used for JComponents, XML attributes are used for properties (e.g. <frame… read more Rick Jelliffe
Why Jeff Atwood Is Right Firstly, I, like many of you, am glad to see that Dare Obasanjo's indefinite hiatus from the blogosphere was short lived. Secondly, while I most certainly agree with the premise of his recent "In Defense of XML" post -- which… read more M. David Peterson
CherryPy 3.1 Released CherryPy 3.1 is out and there are some exciting new features. The first exciting piece is the Web Site Process Bus. Robert Brewer had come up with an idea to create a generic server management API to help make management… read more Eric Larson
10% of top Google product features are broken every week. Result of Google culture - Roll out cool features, not focus on quality? My saga on problems with GMail continue. Despite of the -ve feedback ("GMail is working fine", "GMail is awesome', "Not sure why you are complaining GMail?" etc) to my posts, I continue to see the problems with GMail. I am… read more Hari K. Gottipati
RDF Parsing in XSLT During the recent discussion of the OAI-ORE drafts (which use RDF), the claim was made that RDF is serialized in RDF/XML and thus could be considered an XML representation of the underlying data model. My response to that was that… read more Erik Wilde
Freedom in Web Applications It is interesting to see the progression of free software along side the proliferation of the web. When I first started programming, I got involved with a web CMS I used in my contract work. I would write a new… read more Eric Larson
Associating Resources with Namespaces The W3C just published a new TAG Finding called Associating Resources with Namespaces. Here's the abstract: This Finding addresses the question of how ancillary information (schemas, stylesheets, documentation, etc.) can be associated with a namespace. I don't quite understand why… read more Erik Wilde
Permanent URLs for things in the real world At the Semantic Technologies conference in San Jose I attended an interesting presentation entitled “persistent identifiers for the real web”. XML often uses URLs for identifying schema namespaces, and I suppose could be credited for influencing RDF’s practice of using… read more Taylor Cowan
Castoff hints? Rethinking interoperability and fidelity First some jargon (from the Glossary of Typesetting Terms or Harrod's Librarians' Glossary full props to Google.) Castoff: The calculation the number of typeset pages a manuscript will make, based on a character count. Proof: An impression made from type… read more Rick Jelliffe