XML
XML (an acronym for "eXtensible Markup Language") defines a set of standard rules for creating markup languages. A markup language is a mechanism for labeling content, typically text, primarily so that it can be understood and processed by software. One common markup language you've probably seen before is HTML (HyperText Markup Language), which looks like this:
<p>I <b>really</b> dislike <a href="https://toc.oreilly.com/resources/drm.html">DRM</a>.</p>
When viewed as part of a Web page, that markup looks like this:
I really dislike DRM.
The words inside the angle brackets (p, b, and a) are the names of "elements".
Many different types of content can be described with a markup language, and there's a lot of benefit to using a common type of markup language regardless of the content type. For example, in a document meant for the Web, you'd want to describe things like "paragraph" (<p>), "heading" (<h1>), and "ordered list" (<ol>).
But in the case of corporate financial data, you'd want to describe things like "inventory" (<ifrs-gp:Inventories>) and "total current assets" (<ifrs-gp:AssetsCurrentTotal>). By using XML to define the markup, even though the content is quite different, a common set of tools and techniques can be used for creating and processing both types of content.
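As a rough illustration of that "common set of tools" point, here is a minimal Python sketch in which one standard-library parser reads both a Web-style fragment and a financial fragment. The two sample documents, the `element_names` helper, and the namespace URI bound to the `ifrs-gp` prefix are all invented for this example; a real IFRS taxonomy defines its own namespace.

```python
import xml.etree.ElementTree as ET

# A Web-style fragment using the element names mentioned above.
web_doc = "<body><h1>Quarterly Report</h1><p>Inventory rose.</p></body>"

# A financial fragment. The namespace URI is a placeholder made up for
# this sketch, not the one a real IFRS taxonomy would use.
financial_doc = (
    '<report xmlns:ifrs-gp="http://example.com/hypothetical/ifrs-gp">'
    "<ifrs-gp:Inventories>1200</ifrs-gp:Inventories>"
    "<ifrs-gp:AssetsCurrentTotal>5400</ifrs-gp:AssetsCurrentTotal>"
    "</report>"
)


def element_names(xml_text):
    """Return the tag of every element, in document order."""
    root = ET.fromstring(xml_text)
    return [el.tag for el in root.iter()]


# The same function handles both kinds of content.
web_names = element_names(web_doc)
financial_names = element_names(financial_doc)

print(web_names)
print(financial_names)
```

The parser knows nothing about paragraphs or inventories; it only relies on both documents following the rules of XML.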
Although few people work directly with XML regularly, almost everyone using the Web actually uses XML every day. For example, the RSS and Atom feeds used to track blog updates are XML documents. Here's a snippet of the TOC feed, which uses the Atom format:
<feed xmlns="http://www.w3.org/2005/Atom">
<title>Tools of Change for Publishing</title>
<link rel="alternate" type="text/html" href="https://toc.oreilly.com/" />
<link rel="self" type="application/atom+xml" href="https://toc.oreilly.com/atom.xml" />
<id>tag:toc.oreilly.com,2008-01-24://40</id>
<updated>2008-07-18T21:01:03Z</updated>
<subtitle>Tools of Change for Publishing from O'Reilly Media: Technology is transforming publishing. Are you ready for the future? </subtitle>
<generator uri="https://www.sixapart.com/movabletype/">Movable Type 4.1</generator>
<entry>
<title>[TOC Directory] Recent Additions</title>
<link rel="alternate" type="text/html" href="https://toc.oreilly.com/2008/07/toc-directory-recent-additions-2.html" />
<id>tag:toc.oreilly.com,2008://40.25177</id>
<published>2008-07-22T14:30:00Z</published>
<updated>2008-07-18T21:01:03Z</updated>
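To give a sense of how software reads a feed like this, here is a minimal Python sketch using the standard library's `xml.etree.ElementTree`. It works on a trimmed copy of the snippet above, with closing tags added so that it parses on its own (the original snippet is cut off), and it assumes the standard Atom namespace URI, `http://www.w3.org/2005/Atom`. It is an illustration only, not how any particular feed reader is implemented.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the feed snippet. The closing </entry> and </feed> tags
# are added here so the sample is a complete document.
feed_xml = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Tools of Change for Publishing</title>
  <updated>2008-07-18T21:01:03Z</updated>
  <entry>
    <title>[TOC Directory] Recent Additions</title>
    <id>tag:toc.oreilly.com,2008://40.25177</id>
    <published>2008-07-22T14:30:00Z</published>
  </entry>
</feed>"""

# Map a short prefix to the Atom namespace so element lookups stay readable.
ns = {"atom": "http://www.w3.org/2005/Atom"}

root = ET.fromstring(feed_xml)

# Title of the feed itself, then the title of each entry in it.
feed_title = root.findtext("atom:title", namespaces=ns)
entry_titles = [
    entry.findtext("atom:title", namespaces=ns)
    for entry in root.findall("atom:entry", ns)
]

print(feed_title)
print(entry_titles)
```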
XML is often used as a file storage format (either as the primary format or an alternative) in word processing and desktop publishing software like Word, InDesign, OpenOffice and even Excel. All use the rules of XML to define their file formats, though each uses its own particular names for its elements and has different rules about how those elements can appear within a document. Indeed, one of the big advantages of XML is the ability to use a standard set of tools for defining the rules of a particular document ("tables are not allowed inside of sidebars" or "every image must have a caption"). Before XML, such rules were often codified in style manuals, or perhaps enforced with custom software such as Word macros or InDesign scripts.
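In practice, rules like these are usually written declaratively in a schema language such as a DTD, XML Schema, or RELAX NG, and checked by a validator. The hand-written Python sketch below only illustrates the idea of checking the two example rules in software. The element names (`chapter`, `figure`, `image`, `caption`, `sidebar`, `table`) and the `check_rules` helper are hypothetical and do not come from any of the formats mentioned above.

```python
import xml.etree.ElementTree as ET

# A made-up document: one figure follows the rules, one image lacks a
# caption, and one sidebar wrongly contains a table.
doc = """<chapter>
  <figure><image src="a.png"/><caption>Sales by quarter</caption></figure>
  <figure><image src="b.png"/></figure>
  <sidebar><table/></sidebar>
</chapter>"""


def check_rules(xml_text):
    """Return a list of human-readable rule violations."""
    root = ET.fromstring(xml_text)
    problems = []

    # Rule: every image must have a caption (in the same figure).
    for figure in root.iter("figure"):
        image = figure.find("image")
        if image is not None and figure.find("caption") is None:
            problems.append(f"image {image.get('src')} has no caption")

    # Rule: tables are not allowed inside of sidebars, at any depth.
    for sidebar in root.iter("sidebar"):
        if sidebar.find(".//table") is not None:
            problems.append("table found inside a sidebar")

    return problems


problems = check_rules(doc)
for problem in problems:
    print(problem)
```

A schema expresses the same constraints as data rather than code, which is what lets standard tools enforce them across many documents.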
For more technical information, see this in-depth technical overview of XML, this "Learning XML" course from the O'Reilly School of Technology, or "Learning XML, 2nd Edition".
Slides from "Essential Tools of an XML Workflow" Webcast
Laura Dawson has made her slides available from the recent TOC Webcast, "Essential Tools of an XML Workflow." A complete recording of the event will be posted here soon. View...
[TOC Webcast] Essential Tools of an XML Workflow
Tools of Change for Publishing, in conjunction with StartWithXML, will host "Essential Tools of an XML Workflow," a free webcast with presenter Laura Dawson, on Thursday, Dec. 11 at...
Webcast Video: What Publishers Need to Know about Digitization
Below you'll find the full recording from the recent TOC Webcast, "What Publishers Need to Know about Digitization," with Liza Daly....
A Correction!
Frank Grazioli, of Wiley, writes in to correct my last post about taxonomies: Wiley has been exploring taxonomies for its travel content business; the cooking/psych/accounting spaces might be our next...
Slides from "What Publishers Need to Know about Digitization" Webcast
Slides from the "What Publishers Need to Know about Digitization" webcast.
[TOC Webcast] Tomorrow: What Publishers Need to Know About Digitization
Tools of Change for Publishing will host a free webcast tomorrow at 1 p.m. eastern (10 a.m. pacific). Digitization expert Liza Daly will discuss "What Publishers Need to Know...
Beyond the Tag Cloud
This is an excerpt from our research paper, which will publish in concert with the StartWithXML Forum on January 13th at the McGraw-Hill Auditorium in New York. Early bird discounting...
Another Position: XML Alone is Not Enough
George Lossius, the CEO of Publishing Technology PLC, wrote a very thoughtful post about our StartWithXML project for the new UK blog, BookBrunch. He comments after a report on the...
For a Workflow Change, Support from the Top is Required
Last week Laura Dawson and I spoke about StartWithXML to a group of IT and operations people from publishers at the User Group meeting for Global Turnkey Systems, a company...
[TOC Webcast] What Publishers Need to Know about Digitization
Tools of Change for Publishing will host a free webcast with digitization expert Liza Daly on Wednesday, Nov. 12 at 1 p.m. eastern (10 a.m. pacific). No prior experience...
Recommended Reading on XML and Publishing
While clearing out some old files, I came across a folder of articles culled during research about three years ago, while I was building the case for increasing our use...
Can XML Help you Avoid a Disruptive Innovation?
This semester, I'm fortunate to spend my Wednesday nights teaching management to students who are part of NYU's M.S. in publishing program. Although a significant share of the course is...
StartWithXML Survey Results Preview
Preview the results of the StartWithXML survey.
Standardizing Tags in the Metadata Minefield
One issue we haven't discussed much is that of metadata. XML documents are by definition rife with metadata. At what point does metadata cross the line from useful to pollution?...
When it Comes to Search, How Low Can You Go?
Discussing futurist Paul Saffo's observation on search.
Tools of Change for Publishing is a division of O'Reilly Media, Inc.
© 2008, O'Reilly Media, Inc. | (707) 827-7000 / (800) 998-9938
All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners.