Office Open XML
Not to be confused with
OpenDocument (another XML document format), OpenOffice.org XML (a deprecated format used by earlier versions of OpenOffice.org), or Microsoft Office XML formats (a deprecated format used by earlier versions of Microsoft Office).
|
Office Open XML |
- Office Open XML standardization
- Open Packaging Convention
- Open Specification Promise
- Vector Markup Language
- Office Open XML software
- Comparison of Office Open XML software
|
Office Open XML (also referred to as OOXML, OpenXML, or Open XML) is a file format for representing spreadsheets, charts, presentations and word processing documents. An Office Open XML document file contains mainly XML based files compressed within a zip package.
The Office Open XML format specification in 2006 became a free and open Ecma International standard.[2][3][4][5][6][7][8] In November 2008, after incorporating some of the proposed changes from ISO/IEC members during standardization process of Office Open XML an amended version of the specification was published as international standard ISO/IEC 29500:2008, Information technology – Office Open XML formats.[9] ISO/IEC 29500:2008 is a 4 part standard specification that can be freely downloaded.[10] and as ECMA-376 Office Open XML File Formats - 2nd edition (December 2008).
Microsoft originally developed the specification as a successor to its earlier binary and Office 2003 XML file formats. The specification was later handed over to Ecma International to be developed as the ECMA-376 standard, under the stewardship of Ecma International Technical Committee TC45. ECMA-376 was published in December 2006[11] and can be freely downloaded from Ecma International.[12]
An amended version of the format received the necessary votes for approval as an ISO/IEC Standard as the result of a JTC 1 fast tracking standardization process that concluded in April 2008.[13]
Starting with Microsoft Office 2007, the Office Open XML file formats (ECMA-376) have become the default file format of Microsoft Office, the currently market-leading office suite.[14][15] Microsoft Office 14 will be the first version to implement the ISO/IEC IS 29500 compliant version of Office Open XML.[16][17][18]
Background
Prior to the 2007 edition, the core applications of the Microsoft Office software suite (primarily Word, Excel, and PowerPoint) by default stored their data in binary files. Historically, these files were difficult for other applications to interoperate with, due to the lack of publicly available information and royalty-free access to the format specifications. More recently, Microsoft has offered these binary format specifications under a royalty-free covenant not to sue as part of its Open Specification Promise.[19][20] While a level of support for the binary formats had been achieved by some applications, full interoperability remained elusive.
In 2000, Microsoft released an initial version of an XML-based format for Excel, which was incorporated in Office XP. In 2002, a new file format for Microsoft Word followed.[21] The Excel and Word formats – known as the Office 2003 XML formats – were later incorporated into the 2003 release of Microsoft Office.
In May 2004, governments and the European Union recommended to Microsoft that they publish and standardize their XML Office formats through a standardization organization.[22][21] Microsoft announced in November 2005 that it would standardize the new version of their XML-based formats through Ecma, as "Ecma Office Open XML".[23]
File format and structure
Office Open XML main components
Office Open XML uses a file package conforming to the Open Packaging Convention. This format uses mechanisms from the ZIP file format and contains the individual files that form the basis of the document. In addition to Office markup, the package can also include embedded files such as images, videos, or other documents.
Document markup languages
An Office Open XML file may contain several documents encoded in specialized markup languages corresponding to applications within the Microsoft Office product line. Office Open XML defines multiple vocabularies using 27 namespaces and 89 schema modules.
The primary markup languages are:
- WordprocessingML for word-processing
- SpreadsheetML for spreadsheets
- PresentationML for presentations
Shared markup language materials include:
- Office Math Markup Language (OMML)
- DrawingML used for vector drawing, charts, and for example, text art (additionally, though deprecated, VML is supported for drawing)
- Extended properties
- Custom properties
- Variant Types
- Custom XML data properties
- Bibliography
In addition to the above markup languages custom XML schemas can be used to extend Office Open XML.
The XML Schema of Office Open XML emphasizes reducing load time and improving parsing speed. In a test with applications current in April 2007, XML based office documents were slower to load than binary formats.[24] To enhance performance, Office Open XML uses very short element names for common elements and spreadsheets save dates as index numbers (starting from 1899 or from 1904). In order to be systematic and generic, Office Open XML typically uses separate child elements for data and metadata (element names ending in Pr for properties) rather than using multiple attributes, which allows structured properties. Office Open XML does not use mixed content but uses elements to put a series of text runs (element name r) into paragraphs (element name p). The result is terse and highly nested in contrast to HTML, for example, which is fairly flat, designed for humans to write in text editors and is more congenial for humans to read.
Office MathML (OMML)
Office Math Markup Language is a mathematical markup language which can be embedded in WordprocessingML, with intrinsic support for including word processing markup like revision markings,[25] footnotes, comments, images and elaborate formatting and styles.[26] The OMML format is different from the World Wide Web Consortium (W3C) MathML recommendation that does not support those office features, but is partially compatible[27] through relatively simple XSL Transformations.
The following Office MathML example defines the fraction:
<m:oMathPara>
<m:oMath>
<m:f>
<m:num><m:r>
<m:t>π</m:t>
</m:r></m:num>
<m:den><m:r>
<m:t>2</m:t>
</m:r></m:den>
</m:f>
</m:oMath>
</m:oMathPara>
DrawingML
Example of DrawingML charts
DrawingML is the vector graphics markup language used in Office Open XML documents. Its major features are the graphics rendering of text elements, graphical vector based shape elements, graphical tables and charts.
The DrawingML table is the third table model in Office Open XML (next to the table models in WordprocessingML and SpreadsheetML) and is optimized for graphical effects and its main use is in presentations created with PresentationML markup. DrawingML contains graphics effects (like shadows and reflection) that can be used on the different graphical elements that are used in DrawingML. In DrawingML you can also create 3d effects, for instance to show the different graphical elements through a flexible camera viewpoint. It is possible to create separate DrawingML theme parts in an Office Open XML package. These themes can then be applied to graphical elements throughout the Office Open XML package.[28]
DrawingML is unrelated to the other vector graphics formats such as SVG. These can be converted to DrawingML to include natively in an Office Open XML document. This is a different approach to that of the OpenDocument format, which uses a subset of SVG, and includes vector graphics as separate files.
Container structure
-
Main article: Open Packaging Convention
Container structure of Part 2 of the Ecma Office Open XML standard, ECMA-376
Office Open XML documents are stored in Open Packaging Convention (OPC) packages, which are ZIP files containing XML and other data files, along with a specification of the relationships between them.[29] Depending on the type of the document, the packages have different internal directory structures and names. An application will use the relationships files to locate individual sections (files), with each having accompanying metadata, in particular MIME metadata.
A basic package contains an XML file called [Content_Types].xml at the root, along with three directories: _rels, docProps, and a directory specific for the document type (for example, in a .docx word processing package, there would be a word directory). The word directory contains the document.xml file which is the core content of the document.
- [Content_Types].xml
- This file describes the contents of the package. It also contains a mapping for file extensions and overrides for specific URIs.
- _rels
- This directory contains relationships for the files within the package. To find the relationships for a specific file, look for the _rels directory that is a sibling of the file, and then for a file that has the original file name with a .rels appended to it. For example, if the content types file had any relationships, there would be a file called [Content_Types].xml.rels inside the _rels directory.
- _rels/.rel
- This file is where the package relationships are located. Applications look here first. Viewing in a text editor, one will see it outlines each relationship for that section. In a minimal document containing only the basic document.xml file, the relationships detailed are metadata and document.xml.
- docProps/core.xml
- This file contains the core properties for any Office Open XML document.
- word/document.xml
- This file is the main part for any Word document.
Relationships
Relationship files in Office Open XML
An example relationship file (word/_rels/document.xml.rels), is:
<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<Relationships
xmlns="http://schemas.microsoft.com/package/2005/06/relationships">
<Relationship Id="rId1"
Type="http://schemas.microsoft.com/office/2006/relationships/image"
Target="http://en.wikipedia.org/images/wiki-en.png"
TargetMode="External" />
<Relationship Id="rId2"
Type="http://schemas.microsoft.com/office/2006/relationships/hyperlink"
Target="http://www.wikipedia.org"
TargetMode="External" />
</Relationships>
As such, images referenced in the document can be found in the relationship file by looking for all relationships that are of type http://schemas.microsoft.com/office/2006/relationships/image
. To change the used image, edit the relationship.
Hyperlink relations
The following code shows an example of inline markup for a hyperlink:
<w:hyperlink r:id="rId2" w:history="1"
xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships"
xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main">
In this example, the Uniform Resource Locator (URL) is represented by "rId2". The actual URL is in the accompanying relationships file, located by the corresponding "rId2" item. Linked images, templates, and other items are referenced in the same way.
Embedded or linked media file relations
Pictures can be embedded or linked using a tag:
<v:imagedata w:rel="rId1" o:title="example" />
This is the reference to the image file. All references are managed via relationships. For example, a document.xml has a relationship to the image. There is a _rels directory in the same directory as document.xml, inside _rels is a file called document.xml.rels. In this file there will be a relationship definition that contains type, ID and location. The ID is the referenced ID used in the XML document. The type will be a reference schema definition for the media type and the location will be an internal location within the ZIP package or an external location defined with a URL.
Document properties
Office Open XML uses the Dublin Core Metadata Element Set and DCMI Metadata Terms to store document properties. Dublin Core is a standard for cross-domain information resource description and is defined in ISO 15836:2003.
Core properties
An example document properties file (docProps/core.xml) that uses Dublin Core metadata, is:
<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<cp:coreProperties xmlns:cp="http://schemas.openxmlformats.org/package/2006/metadata/core-properties"
xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:dcmitype="http://purl.org/dc/dcmitype/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<dc:title>Office Open XML</dc:title>
<dc:subject>File format and structure</dc:subject>
<dc:creator>Wikipedia</dc:creator>
<cp:keywords>Office Open XML, Metadata, Dublin Core</cp:keywords>
<dc:description>Office Open XML uses ISO 15836:2003</dc:description>
<cp:lastModifiedBy>Wikipedia</cp:lastModifiedBy>
<cp:revision>1</cp:revision>
<dcterms:created xsi:type="dcterms:W3CDTF">2008-06-19T20:00:00Z</dcterms:created>
<dcterms:modified xsi:type="dcterms:W3CDTF">2008-06-19T20:42:00Z</dcterms:modified>
<cp:category>Document file format</cp:category>
<cp:contentStatus>Final</cp:contentStatus>
</cp:coreProperties>
Structure of the standard
To aid the reader's understanding, the Office Open XML specification contains both normative material and informative material.
ISO/IEC 29500:2008
The ISO/IEC standard is structured into four parts, each of which are independent standards.[30]
- Part 1: Fundamentals and Markup Language Reference (5550 pages)
- Contains conformance definitions
- Contains the reference material for WordprocessingML, SpreadsheetML, PresentationML, DrawingML, Shared MLs and Custom XML Schema, defining every element and attribute including the element hierarchy (parent/child relationships)
- XML schemas for the markup languages are declared as XSD and (non-normatively) using RELAX NG
- Defines the custom XML data storing facility
- Schemas are included in the text
- Part 2: Open Packaging Conventions (128 pages)
- Defines the Open Packaging Conventions (package model, physical package)
- Defines core properties, thumbnails and digital signatures
- XML schemas for the OPC are declared as XML Schema Definitions (XSD) and (non-normatively) using RELAX NG (ISO/IEC 19757-2)
- Part 3: Markup Compatibility and Extensibility (40 pages)
- Describes extension facilities of OpenXML documents and specifies elements and attributes by which applications with different extensions can interoperate
- Extensibility rules are expressed using NVDL (ISO/IEC 19757-4)
- Part 4: Transitional Migration Features (1455 pages)
- Contains reference material of legacy and transitional interest
- Contains controversial material such as compatability settings and the graphics markup language VML
- Contains list of differences between ECMA-376 and ISO/IEC 29500:2008
The standard specifies six levels of document and application conformance, strict and transitional for each of WordprocessingML, PresentationML and SpreadsheetML. The standard also specifies applications descriptions of base and full.
ECMA-376:2006
The ECMA standard is structured in five parts to meet the needs of different audiences.[12]
- Part 1: Fundamentals
- Vocabulary, notational conventions and abbreviations
- Summary of primary and supporting markup languages
- Conformance conditions and interoperability guidelines
- Constraints within the Open Packaging Conventions that apply to each document type
- Part 2: Open Packaging Conventions
- Defines the Open Packaging Conventions (package model, physical package)
- Defines core properties, thumbnails and digital signatures
- XML schemas for the OPC are declared as XML Schema Definitions (XSD) and (non-normatively) using RELAX NG (ISO/IEC 19757-2)
- Part 3: Primer
- Informative (non-normative) introduction to WordprocessingML, SpreadsheetML, PresentationML, DrawingML, VML and Shared MLs, providing context and illustrating elements through examples and diagrams
- Describes the custom XML data storing facility within a package to support integration with business data
- Part 4: Markup Language Reference
- Contains the reference material for WordprocessingML, SpreadsheetML, PresentationML, DrawingML, Shared MLs and Custom XML Schema, defining every element and attribute including the element hierarchy (parent/child relationships)
- XML schemas for the markup languages are declared as XSD and (non-normatively) using RELAX NG
- Defines the custom XML data storing facility
- Part 5: Markup Compatibility and Extensibility
- Describes extension facilities of OpenXML documents and specifies elements and attributes by which applications with different extensions can interoperate
- Extensibility rules are expressed using NVDL (ISO/IEC 19757-4)
Licensing
Reasonable and Non Discriminatory
Ecma International provides specifications that "can be freely copied by all interested parties without restrictions".[31] Under the Ecma code of conduct in patent matters, participating and approving member organisations are required to make available their patent rights on a Reasonable and Non Discriminatory (RAND) basis. While making patent rights available on a RAND basis is considered a common minimum patent condition for a standard, international standardization has a clear preference for royalty-free patent licensing. That is why Microsoft, a main contributor to the standard, provided a Covenant Not to Sue[32] for its patent licensing. The covenant received a mixed reception, with some (like the Groklaw blog) identifying problems[33] and others (such as Lawrence Rosen, an attorney and lecturer at Stanford Law School) endorsing it.[34]
Open Specification Promise
-
Main article: Open Specification Promise
Microsoft also added the format to their Open Specification Promise[35] in which
"Microsoft irrevocably promises not to assert any Microsoft Necessary Claims against you for making, using, selling, offering for sale, importing or distributing any implementation to the extent it conforms to a Covered Specification […]"
subject to certain restrictions. Office Open XML can therefore be used under the Covenant Not to Sue or the Open Specification Promise.
The Open Specification Promise was included in documents submitted to ISO in support of the ECMA-376 fast track submission.[36] Ecma International asserted that, "The OSP enables both open source and commercial software to implement [the specification]."[37]
In support of the licensing arrangements Microsoft commissioned an analysis from the London legal firm Baker & McKenzie.[38]
Several standards and OSS licensing experts expressed support in 2006 of the OSP. A 2006 article in Cover Pages quotes Lawrence Rosen, an attorney and lecturer at Stanford Law School, as saying,
"I'm pleased that this OSP is compatible with free and open source licenses."[39]
In 2006[40], Mark Webbink; a lawyer and member of the board of the Software Freedom Law Center, and former employee of Linux vendor Red Hat; has said,
"Red Hat believes that the text of the OSP gives sufficient flexibility to implement the listed specifications in software licensed under free and open source licenses. We commend Microsoft’s efforts to reach out to representatives from the open source community and solicit their feedback on this text, and Microsoft's willingness to make modifications in response to our comments."[41]
Standards lawyer Andy Updegrove said in 2006 the Open Specification Promise was
"[…] what I consider to be a highly desirable tool for facilitating the implementation of open standards, in particular where those standards are of interest to the open source community."[42]
On March 12, 2008 the Software Freedom Law Center, which provides services to protect and advance free software and open source software, has warned of problems with the Open Specification Promise as it relates to Office Open XML and the GNU General Public License (GPL).[43] In a published analysis of the promise it states that[44]
-
- "Any code that implements the specification may also do other things in other contexts, so in effect the OSP does not cover any actual code, only some uses of code."[45]
- "...it permits implementation under free software licenses so long as the resulting code isn't used freely."[45]
- "The OSP cannot be relied upon by GPL developers for their implementations not because its provisions conflict with GPL, but because it does not provide the freedom that the GPL requires."[45]
Microsoft amended the OSP FAQ to specifically assure GPL license users that the open licensing of its covered formats through the Open Specification Promise applies to users of the GPL license when implementing covered implementations:
"The OSP provides the assurance that Microsoft will not assert its Necessary Claims against anyone who make, use, sell, offer for sale, import, or distribute any Covered Implementation under any type of development or distribution model, including the GPL."[46]
Legal experts and academics have confirmed that the licensing is similar to the licensing terms offered by IBM, and to a lesser extent Sun and Adobe on their Office formats but warned that ambiguous legal jargon contained in Microsoft's Open Specification Promise, although understood in the terminology of the specialist patent and intellectual property law community, makes it hard for small developers to determine with certainty that Microsoft will not be entitled to sue them for using OOXML. They criticize Microsoft for not explicitly defining which parts of its 6000 page OOXML specification contains intellectual property. The OSP has never been tested in a court of law regarding intellectual property rights, and the OSP does not name which court or jurisdiction a dispute will be heard in. David Vaile, executive director of the Cyberspace Law and Policy Centre at the University of New South Wales, acknowledges that the Microsoft approach to its Open Specification Promise is similar although not identical to other approaches used by IBM, and to a lesser extent Adobe and Sun, and that these represent a substantial advance on the past practice of negotiating long case-by-case agreements.[47]
Standardization
-
Main article: Standardization of Office Open XML
Office Open XML is an Ecma standard (Ecma-376, approved on 7 December 2006). ECMA-376 was created using as a basis a new version of the Microsoft Office 2003 XML file format, donated by Microsoft, which was being created for Microsoft Office 12.
The specification entered fast-track standardization within ISO/IEC as DIS 29500 (Draft International Standard 29500).[48] In a September 2007 vote by ISO/IEC member bodies, the draft text was not approved as an international standard. A ballot resolution process in March 2008 amended the text.
On April 2, 2008, ISO and IEC officially stated that the DIS 29500 had been approved for acceptance as an ISO/IEC Standard, pending any appeals.[13] In accordance with the JTC 1 directives the Project Editor created a new version with the final text within a month after the BRM, to be published as ISO/IEC 29500. After review and corrections this text has been distributed to the members of SC34.[49]
On May 21st, 2008, Microsoft announced that it will be an active participant in the future evolution of the Office Open XML standard.[50]
In October 2008, ISO/IEC committee JTC1/SC34 (Document Description and Processing Languages) created a working group for maintenance of the ISO/IEC 29500 standard within ISO/IEC. [51]
In November 2008, the new international standard was published as ISO/IEC 29500:2008, Information technology – Office Open XML formats.
Wordwide adoption
Commonwealth of Massachusetts
The US state of Massachusetts has been examining its options for implementing XML-based document processing. In early 2005, Eric Kriss, Secretary of Administration and Finance in Massachusetts, was the first government official in the United States to publicly connect open formats to a public policy purpose: "It is an overriding imperative of the American democratic system that we cannot have our public documents locked up in some kind of proprietary format, perhaps unreadable in the future, or subject to a proprietary system license that restricts access."[52] Since 2007 Massachusetts classifies Office Open XML as "Open Format" and has amended its approved technical standards list — the Enterprise Technical Reference Model (ETRM) — to include Office Open XML. Massachusetts now formally endorses Office Open XML formats for its public records.[53]
Application support
-
Main article: Office Open XML software
- The list here is not exhaustive. Another list of supporting/partial implementations of Office Open XML can be found on Microsoft's office open XML Community website, but it is also not exhaustive.
ECMA-376 implementations
The ECMA-376 1st edition (December 2006) standard is supported in several office suites from various vendors.
- AbiWord includes an input filter for Office Open XML documents beginning with version 2.6.0.[54]
- Adobe Buzzword, the online word processor by Adobe Systems, imports and exports Microsoft Word (DOC and DOCX) and Word 2003 XML files.[55]
- Apple Inc.'s iWork '08 suite has read-only support for Office Open XML word processing, spreadsheet and presentation file formats in Pages, Numbers and Keynote (respectively).[56][57][58]
- Apple Inc.'s iPhone has read-only support for Office Open XML attachments to email.[59]
- Apple Inc.'s TextEdit, the built-in word processing program of Mac OS X, has read and write support for Office Open XML files starting with Mac OS X v10.5.[60]
- Corel WordPerfect Office X4 includes import-only support for Office Open XML.[61]
- DataViz' Documents To Go for PalmOS, Windows Mobile and Symbian OS (UIQ, S80) supports Office Open XML documents.[62]
- Gnumeric has limited SpreadsheetML support.[63]
- Go-oo (OpenOffice.org variant) has built-in support for opening Office Open XML files. Version 3.x is availaible for Windows 2000/XP/Vista and for most of Linux distributions. Go-oo Version 2.4.1 is availaible also for Windows 98/ME.[64]
- Microsoft Office 2007 for Windows uses the Office Open XML format as the default.
- Older versions of Microsoft Office (2000, XP and 2003) require a free compatibility pack provided by Microsoft.[65] It is available for Windows 2000 Service Pack 4 and newer operating systems. The compatibility pack does not require Microsoft Office, but does require Microsoft Windows. It can be used as a standalone converter with products that read Office's older binary formats, such as OpenOffice.org.[66]
- Microsoft Office 2008 for Mac supports the Office Open XML format.[67] For older versions of Office on the Mac, a converter is available.
- Microsoft Office Mobile 6.1 supports Office Open XML on Mobile devices.[68]
- The Mac OS X-based NeoOffice office suite supports opening, editing, and saving of most Office Open XML documents since version 2.1.[69]
- OpenOffice.org has built-in support for opening Office Open XML text documents, spreadsheets and presentations beginning with OpenOffice.org version 3.0 (october 2008). It is availaible for Windows 2000/XP/Vista, Mac OS X (Intel/PowerPC), Linux, Solaris, FreeBSD (*BSD)[70]
- OpenXML Writer is an open source text editor for creating WordprocessingML files.[71]
- Oxygen XML Editor allows you to extract, validate, edit and process the XML data stored in Office Open XML files. These capabilities allow developers to use data from Office 2007 documents together with validation and transformations (using XSLT or XQuery) to other file formats. Validation is done using the latest ECMA-376 XML Schemas.[72]
- The online Thinkfree Office supports Office Open XML word processing files, and will support spreadsheets and presentation files in the future.[73]
- Online word processor Zoho Writer supports exporting to the Office Open XML WordprocessingML format.[74]
Filters and converters
- Microsoft has created Office Open XML Converter for Mac OS X that can convert Office Open XML files to the former binary file formats used in older versions of Microsoft Office.[75]
- OxygenOffice includes xmlfilter which is the code that OpenOffice.org 3 will use to process Office Open XML files, and xmlfilter is completely different than OdfConverter.[76] This filter however is only for importing Office Open XML files not for exporting them.
- docXConverter by Panergy Ltd. converts from WordprocessingML to Rich Text Format (RTF) and from SpreadsheetML to Comma-separated values (CSV). docXConverter can be used to transfer WordprocessingML data to other applications that read RTF data such as Word 97.[77]
- Microsoft and Sonata Software created a plugin to convert Office Open XML text documents to DAISY XML, an open standard format used for creating spoken document for disabled people.[78]
- Google search supports direct HTML view of Office Open XML files. Found files can be viewed directly in a converted HTML view.[79]
- OpenXML Document Viewer is an open source command line utility that provides Office Open XML to HTML transformation to display WordprocessingML files in web browsers.[80]
- DataViz MacLinkPlus Deluxe 16 supports Office Open XML file formats.[81]
Other products
- Altova's MapForce includes support for data integration based on the Office Open XML spreadsheet data format.[82]
- Altova's StyleVision adds Word 2007 (Office Open XML) wordprocessing capabilities to its graphical stylesheet design tool.[83]
- Altova's DiffDog supports detailed differencing for Office Open XML and ZIP archive file pairs.[82]
- Altova XMLSpy, an XML editor for modeling, editing, transforming, and debugging XML technologies supports Office Open XML file formats.[84]
- IBM DB2 Content Manager V8.4 clients support Office Open XML file formats.[85]
- IBM Lotus Quickr V8.0 includes support for Office Open XML documents.[86]
- Nuance OmniPage Professional 16, an OCR and Document Conversion Software, was the first desktop OCR application to provide native support for the Office Open XML standard.[87]
- Oxygen XML Editor provides ready to use validation, editing and processing support for Microsoft Office 2007 - Office Open XML (OOXML).[88]
- RIM BlackBerry Enterprise Server software version 4.1 SP4 (4.1.4) supports Office Open XML file formats.[89]
Planned and beta software
- ExtenXLS will support Office Open XML in the forthcoming 7.0 release, which is aiming for a Q4 2008 release.[90]
- Apache POI will support Office Open XML in the forthcoming 3.5 release, currently still in Beta[91]
- Unified Office Format (UOF) Open Source Translator is being developed by Beihang University and partners to convert from Office Open XML to UOF and vice versa.[92]
Implementations of new 2008 version
The newest version of Office Open XML is formally known as either ISO/IEC 29500:2008 or as ECMA-376 2nd edition (December 2008). Microsoft has stated that Microsoft Office 14 will be the first version of Microsoft Office to support ISO/IEC 29500, though no release date has been announced.[50]
On July 28, 2008 Murray Sargent, a software development engineer in the Microsoft Office team confirmed that Word 2007 will have a service pack release that enables it to read and write ISO standard OOXML files.[93]
Microsoft, whose products currently only support the ECMA-376 standard version of Office Open XML, has committed to using the ISO/IEC 29500 standard in their products[94] and has also committed to participate in the maintenance of this standard.
In a Zdnet article Alex Brown, leader of the ISO/IEC group in charge of deciding maintenance processes for any ISO/IEC 29500 Standard, stated
"I am hoping that Microsoft Office will shortly be brought into line with the [ISO/IEC] 29500 specification, and will stay that way".[95]
On March 13, 2008 Doug Mahugh, a Senior Product Manager at Microsoft specializing in Office client interoperability and the Open XML file formats confirmed that version 1.0 of the Open XML Format SDK
"will definitely be 100% compliant with the final ISO/IEC 29500 spec, including the changes accepted at the BRM".[96]
In a ComputerWorld interview from 2008, Doug Mahugh said that
"Microsoft would continue to update the SDK to make sure that applications built with it remained compliant with an Open XML standard as changes were made in the future".[97]
Implementations
- Microsoft Open XML Format SDK[98] contains a set of managed code libraries to create and manipulate Office Open XML files programmatically. Version 1.0 was released on June 10, 2008[99] and incorporates the changes made to the Office Open XML specification made during the current ISO/IEC standardization process.[100][101] Version 2 of the Open XML SDK supports validating Office Open XML documents against the Office Open XML schema, as well as searching in Office Open XML documents.[100]
- PowerTools for Open XML, a set of free open source cmdlets for Windows PowerShell to create and manipulate Office Open XML files at the command line or using shell scripts.[102][103][104]
Criticism
The ODF Alliance UK Action Group says that with OpenDocument an ISO-standard for Office files already exists and that two competing standards are against the very concept of a standard.[105] Further, they argue that the Office Open XML file-format is heavily based on Microsoft's own Office applications and is thus not vendor-neutral, and that it has inconsistencies with existing ISO standards such as time and date formats and color codes.[105]
Specific criticism
- Use of DrawingML and the transitional-use-only VML instead of W3C recommendation SVG.[106] VML did not become a W3C recommendation.[107]
- Use of Office Math ML instead of W3C recommendation MathML.[108]
- Office Open XML does not define a macro language, leaving this aspect to be application-defined.[109]
- The standard is long, with the version submitted to ISO comprising 6546 pages. Google alleges that this length is unnecessary, saying that the OpenDocument specification is 867 pages in length and achieves the same goals.[110] That coupled with the fast track standardization process, Google claims, reduces the review time per page ratio.[110]
- A comparison of some specific items in the format specification documents of Office Open XML and OpenDocument formats is used to claim disharmony within the Office Open XML format.[111]
See also
- List of document markup languages
- Comparison of document markup languages
- Comparison of Office Open XML and OpenDocument
References
- ↑ 1.0 1.1 1.2 Microsoft. "Register file extensions on third party servers". microsoft.com. Retrieved on 2008-04-09.
- ↑ http://www.ecma-international.org/activities/General/presentingecma.ppt
- ↑ Ecma formal publications
- ↑ Microsoft PowerPoint - BSI OOXML v1b.ppt
- ↑ http://www.mass.gov/Aitd/docs/policies_standards/etrmv4dot0/etrmv4dot0information.rtf
- ↑ http://www.itscj.ipsj.or.jp/sc34/open/1026.pdf
- ↑ "Office Open XML wird als weltweiter ISO/IEC-Standard anerkannt". ntz Fachzeitschrift für Informations- und Kommunikationstechnik (Berlin: VDE-Verlag) (3-4): 10. 2008. ISSN 0948-728X. http://www.vde-verlag.de/ntz.html.
- ↑ Techtree.com India > News > Software > ISO puts Microsoft's OOXML Standard on Hold
- ↑ ISO/IEC (2008-11-18). "Publication of ISO/IEC 29500:2008, Information technology - Office Open XML formats". ISO. Retrieved on 2008-11-19.
- ↑ "Freely Available Standards". ITTF (ISO) (2008-11-18).
- ↑ Ecma International (December 7, 2006). "Ecma International approves Office Open XML standard". Press release. Retrieved on 2006-12-08.
- ↑ 12.0 12.1 Standard ECMA-376
- ↑ 13.0 13.1 "ISO/IEC DIS 29500 receives necessary votes for approval as an International Standard". ISO (2008-04-02).
- ↑ http://www.courierpress.com/news/2008/oct/24/microsofts-future-lies-somewhere-beyond-the/
- ↑ http://knowledge.wharton.upenn.edu/article.cfm?articleid=1795
- ↑ Microsoft Expands List of Formats Supported in Microsoft Office
- ↑ http://www.computerworld.com/action/article.do?command=viewArticleBasic&taxonomyName=Protocols+and+Standards&articleId=9089258&taxonomyId=141&pageNumber=1
- ↑ http://consortiuminfo.org/standardsblog/article.php?story=20080521092930864
- ↑ "How to extract information from Office files by using Office file formats and schemas". Microsoft (2007-03-27). Retrieved on 2007-07-10.
- ↑ "Microsoft Office Binary (doc, xls, ppt) File Formats". Microsoft (2008-02-15). Retrieved on 2008-04-05.
- ↑ 21.0 21.1 Brian Jones (2007-01-25). "History of office XML formats (1998–2006)".
- ↑ Telematics between Administrations Committee based on IDA expert group on open document formats (2004-05-25). "TAC approval on conclusions and recommendations on open document formats". IDABC - European eGovernment Services. Retrieved on 2007-07-30.
- ↑ "Microsoft Co-Sponsors Submission of Office Open XML Document Formats to Ecma International for Standardization". Microsoft (2005-11-21).
- ↑ George Ou (2007-04-27). "MS Office 2007 versus Open Office 2.2 shootout". ZDnet.com. Retrieved on 2007-04-27.
- ↑ Jesper Lund Stocholm (2008-01-29). "Do your math - OOXML and OMML". A Mooh Point blog. Retrieved on 2008-02-12.
- ↑ Murray Sargent (2007-06-05). "Science and Nature have difficulties with Word 2007 mathematics". MSDN blogs. Retrieved on 2007-07-31.
- ↑ David Carlisle (2007-05-09). "XHTML and MathML from Office 2007". David Carlisle. Retrieved on 2007-09-20.
- ↑ Wouter Van Vugt (2008-11-1). "Open XML Explained e-book". Openxmldeveloper.org. Retrieved on 2007-09-14.
- ↑ Tom Ngo (December 11, 2006). "Office Open XML Overview" (PDF) 6. Ecma International. Retrieved on 2007-01-23.
- ↑ Rex Jaeshke - project editor (2008-09-26). "N1080 Final Text for ISO/IEC 29500-1, Information technology -- Document description and processing languages -- Office Open XML File Formats -- Part 1: Fundamentals and Markup Language Reference".
- ↑ "What is Ecma International".
- ↑ "Microsoft Covenant Regarding Office 2003 XML Reference Schemas". Microsoft. Retrieved on 2006-07-11.
- ↑ "2 Escape Hatches in MS's Covenant Not to Sue". Groklaw. Retrieved on 2007-01-29.
- ↑ Berlind, David (November 28, 2005). "Top open source lawyer blesses new terms on Microsoft’s XML file format". ZDNet. Retrieved on 2007-01-27.
- ↑ "Microsoft Open Specification Promise". Microsoft (2006-09-12). Retrieved on 2007-04-22.
- ↑ Licensing conditions that Microsoft offers for Office Open XML
- ↑ -Response Document- National Body Comments from 30-Day Review of the Fast Track Ballot for ISO/IEC DIS 29500 (ECMA-376) Office Open XML File Formats
- ↑ Baker & McKenzie (June 2006). "Standardization and Licensing of Microsoft’s Office Open XML Reference Schema" (PDF). Baker & McKenzie. Retrieved on 2007-02-01.
- ↑ "Microsoft's Open Specification Promise Eases Web Services Patent Concerns.". xml.coverpages.org (2006-09-12).
- ↑ "Microsoft promises to hang patent fire over web services" (2006-09-12).
- ↑ "Microsoft Open Specification Promise".
- ↑ Peter Galli (2006-09-12). "Microsoft Promises Not to Sue over Web Services Specs".
- ↑ "Software Freedom Law Center Publishes Analysis of Microsoft's Open Specification Promise new article". Software Freedom Law Center (March 12, 2008).
- ↑ "Software Freedom Law Center Publishes Analysis of Microsoft's Open Specification Promise". Business Wire (March 12, 2008).
- ↑ 45.0 45.1 45.2 "Microsoft's Open Specification Promise: No Assurance for GPL". Software Freedom Law Center (March 12, 2008).
- ↑ "Microsoft Open Specification Promise". Retrieved on 2008-07-28.
- ↑ Winterford, Brett (2008-02-14). "Can Microsoft be trusted on OOXML covenants?", ZDNet. Retrieved on 2008-07-07.
- ↑ ISO/IEC DIS 29500, Information technology -- Office Open XML file formats
- ↑ IS 29500 has been sent to ITTF for publication (Jesper Lund Stocholm, personal blog - "This will finally make it possible for the NBs of ISO to verify that the editorial instructions from the BRM has made it into the final text")
- ↑ 50.0 50.1 Microsoft Expands List of Formats Supported in Microsoft Office
- ↑ "Resolutions of ISO/IEC JTC 1/SC 34 Plenary Meeting, 2008-10-01, Jeju, Republic of Korea". ISO/IEC JTC1/SC34 Document Description and Processing Languages (2008-10-01). Retrieved on 2008-10-09.
- ↑ Administration and Finance
- ↑ http://xml.coverpages.org/ni2007-07-03-a.html
- ↑ "AbiWord v2.6.0 Released". www.abisource.com. Retrieved on 2008-03-24.
- ↑ http://labs.adobe.com/technologies/buzzword/
- ↑ "Apple - iWork - Pages". Retrieved on 2007-07-08.
- ↑ "Apple - iWork - Numbers". Retrieved on 2007-07-08.
- ↑ "Apple - iWork - Keynote". Retrieved on 2007-07-08.
- ↑ ""iPhone User's Guide"" (PDF). Apple, Inc..
- ↑ "Apple - Pro - Tips - Editing Word Files — Without Word". Retrieved on 2008-07-12.
- ↑ "WordPerfect Office X4, Do more with words, numbers and ideas". Corel. Retrieved on 2008-04-18.
- ↑ ""DocumentsToGo for PalmOS Premium Edition"". Dataviz.
- ↑ "Gnumeric 1.8 is Here!". www.gnome.org. Retrieved on 2008-01-28.
- ↑ "Go-OO features".
- ↑ "Microsoft Office Compatibility Pack for Word, Excel, and PowerPoint 2007 File Formats (Version 3)". Microsoft (2007-06-18). Retrieved on 2007-09-04.
- ↑ "Office Compatibility Pack Review". OpenOffice.org Ninja (2008-02-06). Retrieved on 2008-02-26.
- ↑ Amazon. "Microsoft Office 2008 for Mac".
- ↑ "Microsoft Office Mobile 6.1: Upgrade for Microsoft Office 2007 file formats". Microsoft (2007-11-28). Retrieved on 2007-11-29.
- ↑ "NeoOffice 2.2.1 for Mac OS X Released". trinity.neooffice.org (2007-08-26). Retrieved on 2007-10-09.
- ↑ "OpenOffice.org 3.0 New Features" (2008-10-13).
- ↑ OpenXML.biz (2008-06-30). "OpenXML Writer - An opensource editor for generating .docx files".
- ↑ "Microsoft Office 2007 - Office Open XML (OOXML) support in Oxygen XML Editor.". Syncro Soft.. Retrieved on 2008-07-04.
- ↑ "Power Edit MS Word 2007 (DOCX) Support". Retrieved on 2007-10-09.
- ↑ Raju Vegesna (2008-02-27). "Zoho Writer Update: DocX Support, Thesaurus, Group Sharing & More".
- ↑ "Conversieprogramma voor de Open XML-bestandsindeling voor Mac 1.0" (2008-06-24).
- ↑ "OxygenOffice as a Word 2007 (.docx) converter". OpenOffice.org Ninja (2008-02-25). Retrieved on 2008-02-26.
- ↑ "docXConverter - Features". panergy. Retrieved on 2007-01-31.
- ↑ "Easily Translate Open XML to DAISY XML Standards". openxmlcommunity.org. Retrieved on 2008-05-15.
- ↑ Brian Jones (2008-01-17). "Google support for Open XML formats".
- ↑ Microsoft (2008-06-30). "Microsoft Takes Additional Steps in Implementing Interoperability Principles".
- ↑ http://www.dataviz.com/news/press/pressreleases/Detail.html?id=170
- ↑ 82.0 82.1 "Altova Announces Version 2008 Release 2 of its Software Product Line". Retrieved on 2008-04-29.
- ↑ "Altova Adds Word 2007 (OOXML) Capabilities to its Graphical Stylesheet Design Tool and Cuts the Price to Ease Adoption".
- ↑ "Altova XMLSpy Now Supports New ECMA Office Open XML File Formats". Altova. Retrieved on 2008-04-15.
- ↑ "DB2 Content Manager V8.4 Clients Support for Office Open XML File Formats". IBM. Retrieved on 2008-07-09.
- ↑ http://publib.boulder.ibm.com/infocenter/lqkrhelp/v8r0/index.jsp?topic=/com.ibm.lotus.quickr.user.dom.doc/h_AdminSecurityAdd_CreatorHelp.html
- ↑ "Nuance Unveils OmniPage Professional 16". Nuance Communications. Retrieved on 2008-04-15.
- ↑ "Oxygen XML Editor supports Microsoft Office 2007 - Office Open XML (OOXML)". Syncro Soft. Retrieved on 2008-07-04.
- ↑ "Supported Microsoft Office (2007) Open XML file formats". Research In Motion Limited. Retrieved on 2008-06-30.
- ↑ "ExtenXLS Office Open XML Support". Extentech Inc.. Retrieved on 2008-06-30.
- ↑ "Office Open XML Support". Apache POI. Retrieved on 2008-04-14.
- ↑ "UOF OpenXML Translator" (2008-04-15).
- ↑ http://blogs.msdn.com/murrays/archive/2008/07/28/improved-mathml-support-in-word-2007.aspx
- ↑ "An Open Letter from Chris Capossela, Senior Vice President, Microsoft Office". Retrieved on 2008-05-26. "... we are committed to supporting the Open XML specification that is approved by ISO/IEC in our products. ... We are committed to the healthy maintenance of the standard once ratification takes place so that it will continue to be useful and relevant to the rapidly growing number of implementers and users around the world."
- ↑ "Microsoft Office 2007 fails OOXML conformance test".
- ↑ Open XML SDK roadmap
- ↑ Microsoft releasing OOXML SDK
- ↑ Microsoft (2008-06-10). "Open XML Format SDK".
- ↑ Erika Ehrli (2008-06-10). "Announcing the Open XML Format SDK 1.0".
- ↑ 100.0 100.1 Doug Mahugh. "Open XML SDK roadmap". MSDN Blogs. Retrieved on 2008-03-23.
- ↑ Eric Lai (2008-03-12). "Microsoft releasing OOXML SDK".
- ↑ PowerTools for Open XML
- ↑ PowerTools license
- ↑ Microsoft Public License (Ms-PL)
- ↑ 105.0 105.1 "ODF/OOXML technical white paper". Free Software Magazine (2 Mai 2007).
- ↑ "The X Factor". reddevnews.com (October 2007).
- ↑ VML - the Vector Markup Language
- ↑ "Microsoft Office dumped by Science and Nature". ZDNet Australia (18 June 2007).
- ↑ "Microsoft New Zealand’s director of innovation puts the case for a new standard". Computerworld (13 March 2008).
- ↑ 110.0 110.1 "Googles position on OOXML" (PDF).
- ↑ Rob Weir (14 March 2008). "Disharmony of OOXML".
External links
- ECMA-376 site
- ISO/IEC 29500:2008
- OpenXMLDeveloper.org, Microsoft's site for developers
- Open XML Community site Microsoft's site for customers and partners
- "The WordprocessingML Vocabulary", sample chapter from O'Reilly book Office 2003 XMLPDF (1.22 MiB)
- OpenOffice.org, How do I open Microsoft Office 2007 files? Article by OpenOffice.org
- Information technology -- Office Open XML file formats, ISO Standards, JTC 1 Information technology, SC 34
- FAQs on ISO/IEC 29500, ISO's FAQ site on ISO/IEC 29500
- DIS29500.org, the website containing all technical comments submitted prior to the BRM
- DOCX reference document, contains a file with fairly complex formatting and can be used to quickly check compatibility of an implementation
- OpenXML site, contains resources, articles and tools for Office Open XML
- Interoperability study showing an indication of the percentage of support for Office Open XML by several different office suite implementations in aug-2008
Multi-purpose office document file formats |
|
Editable document formats |
Compound Document Format · Microsoft Office XML formats · Office Open XML · Open Document Architecture · OpenDoc · OpenDocument · OpenOffice.org XML · Revisable-Form Text · Rich Text Format · Uniform Office Format · Word Document
|
|
Fixed document formats |
|
|