MEGGINSON_Index.ps 11/11/04 10:31 AM Page 215

Index

Abstraction Auditing data, 80 data, 81 Authors, in-line markup for, 154 standards for, 4, 6–7, 10 Automated conversions with legacy Acceleration, 117, 188 information, 181 ACID (atomicity, consistency, isolation, Automation, 77 durability) tests, 103–105 Addressable data, 98 B2B (business-to-business) Adobe PostScript language automation for, 77 as de facto standard, 26 types of, 134–135 for printing, 73 Backward compatibility, 44 Advanced searches, 148–149 Base64 encoding, 96 Aircraft manuals, 56, 74 Batch requests, 116 American Iron and Steel Institute, 17 Batch searching, 37 American National Standards Institute Berners-Lee, Tim, 13, 162 (ANSI), 11 Binary data Analysis of legacy information, 173–174 external references, 96 ANSI (American National Standards internal, 96–97 Institute), 11 representations of, 189 APIs, 22 space efficiency in, 186 Application specifications, 24–25 Binary Large Objects (BLOBs), 34, 36 Archiving data Binding benefits of, 80 data, 94, 109–110 in XML projects, 34, 36 programming languages, 22 Artificial intelligence in search process, 162 in WSDL, 128–129 Associating Style Sheets with XML BIS (Bureau of Indian Standards), 11 Documents, 20 Bloat in legacy information, 175–176 Asynchronous networking, 119, 126–127 BLOBs (Binary Large Objects), 34, 36 Atomicity in ACID test, 103 Bloomberg service, 38 Attributes Boolean searches, 148 in markup, 64 Bourret, Ronald, 110 in relationships, 87 Bray, Tim, 147–148 repetition of, 188 Broadcast networking, 119

215 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 216

216 Index

Budgets in XML projects, 46–47 CompuServe, 33 Bureau of Indian Standards (BIS), 11 Computer games, external metadata for, 158 Business documents, 134 Concurrent version system (CVS), 36, 74 Business-to-business (B2B) Consistency automation for, 77 in ACID test, 103 types of, 134–135 in standards, 27–29 bzip2 program, 199 Content management of, 74–75 CALS Table Model, 21 non-XML, 95–97 Canonical XML, 19 vs. protocols, 136–137 Capitalization, 58–59 Context in searches, 143–145 Cascading style sheets (CSS), 12 Converting as filters, 23 currency, 147 XML support in, 69 legacy information, 177–182 CDATA sections, 52 XML to HTML, 38 Change markup, 62–63 CORBA (Common Object Request Broker custom publishing issues in, Architecture), 121 65–66 Core specifications, 18 problems from, 63–65 Correlation in searches, 145–147 Character large objects (CLOBs), 36 Country identification codes, 11, Character size, 197–198 100 Client-side rendering, 38 Cover, Robin, 8, 17, 25 Client-side XML, 69 Cover Pages, 8, 17, 25 CLOBs (character large objects), 36 Creation stage in XML projects, 34–35 Cluster computing vs. grid computing, 131 Cross-conversions with legacy information, Code reduction in legacy information, 172 177 Collaboration protocol profiles, 135 Currency Collections, 98 codes for, 11, 100 Common data styles, 81–82 converting, 147 graph, 85–90 Custom coding, transformation for, 37 hierarchical, 90–92 Custom libraries, reading XML with, 110 tabular, 82–85 Custom programming for queries, 192 Common Object Request Broker Custom publishing, 56, 65–66 Architecture (CORBA), 121 CVS (concurrent version system), 36, 74 Compatibility networking for, 115–116 Data, 77 in standards, 27–29 abstraction, 81 in XML projects, 44–46 advantages of, 78 Compressed XML, 107 archiving and auditing, 80 Compression, 199 binding, 94, 109–110 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 217

Index 217

common styles, 81–82 Disruptive technologies, 33, 40 graph, 85–90 Distributed Component Object Model hierarchical, 90–92 (DCOM), 121 tabular, 82–85 Distributed computing, 131 data typing, 105–108 DocBook specification, 13 identification of, 98–99 interoperability with, 151 existing schemes for, 100 for Linux, 25 future possibilities, 101–103 for master documents, 54 in searches, 146 words in, 145 interoperability of, 80–81 (DOM) markup issues in, 93 for parsing, 190–191 non-XML content, 95–97 programming bindings for, 22 whitespace handling, 93–95 tree-based, 180, 185, 192 platform and storage independence, W3C working group for, 13 78–79 Document type definitions (DTDs), 23 resources for, 81 costs of, 46 reusability of, 79 external, referencing, 189–190 self-documentation, 79 for searching, 159–161 summary, 111 top-levels of, 75 transactions, 103–105 validation, 27–28 verification, 79–80 Documents, XML, 51–53 Data interchange format (DIF), 78–79 change markup in, 62–66 Data-oriented XML, 93 client-side XML, 69 Database metadata in searches, 37 content management in, 74–75 DataPower, 188 custom publishing, 56 DCMI (Dublin Core Metadata Initiative), 15, hybrid data publishing, 55–56 149–150 idioms, 72–74 DCOM (Distributed Component Object knowledge preservation in, 58–59 Model), 121 layout control for, 61–62 De facto vs. de jure standards, looseleaf publishing, 66–67 26–27 multiple text flows in, 67–69 Denormalized documents, 71 people limitations in, 60–61 Department of Defense (DoD) networking reusability of, 57, 70–72 model, 136 single-source publishing, 53–54 Diagrams, UML, 25 summary, 76 Dictionaries, 74–75 DoD (Department of Defense) networking DIF (data interchange format), model, 136 78–79 DOM (Document Object Model) Direct mapping to programming interfaces, for parsing, 190–191 110 programming bindings for, 22 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 218

218 Index

DOM (Document Object Model) (cont.) Event-based interfaces, 190–191 tree-based, 180, 185, 192 Everything-is-a-string philosophy, 98 W3C working group for, 13 Excel DOM-like interfaces, 190–191 as standard, 78 pipelines in, 197 tabular style for, 82 for XSLT, 194 XML format for, 24 Domain names, 102–103 Expressions, XPath, 193 Down conversions with legacy information, Extensibility 177 in networking, 114–115 DTDs. See Document type definitions in XML projects, 44–46 (DTDs) Extensible Business Reporting Language Dublin Core (XBRL) format, 80 for consistency and compatibility, 29 Extensible Hypertext for RDF data, 156 (XHTML), 25 with Semantic Web, 164 consistency in, 29 Dublin Core Metadata Initiative (DCMI), 15, interoperability with, 151 149–150 External metadata in searches, 154–159 Duplicated parsing, 196–197 External references, 189–190 Durability in ACID test, 104 Facades for legacy interfaces, 168–170 ebXML Fatal errors, 115 as B2B initiative, 135 Fear, uncertainty, and doubt (FUD) development of, 14 standards, 7–8 EDI (electronic data interchange), 14, Feed readers and aggregators in RSS, 134 134–135 Fielding, Roy, 119 8-bit ASCII encoding, 197 Financial-planning software, 133 Element content, 95 Firewalls, 117 Element names, repetition of, 188 Flexibility in switching, 139 Emacs LISP language, 70 Footnotes, text flow in, 68 Embeddable specifications, 19–21 Format level, interoperability in, 5 Emergency phone service, 5 Formats vs. protocols, 113 Empty tags, 63 Formatting Encoding, 189, 197–198 change markup, 62–66 End users in search process, 161 as information, 58–59 Enterprise resource planning (ERP), 176 vs. meaning, 54 Entities in relationships, 87 Forward compatibility, 44–46 Enumerated values, 106 FoxPro, 78 ERP (enterprise resource planning), Frustration problem, 41 176 FTP, plaintext with, 185, 187 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 219

Index 219

FUD (fear, uncertainty, and doubt) standards, for online documentation, 53 7–8 SGML inspiration for, 52 Full-text searches, 37, 143 vs. XML, 78 Fully normalized data, 86–90 HTTP (Hypertext Transfer Protocol) extensibility of, 115 Generalized Markup Language (GML), 51 GET headers in, 137–138 GET headers and requests library problems with, 10 in HTTP, 137–138 plaintext with, 185, 188 in SOAP, 123 specifications for, 12 Global identifiers, 100–102 support for, 113 Globus Toolkit, 132 for transport, 34 GML (Generalized Markup Language), 51 Human intelligence in search process, 161 spreadsheet, XML format for, 24 Human-readable format, 170–172 Goldfarb, Charles, 51 Hybrid data publishing, 55–56 Google search engine, 143–144 Hypertext Markup Language. See HTML Granularity (Hypertext Markup Language) in in-line markup, 153 Hypertext Transfer Protocol. See HTTP in legacy information mismatches, (Hypertext Transfer Protocol) 174–175 HyTime specification, 157 Graph style, 85–90 Graphic Communications Association, 14 IANA (Internet Assigned Numbers Grid computing, 131–132 Authority), 101 Growth in legacy information, 176 IBM databases, XML support for, 36 Hardware acceleration, 117, 188 ICANN (International Corporation for Hardware support in performance, 187–188 Assigned Names and Numbers), 101 Headers in HTTP, 137–138 ICAO (International Civil Aviation Health Level Seven (HL7), 17 Organization) airport identifiers, 101 Heuristics in legacy information ICE (Information and Content Exchange), conversions, 177 15, 38, 134 Hierarchical style, 90–92 IDEAlliance, 14–15 Holman, Ken, 23, 195 Identification of data, 98–99 Homographs in searches, 144 existing schemes for, 100 HTML (Hypertext Markup Language) future possibilities, 101–103 converting XML to, 38 in searches, 146 development of, 12 Idioms, 72–74 extensibility of, 115 IETF (Internet Engineering Task Force), 12 forms and scripts with, 9 Ignorable whitespace, 95 meta elements in, 149–150 In-line markup, 151–154 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 220

220 Index

Independence, platform and storage, 78–79 Invited experts, 13 Indexes IP (Internet Protocol), 115 in in-line markup, 152 IPC (interprocess communications), 121 in structural searches, 37 IPTC (International Press Influence in XML projects, 42–44 Telecommunication Council), 17 Information and Content Exchange (ICE), ISBN (International Standard Book 15, 38, 134 Number), 100 Information exchange, 91 ISO (International Organization for Information publishing, 91 Standardization), 11–12 Inheritance relationships in searches, ISO-8859 encoding, 198 163–164 ISO Latin 1 alphabet, 197 Intelligence in search process, 161–162 Isolation in ACID test, 103–104 Interface designers in search process, 161 Interfaces, legacy, 168–170 Java language Internal text entities, 71 memory leaks with, 10 Internalization and size, 198–199 for server-side development, 9 International Civil Aviation Organization JavaDoc system, 71 (ICAO) airport identifiers, 101 Jazz standards, 3 International Corporation for Assigned JSPs (JavaServer Pages), 9 Names and Numbers (ICANN), 101 Just-in-time rendering, 38 International Organization for Standardization (ISO), 11–12 Keys International Press Telecommunication for identification, 99 Council (IPTC), 17 for relational databases, 85 International Standard Book Number keywords meta element, 150 (ISBN), 100 Knowledge preservation in documents, Internationalization in networking, 114 58–59 Internet, interoperability in, 5 Knuth, Donald, 71 Internet Assigned Numbers Authority KOffice spreadsheet, XML format for, 24 (IANA), 101 Koine (Greek dialect), 167–168 Internet Engineering Task Force (IETF), 12 Internet Explorer, XSLT support in, 22 Language codes, 11, 100 Internet Protocol (IP), 115 LaTeX markup, 51–52 Internet worm, 9 Layer-8 protocols, 138 Interoperability Layering in networking, 136–138 of data, 80–81 Layout control for documents, 61–62 in searches, 150–151 Layout designers, costs of, 47 standards for, 4–6 Legacy information, 167 Interprocess communications (IPC), 121 advantages of XML for, 168 Inventory information, 109–110 analysis of, 173–174 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 221

Index 221

bloat in, 175–176 Mathematical equations, support for, 96 code reduction in, 172 Mathematical Markup Language (MathML), conversion strategies for, 177–182 19–20 granularity mismatches in, 174–175 Meaning vs. formatting, 54 growth and versioning in, 176 Measurement units, identification codes interfaces, 168–170 for, 100 metadata with, 182–184 Medium access control (MAC) addresses, network effect in, 172–173 101 summary, 184 Megginson, David, 15–16 transparency for, 170–172 Memory leaks, 10 virtual XML for, 180–182 meta element, 149–150 LEPs (lists of effective pages), 66–67 Metadata Libraries as commodities, 158 HTML, 10 external, 154–159 reading XML with, 110 with legacy information, 182–184 for reuse, 6 in searches, 37, 154–159 Life cycles of XML projects, 33–35 LISP language, 70 as standard, 78 Lists of effective pages (LEPs), 66–67 tabular style for, 82 Literate programming, 71 XML format for, 24 Logical format for data, 79 Microsoft Networks (MSN), 5 Looseleaf publishing, 66–67 Microsoft Visual SourceSafe, 36, 74 Loosely-coupled data, 79 MIF for printing, 73 Lorie, Ray, 51 Mixed content, 52, 95 Mixed conversions with legacy information, MAC (medium access control) addresses, 177 101 Monocultures from standards, 8–9 Magna Carta, 5 Morris Internet worm, 9 Markup budgets in XML projects, 46–47 Mosher, Ed, 51 Markup issues Mozilla Project, 17 change markup, 62–63 XML support in, 69 custom publishing issues in, XSLT support in, 22 65–66 MSN (Microsoft Networks), 5 problems from, 63–65 Multiple text flows in documents, 67–69 non-XML content, 95–97 in searches, 151 Namespaces, 18 external metadata, 154–159 in compatibility, 46 in-line markup, 151–154 consistency in, 28 whitespace handling, 93–95 for embedded specifications, 19–20 Marshaling, 116 for global identifiers, 102 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 222

222 Index

Namespaces (cont.) NITF (News Industry Text Format), for naming collisions, 115 144–146 in searches, 159–164 Non-XML content, 95–97 Namespaces Recommendation, 18 Normalized data, 86–90 Network effect in legacy information, 172–173 OASIS (Organization for Advancement of in transportation standards, 4–5 Structured Information Standards), Networking, 113–114 13–14 advantages of, 114 ad hoc groups, 15–16 for compatibility, 115–116 IDEAlliance, 14–15 disadvantages of, 117 RosettaNet, 15 extensibility in, 114–115 specialist groups, 16–17 internationalization in, 114 WS-I, 14 layering in, 136–138 Object-oriented programming, 71 performance of, 117 OGSA (Open Grid Services Architecture), resources for, 116 131–132 security for, 117–118 OGSI (Open Grid Services Infrastructure), states in, 118 131–132 styles, 118–120 One-way conversions with legacy asynchronous, 119, 126–127 information, 178 grid computing, 131–132 Opaque names, searching documents with, miscellaneous, 134–135 160–161 RPC, 121–126 Open Grid Services Architecture (OGSA), syndication, 132–134 131–132 Web Services, 127–131 Open Grid Services Infrastructure (OGSI), summary, 139–140 131–132 switching and routing in, Open Office spreadsheet, XML format for, 138–139 24 transparency in, 114 Open Systems Interconnection (OSI) model, New Testament, 167 136–137 News Industry Text Format (NITF), 144–146 Oracle databases, XML support for, 36 NewsML Organization for Advancement of Structured identification codes in, 99 Information Standards. See OASIS interoperability of data in, 80 (Organization for Advancement of for referencing external objects, 97 Structured Information Standards) standardized metaformat in, 157 OSI (Open Systems Interconnection) model, for syndication, 134 136–137 for transport, 38 Overwork problem, 41 Newspapers, text flow in, 68–69 OWL (), 164 911 phone service, 5 Oxford English Dictionary, 74 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 223

Index 223

Packets, HTTP, 138 markup budgets in, 46–47 Parsers and parsing pitfalls in, 38 duplicated, 196–197 rendering stage, 38 external references with, 190–191 resistance to change in, 40–41 in in-line markup, 152 search and retrieval, 36–37 performance of, 188, 190–191 social side of, 42–44 SAX for, 180–181 storage and archiving stage, 36 security of, 117–118 summary, 47 Unicode support by, 197 technology in, 44–47 Pattern matching in legacy information transformation stage, 37–38 conversions, 177 transport stage, 38 PDF format for printing, 73 unrealistic expectations in, Pecking order in XML projects, 42–43 39–40 People limitations, 60–61 unspoken expectations in, 39 Performance, 185 Platform independence, 78–79 encoding in, 189 Polling in RSS, 134 external references in, 189–190 POP3, plaintext with, 185 of networking, 117 Port types in WSDL, 128 parsing interfaces in, 190–191 Postel, John, 115 of pipelines, 196–197 PostScript language of queries, 191–194 as de facto standard, 26 repetition in, 188–189 for printing, 73 size in, 197–199 Potato blight, 9 software and hardware support in, Primary keys for identification, 99 187–188 Printing, 73 space efficiency in, 186 PRISM (Publishing Requirements summary, 199–200 for Industry Standard Metadata), 15, with transformations, 194–196 29 Photographs, external metadata for, 157–158 Processing instructions, 63–64 Physical data format, 79 Processing pipelines, 196–197 Phytophthora infestans, 9 Profiles, WS-I, 14 Pipeline performance, 196–197 Programmers in search process, 161 Plaintext, 187–188 Programming interfaces, direct mapping to, performance with, 185 110 for transparency, 170–172 Property inheritance in RDF, 163–164 Planning XML projects, 33 Protocol level, interoperability in, 5 components in, 33–35 Protocols creation stage, 35 vs. content, 136–137 forward compatibility and extensibility vs. formats, 113 in, 44–46 Proximity searches, 148 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 224

224 Index

Publish/subscribe networking style, RelaxNG schema language 119–120, 132–134 development of, 14 Publishing Requirements for Industry external references with, 190 Standard Metadata (PRISM), 15, 29 schema, 23 Punctuation, 58–59 Remote Method Invocation (RMI), 121 Remote procedure calls (RPC), 121–122 Queries with REST, 122–124 performance of, 191–194 with SOAP client, 123–126 WSDL, 128 vs. syndication, 132–133 Rendering stage in XML projects, 34, 38 Railroad system standards, 4–5, 7 Repeatable information, 86 Random access Repetition in performance and size, 188–189 for queries, 193 Representational state transfer (REST) for references, 89 networking in SAX-like interfaces, 191 characteristics of, 119–120 in XSLT, 194–195 RPC with, 122–124 Raw tables, 83–84 Request/response networking style, 119–120 RDDL (Resource Directory Description Resistance to change, 40–41 Language), 15–16 Resource Description Framework. See RDF RDF (Resource Description Framework) (Resource Description Framework) data representation in, 29 Resource Directory Description Language for data sharing, 78 (RDDL), 15–16 embedded data in, 21 Resources for normalized data, 89 for data, 81 property inheritance in, for networking, 116 163–164 Responses in WSDL, 128 schema languages for, 23 REST (Representational state transfer) in searches, 156, 159 networking Semantic Web with, 162 characteristics of, 119–120 specifications for, 10 RPC with, 122–124 standardized metaformat in, 157 Reusability whitespace handling in, 94 data, 79 Reactivity, 188 documents, 57, 70–72 Readability of data, 84–85, 89 standards for, 4, 6 References, external Reuters, 38 binary, 96 Rival management in XML projects, 43–44 DTDs, 189–190 RMI (Remote Method Invocation), 121 Regions, identification codes for, 100 Robustness Principle, 115 Relational databases, 85 Roman Empire, 167 Relationships, 87, 163 RosettaNet, 15, 135 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 225

Index 225

Round-trip conversions with legacy summary, 164–165 information, 178–179 trust in, 149–150 Routing in networking, 138–139 usability in, 147–149 RPC (remote procedure calls), 121–122 Security for networking, 117–118 with REST, 122–124 Self-documentation, 79 with SOAP client, 124–126 Self-labeling data, 172 vs. syndication, 132–133 Semantic Web project RSS, 15–16 with RDF, 21, 25, 28 consistency in, 29 in searching, 162–164 and syndication, 134 Session states in networking, 118 for transport, 38 SGML (Standard Generalized Markup for web logs, 25 Language), 11–12, 51–53 RTF format for printing, 73 SGML Open, 13 Sharing information, 45, 78 SAI (Standards Australia International), 11 Shift-JIS encoding, 197 Sarvega, 188 Simple API for XML (SAX) and SAX-like SAX (Simple API for XML) and SAX-like interfaces, 16 interfaces, 16 development of, 15–16 as de facto standard, 26 for parsing, 180–181, 190–191 development of, 15–16 programming bindings for, 22 for parsing, 180–181, 190–191 Simple Mail Transfer Protocol (SMTP), 126 pipelines in, 197 attacks on, 9 (SVG) plaintext with, 185, 187 development of, 19 Simple Object Access Protocol (SOAP) support for, 96 with REST, 123–124 Schemas, 107–108 RPC with, 124–126 for data identification, 100 Single-source publishing in searches, 159–164 characteristics of, 53–54 as stylesheets, 23 vs. idioms, 73 Screen-based formats, 73 Size, 185 Search stage in XML projects, 34, 36–37 compression in, 199 Searching, 143 external references in, 189–190 context in, 143–145 and internalization, 198–199 correlation in, 145–147 repetition in, 188–189 interoperability in, 150–151 space efficiency in, 186 markup considerations in, 151 summary, 199–200 external metadata, 154–159 Unicode and character size, 197–198 in-line markup, 151–154 SMTP (Simple Mail Transfer Protocol), 126 namespaces, schemas, and Semantic Web attacks on, 9 in, 159–164 plaintext with, 185, 187 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 226

226 Index

SOAP (Simple Object Access Protocol) XML-related specifications, 17–18 with REST, 123–124 application specifications, 24–25 RPC with, 124–126 core specifications, 18 Social side of XML projects, 42–44 embeddable, 19–21 Software components for reuse, 6 utility specifications, 21–24 Software support in performance, 187–188 Standards Australia International (SAI), 11 Solicit/response method, 131 Star Office, XML format for, 24 Space efficiency, 186 Stateless protocols, 118 Specifications vs. standards, 3 States in networking, 118 Spelling, 60–61 Stephenson, George, 4 Splitting markup, 64 Stock quotes, syndication for, 132–133 Storage and archiving stage in XML RPC model for, 133 projects, 34, 36 standards for, 78 Storage independence, 78–79 tabular style for, 82 Streaming APIs, 22 XML format for, 24 Strings SQL for searches, 34 efficiency of, 105 Square peg problems for standards, 9 everything-is-a-string philosophy, 98 Standard Generalized Markup Language Structural searching, 37 (SGML), 11–12, 51–53 Structural transformations, XSLT for, 194 Standard structures for metadata markup, Structured programming, 71 157 Style sheets, 12 Standards, 3 costs of, 46 advantages of, 4 as filters, 23 abstraction, 6–7 linking, 20 interoperability, 4–6 XML support in, 69 reuse, 6 Styles, networking, 118–120 consistency and compatibility in, 27–29 asynchronous, 119, 126–127 de facto vs. de jure, 26–27 grid computing, 131–132 disadvantages of, 7–8 miscellaneous, 134–135 abstraction, 10 RPC, 121–126 FUD, 8 syndication, 132–134 monocultures, 8–9 Web Services, 127–131 square pegs, 9 Subsets in searches, 147 organizations for, 10 Supersets in searches, 147 ISO, 11–12 SVG (Scalable Vector Graphics) OASIS, 13–17 development of, 19 W3C, 12–13 support for, 96 summary, 29–31 Switching in networking, 138–139 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 227

Index 227

Synchronous networking, 119 Trust Syndication in networking, 132–134 in searches, 149–150 Synonyms in searches, 147 in Semantic Web, 164 System integraters in search process, 161 Two-phase commits, 105 type field, 106 Tabular style, 82–85 TCP (Transmission Control Protocol), 115 U.S. Patent and Trademark Office (USPTO), Technical committee (TC), 14 17 Technology in XML projects, 44–47 UCS-4 encoding, 189 TEI. See Text Encoding Initiative (TEI) UDDI (Universal Description, Discovery, TELNET, plaintext with, 185, 187 and Integration of Web Services), TeX language, 51 131 TeXInfo, 54 UML (Unified Modeling Language), 25 Text Encoding Initiative (TEI), 17, 25 UN/EDIFACT, 135 consistency in, 29 Unicast networking, 119 interoperability with, 151 Unicode encoding words in, 145 for internationalization, 114 Tracking changes in markup, 65 ISO specification, 11 Training, 47, 60–61 size of, 197–198 Transactions, 103–105 UCS-4, 189 Transformation pipeline, 37 Unified Modeling Language (UML), 25 Transformation stage in XML projects, Uniform Code Council, 100 37–38 Uniform resource identifiers (URIs), 101 Transformations Uniform resource name (URNs), 102 in hybrid data publishing, 55–56 unit attribute, 147 performance with, 194–196 United Nations Centre for Trade Facilitation XSL. See XSL Transformations and Electronic Business (UN- (XSLT) CEFACT), 14 Translators for legacy interfaces, 168 Units of measure, identification codes for, Transmission Control Protocol (TCP), 115 100 Transparency of information, 107 Universal Description, Discovery, and for legacy information, 170–172 Integration of Web Services (UDDI), in networking, 114 131 Transport stage in XML projects, 34, 38 Universal product codes (UPCs), 100 Transportation system standards, 4–5 Untyped XML data, 106–107 Tree-based APIs, 22 Up conversions with legacy information, Tree-based DOM, 180, 185, 192 177 Tree-based interfaces, 190–191 UPCs (universal product codes), 100 TROFF language, 51, 54 URIs (uniform resource identifiers), 101 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 228

228 Index

URLs Web pages in relationships, 87 RESTful, 124 Web Services, 14 vs. URNs, 102 specifications for, 10 URNs (uniform resource name), 102 working with, 127–131 Usability in searches, 147–149 Web Services Definition Language (WSDL), User control in XML projects, 42 128–131 User training, 47, 60–61 Web Services Interoperability Organization USPTO (U.S. Patent and Trademark Office), (WS-I), 14 17 Web Services Reliable Messaging (WSRM), UTF-8 encoding, 189, 197–198 14 UTF-16 encoding, 189, 197–198 Web Services Security (WSS), 14 Utility specifications, 21–24 Whitespace in formatting, 58–59 Validation handling, 93–95 data, 79–80 Workload management in XML projects, 43 DTDs, 27–28 World Wide Web Consortium (W3C), 8, Vector graphics (SVG) 12–13 development of, 19 for namespace identifiers, 102 support for, 96 Semantic Web project, 25 Verification of data, 79–80 Wrapping interfaces, 173, 184 version attribute, 46 WS-I (Web Services Interoperability Versioning in legacy information, 176 Organization), 14 Video clips, metadata for, 183–184 WSDL (Web Services Definition Language), Virtual Reality Markup Language (VRML), 128–131 9 WSRM (Web Services Reliable Messaging), Virtual XML, 180–182 14 spreadsheet, 82 WSS (Web Services Security), 14 Visual SourceSafe, 36, 74 WYSIWYG systems, 40 Vocabulary in searches, 148 VRML (Virtual Reality Markup Language), XBRL (Extensible Business Reporting 9 Language) format, 80 XBRL International, 17 W3C (World Wide Web Consortium), 8, XForms, 21 12–13 XHTML (Extensible Hypertext Markup for namespace identifiers, 102 Language), 25 Semantic Web project, 25 consistency in, 29 , XSLT for, 22 interoperability with, 151 web logs, 16, 25 XLink, 20 Web Ontology Language (OWL), 164 consistency in, 28 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 229

Index 229

for referencing external objects, 97 XML Topic Maps (XTM), 25 W3C working group for, 13 for data sharing, 78 XML support in, 69 standardized metaformat in, 157 XMI (XML Metadata Interchange), 25 whitespace handling in, 94 XML applications, 16 XPath language XML Applications and Initiatives page, 25 for content switching, 139 XML Encryption, 21, 24 for queries, 192–193 XML Entity Catalog specification, 24 random access in, 194 XML Information Set, 19 structural searching in, 37 :lang attribute, 19 XPOINTER language, 20–21 XML Linking Language. See XLink W3C working group for, 13 XML Metadata Interchange (XMI), 25 XML support in, 69 XML Namespaces, 18 XQuery consistency in, 28 consistency in, 28 for embedded specifications, 19–20 development of, 22–23 for global identifiers, 102 structural searching in, 37 XML Pointer Language, 20–21 XSL Transformations (XSLT) W3C working group for, 13 characteristics, 22 XML support in, 69 limitations of, 9 XML Recommendation, 18 performance with, 185, 188, XML Schema language, 14, 107–108 194–195 external references with, 190 XML support in, 69 specification for, 23 XSLT. See XML Stylesheet Language XML Signature, 21, 24 (XSLT); XSL Transformations xml:space attribute, 19, 94–95 (XSLT) XML Stylesheet Language (XSLT) XTM (XML Topic Maps), 25 consistency in, 29 for data sharing, 78 for rendering, 34 standardized metaformat in, 157 W3C working group for, 13 whitespace handling in, 94 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 230 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 231