Abstraction Data, 81 Standards For, 4, 6–7, 10 Acceleration, 117, 188 ACID
Total Page:16
File Type:pdf, Size:1020Kb
MEGGINSON_Index.ps 11/11/04 10:31 AM Page 215 Index Abstraction Auditing data, 80 data, 81 Authors, in-line markup for, 154 standards for, 4, 6–7, 10 Automated conversions with legacy Acceleration, 117, 188 information, 181 ACID (atomicity, consistency, isolation, Automation, 77 durability) tests, 103–105 Addressable data, 98 B2B (business-to-business) Adobe PostScript language automation for, 77 as de facto standard, 26 types of, 134–135 for printing, 73 Backward compatibility, 44 Advanced searches, 148–149 Base64 encoding, 96 Aircraft manuals, 56, 74 Batch requests, 116 American Iron and Steel Institute, 17 Batch searching, 37 American National Standards Institute Berners-Lee, Tim, 13, 162 (ANSI), 11 Binary data Analysis of legacy information, 173–174 external references, 96 ANSI (American National Standards internal, 96–97 Institute), 11 representations of, 189 APIs, 22 space efficiency in, 186 Application specifications, 24–25 Binary Large Objects (BLOBs), 34, 36 Archiving data Binding benefits of, 80 data, 94, 109–110 in XML projects, 34, 36 programming languages, 22 Artificial intelligence in search process, 162 in WSDL, 128–129 Associating Style Sheets with XML BIS (Bureau of Indian Standards), 11 Documents, 20 Bloat in legacy information, 175–176 Asynchronous networking, 119, 126–127 BLOBs (Binary Large Objects), 34, 36 Atomicity in ACID test, 103 Bloomberg service, 38 Attributes Boolean searches, 148 in markup, 64 Bourret, Ronald, 110 in relationships, 87 Bray, Tim, 147–148 repetition of, 188 Broadcast networking, 119 215 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 216 216 Index Budgets in XML projects, 46–47 CompuServe, 33 Bureau of Indian Standards (BIS), 11 Computer games, external metadata for, 158 Business documents, 134 Concurrent version system (CVS), 36, 74 Business-to-business (B2B) Consistency automation for, 77 in ACID test, 103 types of, 134–135 in standards, 27–29 bzip2 program, 199 Content management of, 74–75 CALS Table Model, 21 non-XML, 95–97 Canonical XML, 19 vs. protocols, 136–137 Capitalization, 58–59 Context in searches, 143–145 Cascading style sheets (CSS), 12 Converting as filters, 23 currency, 147 XML support in, 69 legacy information, 177–182 CDATA sections, 52 XML to HTML, 38 Change markup, 62–63 CORBA (Common Object Request Broker custom publishing issues in, Architecture), 121 65–66 Core specifications, 18 problems from, 63–65 Correlation in searches, 145–147 Character large objects (CLOBs), 36 Country identification codes, 11, Character size, 197–198 100 Client-side rendering, 38 Cover, Robin, 8, 17, 25 Client-side XML, 69 Cover Pages, 8, 17, 25 CLOBs (character large objects), 36 Creation stage in XML projects, 34–35 Cluster computing vs. grid computing, 131 Cross-conversions with legacy information, Code reduction in legacy information, 172 177 Collaboration protocol profiles, 135 Currency Collections, 98 codes for, 11, 100 Common data styles, 81–82 converting, 147 graph, 85–90 Custom coding, transformation for, 37 hierarchical, 90–92 Custom libraries, reading XML with, 110 tabular, 82–85 Custom programming for queries, 192 Common Object Request Broker Custom publishing, 56, 65–66 Architecture (CORBA), 121 CVS (concurrent version system), 36, 74 Compatibility networking for, 115–116 Data, 77 in standards, 27–29 abstraction, 81 in XML projects, 44–46 advantages of, 78 Compressed XML, 107 archiving and auditing, 80 Compression, 199 binding, 94, 109–110 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 217 Index 217 common styles, 81–82 Disruptive technologies, 33, 40 graph, 85–90 Distributed Component Object Model hierarchical, 90–92 (DCOM), 121 tabular, 82–85 Distributed computing, 131 data typing, 105–108 DocBook specification, 13 identification of, 98–99 interoperability with, 151 existing schemes for, 100 for Linux, 25 future possibilities, 101–103 for master documents, 54 in searches, 146 words in, 145 interoperability of, 80–81 Document Object Model (DOM) markup issues in, 93 for parsing, 190–191 non-XML content, 95–97 programming bindings for, 22 whitespace handling, 93–95 tree-based, 180, 185, 192 platform and storage independence, W3C working group for, 13 78–79 Document type definitions (DTDs), 23 resources for, 81 costs of, 46 reusability of, 79 external, referencing, 189–190 self-documentation, 79 for searching, 159–161 summary, 111 top-levels of, 75 transactions, 103–105 validation, 27–28 verification, 79–80 Documents, XML, 51–53 Data interchange format (DIF), 78–79 change markup in, 62–66 Data-oriented XML, 93 client-side XML, 69 Database metadata in searches, 37 content management in, 74–75 DataPower, 188 custom publishing, 56 DCMI (Dublin Core Metadata Initiative), 15, hybrid data publishing, 55–56 149–150 idioms, 72–74 DCOM (Distributed Component Object knowledge preservation in, 58–59 Model), 121 layout control for, 61–62 De facto vs. de jure standards, looseleaf publishing, 66–67 26–27 multiple text flows in, 67–69 Denormalized documents, 71 people limitations in, 60–61 Department of Defense (DoD) networking reusability of, 57, 70–72 model, 136 single-source publishing, 53–54 Diagrams, UML, 25 summary, 76 Dictionaries, 74–75 DoD (Department of Defense) networking DIF (data interchange format), model, 136 78–79 DOM (Document Object Model) Direct mapping to programming interfaces, for parsing, 190–191 110 programming bindings for, 22 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 218 218 Index DOM (Document Object Model) (cont.) Event-based interfaces, 190–191 tree-based, 180, 185, 192 Everything-is-a-string philosophy, 98 W3C working group for, 13 Excel spreadsheet DOM-like interfaces, 190–191 as standard, 78 pipelines in, 197 tabular style for, 82 for XSLT, 194 XML format for, 24 Domain names, 102–103 Expressions, XPath, 193 Down conversions with legacy information, Extensibility 177 in networking, 114–115 DTDs. See Document type definitions in XML projects, 44–46 (DTDs) Extensible Business Reporting Language Dublin Core (XBRL) format, 80 for consistency and compatibility, 29 Extensible Hypertext Markup Language for RDF data, 156 (XHTML), 25 with Semantic Web, 164 consistency in, 29 Dublin Core Metadata Initiative (DCMI), 15, interoperability with, 151 149–150 External metadata in searches, 154–159 Duplicated parsing, 196–197 External references, 189–190 Durability in ACID test, 104 Facades for legacy interfaces, 168–170 ebXML Fatal errors, 115 as B2B initiative, 135 Fear, uncertainty, and doubt (FUD) development of, 14 standards, 7–8 EDI (electronic data interchange), 14, Feed readers and aggregators in RSS, 134 134–135 Fielding, Roy, 119 8-bit ASCII encoding, 197 Financial-planning software, 133 Element content, 95 Firewalls, 117 Element names, repetition of, 188 Flexibility in switching, 139 Emacs LISP language, 70 Footnotes, text flow in, 68 Embeddable specifications, 19–21 Format level, interoperability in, 5 Emergency phone service, 5 Formats vs. protocols, 113 Empty tags, 63 Formatting Encoding, 189, 197–198 change markup, 62–66 End users in search process, 161 as information, 58–59 Enterprise resource planning (ERP), 176 vs. meaning, 54 Entities in relationships, 87 Forward compatibility, 44–46 Enumerated values, 106 FoxPro, 78 ERP (enterprise resource planning), Frustration problem, 41 176 FTP, plaintext with, 185, 187 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 219 Index 219 FUD (fear, uncertainty, and doubt) standards, for online documentation, 53 7–8 SGML inspiration for, 52 Full-text searches, 37, 143 vs. XML, 78 Fully normalized data, 86–90 HTTP (Hypertext Transfer Protocol) extensibility of, 115 Generalized Markup Language (GML), 51 GET headers in, 137–138 GET headers and requests library problems with, 10 in HTTP, 137–138 plaintext with, 185, 188 in SOAP, 123 specifications for, 12 Global identifiers, 100–102 support for, 113 Globus Toolkit, 132 for transport, 34 GML (Generalized Markup Language), 51 Human intelligence in search process, 161 Gnumeric spreadsheet, XML format for, 24 Human-readable format, 170–172 Goldfarb, Charles, 51 Hybrid data publishing, 55–56 Google search engine, 143–144 Hypertext Markup Language. See HTML Granularity (Hypertext Markup Language) in in-line markup, 153 Hypertext Transfer Protocol. See HTTP in legacy information mismatches, (Hypertext Transfer Protocol) 174–175 HyTime specification, 157 Graph style, 85–90 Graphic Communications Association, 14 IANA (Internet Assigned Numbers Grid computing, 131–132 Authority), 101 Growth in legacy information, 176 IBM databases, XML support for, 36 Hardware acceleration, 117, 188 ICANN (International Corporation for Hardware support in performance, 187–188 Assigned Names and Numbers), 101 Headers in HTTP, 137–138 ICAO (International Civil Aviation Health Level Seven (HL7), 17 Organization) airport identifiers, 101 Heuristics in legacy information ICE (Information and Content Exchange), conversions, 177 15, 38, 134 Hierarchical style, 90–92 IDEAlliance, 14–15 Holman, Ken, 23, 195 Identification of data, 98–99 Homographs in searches, 144 existing schemes for, 100 HTML (Hypertext Markup Language) future possibilities, 101–103 converting XML to, 38 in searches, 146 development of, 12 Idioms, 72–74 extensibility of, 115 IETF (Internet Engineering Task Force), 12 forms and scripts with, 9 Ignorable whitespace, 95 meta elements in, 149–150 In-line markup, 151–154 MEGGINSON_Index.ps 11/11/04 10:31 AM Page 220 220 Index Independence, platform and storage, 78–79 Invited experts, 13 Indexes IP (Internet Protocol), 115 in in-line markup, 152 IPC (interprocess communications), 121 in structural searches, 37 IPTC (International