Python Default File Format for Download Download Files with Progress in Python
Total Page:16
File Type:pdf, Size:1020Kb
python default file format for download Download files with progress in Python. This is a coding tip article. I will show you how to download files with progress in Python. The sauce here is to make use of the wget module. First, install the module into the environment. The wget module is pretty straight-forward, only one function to access, wget.download() . Let say we want to download this file http://download.geonames.org/export/zip/US.zip, the implementation will be following: The output will look like this: As you can see, it prints out the progress bar for us to know the progress of downloading, with current bytes retrieved with total bytes. The s econd parameter is to set output filename or directory for output file. There is another parameter, bar=callback(current, total, width=80) . This is to define how the progress bar is rendered to output screen. Canonical specification. The canonical version of the wheel format specification is now maintained at https://packaging.python.org/specifications/binary-distribution-format/ . This may contain amendments relative to this PEP. Abstract. This PEP describes a built-package format for Python called "wheel". A wheel is a ZIP-format archive with a specially formatted file name and the .whl extension. It contains a single distribution nearly as it would be installed according to PEP 376 with a particular installation scheme. Although a specialized installer is recommended, a wheel file may be installed by simply unpacking into site-packages with the standard 'unzip' tool while preserving enough information to spread its contents out onto their final paths at any later time. PEP Acceptance. This PEP was accepted, and the defined wheel version updated to 1.0, by Nick Coghlan on 16th February, 2013 [1] Rationale. Python needs a package format that is easier to install than sdist. Python's sdist packages are defined by and require the distutils and setuptools build systems, running arbitrary code to build-and-install, and re-compile, code just so it can be installed into a new virtualenv. This system of conflating build-install is slow, hard to maintain, and hinders innovation in both build systems and installers. Wheel attempts to remedy these problems by providing a simpler interface between the build system and the installer. The wheel binary package format frees installers from having to know about the build system, saves time by amortizing compile time over many installations, and removes the need to install a build system in the target environment. Details. Installing a wheel 'distribution-1.0-py32-none-any.whl' Wheel installation notionally consists of two phases: Unpack. Parse distribution-1.0.dist-info/WHEEL . Check that installer is compatible with Wheel-Version. Warn if minor version is greater, abort if major version is greater. If Root-Is-Purelib == 'true', unpack archive into purelib (site-packages). Else unpack archive into platlib (site-packages). Spread. Unpacked archive includes distribution-1.0.dist-info/ and (if there is data) distribution-1.0.data/ . Move each subtree of distribution- 1.0.data/ onto its destination path. Each subdirectory of distribution-1.0.data/ is a key into a dict of destination directories, such as distribution- 1.0.data/(purelib|platlib|headers|scripts|data) . The initially supported paths are taken from distutils.command.install . If applicable, update scripts starting with #!python to point to the correct interpreter. Update distribution-1.0.dist-info/RECORD with the installed paths. Remove empty distribution-1.0.data directory. Compile any installed .py to .pyc. (Uninstallers should be smart enough to remove .pyc even if it is not mentioned in RECORD.) Recommended installer features. In wheel, scripts are packaged in - .data/scripts/ . If the first line of a file in scripts/ starts with exactly b'#!python' , rewrite to point to the correct interpreter. Unix installers may need to add the +x bit to these files if the archive was created on Windows. The b'#!pythonw' convention is allowed. b'#!pythonw' indicates a GUI script instead of a console script. Generate script wrappers. In wheel, scripts packaged on Unix systems will certainly not have accompanying .exe wrappers. Windows installers may want to add them during install. Recommended archiver features. File Format. File name convention. distribution Distribution name, e.g. 'django', 'pyramid'. version Distribution version, e.g. 1.0. build tag Optional build number. Must start with a digit. Acts as a tie-breaker if two wheel file names are the same in all other respects (i.e. name, version, and other tags). Sort as an empty tuple if unspecified, else sort as a two-item tuple with the first item being the initial digits as an int , and the second item being the remainder of the tag as a str . language implementation and version tag E.g. 'py27', 'py2', 'py3'. abi tag E.g. 'cp33m', 'abi3', 'none'. platform tag E.g. 'linux_x86_64', 'any'. For example, distribution-1.0-1-py27-none-any.whl is the first build of a package called 'distribution', and is compatible with Python 2.7 (any Python 2.7 implementation), with no ABI (pure Python), on any CPU architecture. The last three components of the filename before the extension are called "compatibility tags." The compatibility tags express the package's basic interpreter requirements and are detailed in PEP 425. Escaping and Unicode. Each component of the filename is escaped by replacing runs of non-alphanumeric characters with an underscore _ : The archive filename is Unicode. It will be some time before the tools are updated to support non-ASCII filenames, but they are supported in this specification. The filenames inside the archive are encoded as UTF-8. Although some ZIP clients in common use do not properly display UTF-8 filenames, the encoding is supported by both the ZIP specification and Python's zipfile . File contents. The contents of a wheel file, where is replaced with the name of the package, e.g. beaglevote and is replaced with its version, e.g. 1.0.0 , consist of: / , the root of the archive, contains all files to be installed in purelib or platlib as specified in WHEEL . purelib and platlib are usually both site- packages . - .data/ contains one subdirectory for each non-empty install scheme key not already covered, where the subdirectory name is an index into a dictionary of install paths (e.g. data , scripts , include , purelib , platlib ). Python scripts must appear in scripts and begin with exactly b'#!python' in order to enjoy script wrapper generation and #!python rewriting at install time. They may have any or no extension. - .dist-info/METADATA is Metadata version 1.1 or greater format metadata. - .dist-info/WHEEL is metadata about the archive itself in the same basic key: value format: Wheel-Version is the version number of the Wheel specification. Generator is the name and optionally the version of the software that produced the archive. Root-Is-Purelib is true if the top level directory of the archive should be installed into purelib; otherwise the root should be installed into platlib. Tag is the wheel's expanded compatibility tags; in the example the filename would contain py2.py3-none-any . Build is the build number and is omitted if there is no build number. A wheel installer should warn if Wheel-Version is greater than the version it supports, and must fail if Wheel-Version has a greater major version than the version it supports. Wheel, being an installation format that is intended to work across multiple versions of Python, does not generally include .pyc files. Wheel does not contain setup.py or setup.cfg. This version of the wheel specification is based on the distutils install schemes and does not define how to install files to other locations. The layout offers a superset of the functionality provided by the existing wininst and egg binary formats. The .dist-info directory. Wheel .dist-info directories include at a minimum METADATA, WHEEL, and RECORD. METADATA is the package metadata, the same format as PKG-INFO as found at the root of sdists. WHEEL is the wheel metadata specific to a build of the package. RECORD is a list of (almost) all the files in the wheel and their secure hashes. Unlike PEP 376, every file except RECORD, which cannot contain a hash of itself, must include its hash. The hash algorithm must be sha256 or better; specifically, md5 and sha1 are not permitted, as signed wheel files rely on the strong hashes in RECORD to validate the integrity of the archive. 's INSTALLER and REQUESTED are not included in the archive. RECORD.jws is used for digital signatures. It is not mentioned in RECORD. RECORD.p7s is allowed as a courtesy to anyone who would prefer to use S/MIME signatures to secure their wheel files. It is not mentioned in RECORD. During extraction, wheel installers verify all the hashes in RECORD against the file contents. Apart from RECORD and its signatures, installation will fail if any file in the archive is not both mentioned and correctly hashed in RECORD. The .data directory. Any file that is not normally installed inside site-packages goes into the .data directory, named as the .dist-info directory but with the .data/ extension: The .data directory contains subdirectories with the scripts, headers, documentation and so forth from the distribution. During installation the contents of these subdirectories are moved onto their destination paths. Signed wheel files. Wheel files include an extended RECORD that enables digital signatures. PEP 376's RECORD is altered to include a secure hash digestname=urlsafe_b64encode_nopad(digest) (urlsafe base64 encoding with no trailing = characters) as the second column instead of an md5sum.