Arxiv:2102.07033V1 [Cs.CL] 13 Feb 2021 Swering When It Is Likely to Be Incorrect

Total Page:16

File Type:pdf, Size:1020Kb

Arxiv:2102.07033V1 [Cs.CL] 13 Feb 2021 Swering When It Is Likely to Be Incorrect PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them Patrick Lewisyz Yuxiang Wuz Linqing Liuz Pasquale Minerviniz Heinrich Kuttler¨ y Aleksandra Piktusy Pontus Stenetorpz Sebastian Riedelyz yFacebook AI Research zUniversity College London [email protected] Abstract the whole corpus, and then retrieve-and-read docu- ments in order to answer questions on-the-fly (Chen Open-domain Question Answering models et al., 2017; Lee et al., 2019a, inter alia). which directly leverage question-answer (QA) A second class of models, closed-book question pairs, such as closed-book QA (CBQA) mod- els and QA-pair retrievers, show promise in answering (CBQA) models, have recently been terms of speed and memory compared to con- proposed. They learn to directly map questions to ventional models which retrieve and read from answers from training question-answer (QA) pairs text corpora. QA-pair retrievers also offer in- without access to a background corpus (Roberts terpretable answers, a high degree of control, et al., 2020; Ye et al., 2021). These models usu- and are trivial to update at test time with new ally take the form of pretrained seq2seq models knowledge. However, these models lack the such as T5 (Raffel et al., 2020) or BART (Lewis accuracy of retrieve-and-read systems, as sub- stantially less knowledge is covered by the et al., 2019a), fine-tuned on QA-pairs. It has re- available QA-pairs relative to text corpora like cently been shown that current closed-book models Wikipedia. To facilitate improved QA-pair mostly memorise training QA-pairs, and can strug- models, we introduce Probably Asked Ques- gle to answer questions that do not overlap with tions (PAQ), a very large resource of 65M training data (Lewis et al., 2020b). automatically-generated QA-pairs. We intro- Models which explicitly retrieve (training) QA- duce a new QA-pair retriever, RePAQ, to com- pairs, rather than memorizing them in parameters, plement PAQ. We find that PAQ preempts have been shown to perform competitively with and caches test questions, enabling RePAQ to match the accuracy of recent retrieve-and-read CBQA models (Lewis et al., 2020b; Xiao et al., models, whilst being significantly faster. Us- 2020). These models have a number of useful prop- ing PAQ, we train CBQA models which out- erties, such as fast inference, interpretable outputs perform comparable baselines by 5%, but trail (by inspecting retrieved QA-pairs), and the ability RePAQ by over 15%, indicating the effective- to update the model’s knowledge at test time by ness of explicit retrieval. RePAQ can be con- adding or removing QA-pairs. figured for size (under 500MB) or speed (over However, CBQA and QA-pair retriever models 1K questions per second) whilst retaining high accuracy. Lastly, we demonstrate RePAQ’s are currently not competitive with retrieve-and-read strength at selective QA, abstaining from an- systems in terms of accuracy, largely because the arXiv:2102.07033v1 [cs.CL] 13 Feb 2021 swering when it is likely to be incorrect. This training QA-pairs they operate on cover substan- enables RePAQ to “back-off” to a more expen- tially less knowledge than background corpora like sive state-of-the-art model, leading to a com- Wikipedia. In this paper, we explore whether mas- bined system which is both more accurate and sively expanding the coverage of QA-pairs enables 2x faster than the state-of-the-art model alone. CBQA and QA-pair retriever models which are 1 Introduction competitive with retrieve-and-read models. We present Probably Asked Questions (PAQ), a Open-domain QA (ODQA) systems usually have semi-structured Knowledge Base (KB) of 65M nat- access to a background corpus that can be used to ural language QA-pairs, which models can mem- answer questions. Models which explicitly exploit orise and/or learn to retrieve from. PAQ differs this corpus are commonly referred to as Open-book from traditional KBs in that questions and answers models (Roberts et al., 2020). They typically index are stored in natural language, and that questions are generated such that they are likely to appear in tions: i) introduce PAQ, 65M QA-pairs automati- ODQA datasets. PAQ is automatically constructed cally generated from Wikipedia, and demonstrate using a question generation model and Wikipedia. the importance of global filtering for high quality To ensure generated questions are not only answer- ii) introduce RePAQ, a QA system designed to uti- able given the passage they are generated from, lize PAQ and demonstrate how it can be optimised we employ a global filtering post-processing step for memory, speed or accuracy iii) investigate the employing a state-of-the-art ODQA system. This utility of PAQ for CBQA models, improving by 5% greatly reduces the amount of wrong and ambigu- but note significant headroom to RePAQ iv) demon- ous questions compared other approaches (Fang strate RePAQ’s strength on selective QA, enabling et al., 2020; Alberti et al., 2019), and is critical for us to combine RePAQ with a state-of-the-art QA high-accuracy, downstream QA models. model, making it both more accurate and 2x faster1 To complement PAQ we develop RePAQ, a 2 Open-Domain Question Answering question answering model based on question re- trieval/matching models, using dense Maximum ODQA is the task of answering natural language Inner Product Search-based retrieval, and option- factoid question from an open set of domains. A ally, re-ranking. We show that PAQ and RePAQ typical question might be “when was the last year provide accurate ODQA predictions, at the level astronauts landed on the moon?”, with a target an- of relatively recent large-scale retrieve-and-read swer “1972”. The goal of ODQA is to develop systems such as RAG (Lewis et al., 2020a) on Nat- an answer function m : Q 7! A, where Q and A uralQuestions (Kwiatkowski et al., 2019a) and Triv- respectively are the sets of all possible questions iaQA (Joshi et al., 2017). PAQ instances are anno- and answers. We assume there is a distribution tated with scores that reflect how likely we expect P (q; a) of QA-pairs, defined over Q × A. A good questions to appear, which can be used to control answer function will minimise the expected error the memory footprint of RePAQ by filtering the KB over P (q; a) with respect to some loss function, accordingly. As a result, RePAQ is extremely flexi- such as answer string match. In practice, we do ble, allowing us to configure QA systems with near not have access to P (q; a), and instead rely on an state-of-the-art results, very small memory size, or empirical sample of QA-pairs K drawn from P , inference speeds of over 1,000 questions per sec- and measure the empirical loss of answer functions ond. Memory-optimised configurations of RePAQ on K. Our goal in this work is to implicitly model won two of the four tracks of the 2020 Efficien- P (q; a) so that we can draw a large sample of QA- tQA NeurIPS competition (Min et al., 2020a), with pairs, PAQ, which we can train on and/or retrieve system sizes of 336MB and 29MB, respectively. from. Drawing a sufficiently large sample will over- We also show that PAQ is a useful source of train- lap with K, essentially pre-empting and caching ing data for CBQA models. BART models trained questions that humans may ask at test-time. This on PAQ outperform baselines trained on standard allows us to shift computation from test-time to data by 5%. However, these models struggle to train-time compared to retrieve-and-read methods. effectively memorise all the knowledge in PAQ, lagging behind RePAQ by 15%. This demonstrates 3 Generating Question-Answer Pairs the effectiveness of RePAQ at leveraging PAQ. In this section, we describe the process for generat- Finally, we show that since RePAQ’s question ing PAQ. Given a large background corpus C, our matching score correlates well with QA accuracy, QA-pair generation process consists of the follow- it effectively “knows when it doesn’t know”, allow- ing components: ing for selective question answering (Rodriguez et al., 2019) where QA systems may abstain from 1. A passage selection model ps(c), to identify answering if confidence is too low. Whilst answer passages which humans are likely to ask ques- abstaining is important in its own right, it also en- tions about. ables an elegant “back-off” approach where we can 2. An answer extraction model pa(a j c), for defer to a more accurate but expensive QA system identifying spans in a passage that are more when answer confidence is low. This enables us to likely to be answers to a question. make use of the best of both speed and accuracy. 1The PAQ data, models and code will be made available at In summary, we make the following contribu- https://github.com/facebookresearch/PAQ Figure 1: Top Left: Generation pipeline for QA-pairs in PAQ. Top Right: PAQ used as training data for CBQA models. Bottom Left: RePAQ retrieves similar QA-pairs to input questions from PAQ. Bottom right: RePAQ’s confidence is predictive of accuracy. If confidence is low, we can defer to slower, more accurate systems, like FiD. 3. A question generator model pq(q j a; c) that, ther randomly or using heuristics. We then train a given a passage and an answer, generates a model to minimise negative log-likelihood of posi- question. tive passages relative to negatives. We implement p with RoBERTa (Liu et al., 4. A filtering QA model pf (a j q; C) that gen- s erates an answer for a given question.
Recommended publications
  • Technical Reference Manual for the Standardization of Geographical Names United Nations Group of Experts on Geographical Names
    ST/ESA/STAT/SER.M/87 Department of Economic and Social Affairs Statistics Division Technical reference manual for the standardization of geographical names United Nations Group of Experts on Geographical Names United Nations New York, 2007 The Department of Economic and Social Affairs of the United Nations Secretariat is a vital interface between global policies in the economic, social and environmental spheres and national action. The Department works in three main interlinked areas: (i) it compiles, generates and analyses a wide range of economic, social and environmental data and information on which Member States of the United Nations draw to review common problems and to take stock of policy options; (ii) it facilitates the negotiations of Member States in many intergovernmental bodies on joint courses of action to address ongoing or emerging global challenges; and (iii) it advises interested Governments on the ways and means of translating policy frameworks developed in United Nations conferences and summits into programmes at the country level and, through technical assistance, helps build national capacities. NOTE The designations employed and the presentation of material in the present publication do not imply the expression of any opinion whatsoever on the part of the Secretariat of the United Nations concerning the legal status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The term “country” as used in the text of this publication also refers, as appropriate, to territories or areas. Symbols of United Nations documents are composed of capital letters combined with figures. ST/ESA/STAT/SER.M/87 UNITED NATIONS PUBLICATION Sales No.
    [Show full text]
  • Lossless Data Compression with Transformer
    Under review as a conference paper at ICLR 2020 LOSSLESS DATA COMPRESSION WITH TRANSFORMER Anonymous authors Paper under double-blind review ABSTRACT Transformers have replaced long-short term memory and other recurrent neural networks variants in sequence modeling. It achieves state-of-the-art performance on a wide range of tasks related to natural language processing, including lan- guage modeling, machine translation, and sentence representation. Lossless com- pression is another problem that can benefit from better sequence models. It is closely related to the problem of online learning of language models. But, despite this ressemblance, it is an area where purely neural network based methods have not yet reached the compression ratio of state-of-the-art algorithms. In this paper, we propose a Transformer based lossless compression method that match the best compression ratio for text. Our approach is purely based on neural networks and does not rely on hand-crafted features as other lossless compression algorithms. We also provide a thorough study of the impact of the different components of the Transformer and its training on the compression ratio. 1 INTRODUCTION Lossless compression is a class of compression algorithms that allows for the perfect reconstruc- tion of the original data. In the last decades, statistical methods for lossless compression have been dominated by PAQ-type approaches (Mahoney, 2005). The structure of these approaches is similar to the Prediction by Partial Matching (PPM) of Cleary & Witten (1984) and are composed of two separated parts: a predictor and an entropy encoding. Entropy coding scheme like arithmetic cod- ing (Rissanen & Langdon, 1979) are optimal and most of the compression gains are coming from improving the predictor.
    [Show full text]
  • Chapter 2 HISTORY and DEVELOPMENT of MILITARY LASERS
    History and Development of Military Lasers Chapter 2 HISTORY AND DEVELOPMENT OF MILITARY LASERS JACK B. KELLER, JR* INTRODUCTION INVENTING THE LASER MILITARIZING THE LASER SEARCHING FOR HIGH-ENERGY LASER WEAPONS SEARCHING FOR LOW-ENERGY LASER WEAPONS RETURNING TO HIGHER ENERGIES SUMMARY *Lieutenant Colonel, US Army (Retired); formerly, Foreign Science Information Officer, US Army Medical Research Detachment-Walter Reed Army Institute of Research, 7965 Dave Erwin Drive, Brooks City-Base, Texas 78235 25 Biomedical Implications of Military Laser Exposure INTRODUCTION This chapter will examine the history of the laser, Military advantage is greatest when details are con- from theory to demonstration, for its impact upon the US cealed from real or potential adversaries (eg, through military. In the field of military science, there was early classification). Classification can remain in place long recognition that lasers can be visually and cutaneously after a program is aborted, if warranted to conceal hazardous to military personnel—hazards documented technological details or pathways not obvious or easily in detail elsewhere in this volume—and that such hazards deduced but that may be relevant to future develop- must be mitigated to ensure military personnel safety ments. Thus, many details regarding developmental and mission success. At odds with this recognition was military laser systems cannot be made public; their the desire to harness the laser’s potential application to a descriptions here are necessarily vague. wide spectrum of military tasks. This chapter focuses on Once fielded, system details usually, but not always, the history and development of laser systems that, when become public. Laser systems identified here represent used, necessitate highly specialized biomedical research various evolutionary states of the art in laser technol- as described throughout this volume.
    [Show full text]
  • A Structural Analysis of Mide Chants
    A Structural Analysis of Mide Chants GEORGE FULFORD McMaster University Introduction In this paper I shall investigate the relationship between words and im­ agery in seven song scrolls used by members of an Ojibwa religious society known as the Midewiwin. These texts were collected in the late 1880s by W.J. Hoffman for the Bureau of American Ethnology and subsequently published in their seventh Annual Report (Hoffman 1891).1 All the pictographs which I shall discuss were inscribed on birch bark and used by members of the Midewiwin to record chants used in their ceremonies. According to Hoffman (1891:192) these chants consisted of only a few words or short phrases. They were sung by single individuals — never in chorus — and were repeated over and over again, usually to the accompaniment of a wooden kettle drum. In a previous study (Fulford 1989) I analyzed patterns of structural variation among these pictographs and outlined how three complex symbols — the otter, bear and bird — evolved from clan emblems into pictographic markers. The focus of my earlier study was purely iconographic; in this paper I shall explore the verbal structure of Midewiwin chants in order to show some of the ways in which they were pictographically encoded. For the sake of convenience, I have limited my discussion to song scrolls sharing the otter symbol. Six of the seven scrolls that I shall examine contain this marking device. Although one (designated Scroll C in the appendix) lacks an otter, it displays many other formal similarities with Hoffman published transcriptions and translations of 23 songs performed at the White Earth Reservation in northwestern Minnesota.
    [Show full text]
  • 5892 Cisco Category: Standards Track August 2010 ISSN: 2070-1721
    Internet Engineering Task Force (IETF) P. Faltstrom, Ed. Request for Comments: 5892 Cisco Category: Standards Track August 2010 ISSN: 2070-1721 The Unicode Code Points and Internationalized Domain Names for Applications (IDNA) Abstract This document specifies rules for deciding whether a code point, considered in isolation or in context, is a candidate for inclusion in an Internationalized Domain Name (IDN). It is part of the specification of Internationalizing Domain Names in Applications 2008 (IDNA2008). Status of This Memo This is an Internet Standards Track document. This document is a product of the Internet Engineering Task Force (IETF). It represents the consensus of the IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Further information on Internet Standards is available in Section 2 of RFC 5741. Information about the current status of this document, any errata, and how to provide feedback on it may be obtained at http://www.rfc-editor.org/info/rfc5892. Copyright Notice Copyright (c) 2010 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.
    [Show full text]
  • Etir Code Lists
    eTIR Code Lists Code lists CL01 Equipment size and type description code (UN/EDIFACT 8155) Code specifying the size and type of equipment. 1 Dime coated tank A tank coated with dime. 2 Epoxy coated tank A tank coated with epoxy. 6 Pressurized tank A tank capable of holding pressurized goods. 7 Refrigerated tank A tank capable of keeping goods refrigerated. 9 Stainless steel tank A tank made of stainless steel. 10 Nonworking reefer container 40 ft A 40 foot refrigerated container that is not actively controlling temperature of the product. 12 Europallet 80 x 120 cm. 13 Scandinavian pallet 100 x 120 cm. 14 Trailer Non self-propelled vehicle designed for the carriage of cargo so that it can be towed by a motor vehicle. 15 Nonworking reefer container 20 ft A 20 foot refrigerated container that is not actively controlling temperature of the product. 16 Exchangeable pallet Standard pallet exchangeable following international convention. 17 Semi-trailer Non self propelled vehicle without front wheels designed for the carriage of cargo and provided with a kingpin. 18 Tank container 20 feet A tank container with a length of 20 feet. 19 Tank container 30 feet A tank container with a length of 30 feet. 20 Tank container 40 feet A tank container with a length of 40 feet. 21 Container IC 20 feet A container owned by InterContainer, a European railway subsidiary, with a length of 20 feet. 22 Container IC 30 feet A container owned by InterContainer, a European railway subsidiary, with a length of 30 feet. 23 Container IC 40 feet A container owned by InterContainer, a European railway subsidiary, with a length of 40 feet.
    [Show full text]
  • Mcgraw-Hill New York Chicago San Francisco Lisbon London Madrid Mexico City Milan New Delhi San Juan Seoul Singapore Sydney Toronto Mcgraw-Hill Abc
    Y L F M A E T Team-Fly® Streaming Media Demystified Michael Topic McGraw-Hill New York Chicago San Francisco Lisbon London Madrid Mexico City Milan New Delhi San Juan Seoul Singapore Sydney Toronto McGraw-Hill abc Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved. Manufactured in the United States of America. Except as permitted under the United States Copyright Act of 1976, no part of this publication may be reproduced or distrib- uted in any form or by any means, or stored in a database or retrieval system, without the prior written permission of the publisher. 0-07-140962-9 The material in this eBook also appears in the print version of this title: 0-07-138877-X. All trademarks are trademarks of their respective owners. Rather than put a trademark symbol after every occurrence of a trademarked name, we use names in an editorial fashion only, and to the benefit of the trademark owner, with no intention of infringement of the trademark. Where such designations appear in this book, they have been printed with initial caps. McGraw-Hill eBooks are available at special quantity discounts to use as premiums and sales promotions, or for use in cor- porate training programs. For more information, please contact George Hoare, Special Sales, at george_hoare@mcgraw- hill.com or (212) 904-4069. TERMS OF USE This is a copyrighted work and The McGraw-Hill Companies, Inc. (“McGraw-Hill”) and its licensors reserve all rights in and to the work. Use of this work is subject to these terms.
    [Show full text]
  • Adaptive Weighing of Context Models for Lossless Data Compression
    Adaptive Weighing of Context Models for Lossless Data Compression Matthew V. Mahoney Florida Institute of Technology CS Dept. 150 W. University Blvd. Melbourne FL 32901 [email protected] Technical Report CS-2005-16 Abstract optimal codes (within one bit) are known and can be Until recently the state of the art in lossless data generated efficiently, for example Huffman codes compression was prediction by partial match (PPM). A (Huffman 1952) and arithmetic codes (Howard and Vitter PPM model estimates the next-symbol probability 1992). distribution by combining statistics from the longest matching contiguous contexts in which each symbol value is found. We introduce a context mixing model which 1.1. Text Compression and Natural Language improves on PPM by allowing contexts which are arbitrary Processing functions of the history. Each model independently An important subproblem of machine learning is natural estimates a probability and confidence that the next bit of language processing. Humans apply complex language data will be 0 or 1. Predictions are combined by weighted rules and vast real-world knowledge to implicitly model averaging. After a bit is arithmetic coded, the weights are adjusted along the cost gradient in weight space to favor the natural language. For example, most English speaking most accurate models. Context mixing compressors, as people will recognize that p(recognize speech) > p(reckon implemented by the open source PAQ project, are now top eyes peach). Unfortunately we do not know any algorithm ranked on several independent benchmarks. that estimates these probabilities as accurately as a human. Shannon (1950) estimated that the entropy or information content of written English is about one bit per character 1.
    [Show full text]
  • A Machine Learning Perspective on Predictive Coding with PAQ
    A Machine Learning Perspective on Predictive Coding with PAQ Byron Knoll & Nando de Freitas University of British Columbia Vancouver, Canada fknoll,[email protected] August 17, 2011 Abstract PAQ8 is an open source lossless data compression algorithm that currently achieves the best compression rates on many benchmarks. This report presents a detailed description of PAQ8 from a statistical machine learning perspective. It shows that it is possible to understand some of the modules of PAQ8 and use this understanding to improve the method. However, intuitive statistical explanations of the behavior of other modules remain elusive. We hope the description in this report will be a starting point for discussions that will increase our understanding, lead to improvements to PAQ8, and facilitate a transfer of knowledge from PAQ8 to other machine learning methods, such a recurrent neural networks and stochastic memoizers. Finally, the report presents a broad range of new applications of PAQ to machine learning tasks including language modeling and adaptive text prediction, adaptive game playing, classification, and compression using features from the field of deep learning. 1 Introduction Detecting temporal patterns and predicting into the future is a fundamental problem in machine learning. It has gained great interest recently in the areas of nonparametric Bayesian statistics (Wood et al., 2009) and deep learning (Sutskever et al., 2011), with applications to several domains including language modeling and unsupervised learning of audio and video sequences. Some re- searchers have argued that sequence prediction is key to understanding human intelligence (Hawkins and Blakeslee, 2005). The close connections between sequence prediction and data compression are perhaps under- arXiv:1108.3298v1 [cs.LG] 16 Aug 2011 appreciated within the machine learning community.
    [Show full text]
  • Management of Stroke in Neonates and Children: a Scientific Statement from the American Heart Association/American Stroke Association
    UCSF UC San Francisco Previously Published Works Title Management of Stroke in Neonates and Children: A Scientific Statement From the American Heart Association/American Stroke Association. Permalink https://escholarship.org/uc/item/0sw5j7c1 Journal Stroke, 50(3) ISSN 0039-2499 Authors Ferriero, Donna M Fullerton, Heather J Bernard, Timothy J et al. Publication Date 2019-03-01 DOI 10.1161/str.0000000000000183 Peer reviewed eScholarship.org Powered by the California Digital Library University of California AHA/ASA Scientific Statement Management of Stroke in Neonates and Children A Scientific Statement From the American Heart Association/American Stroke Association The American Academy of Neurology affirms the value of this statement as an educational tool for neurologists. Donna M. Ferriero, MD, MS, FAHA, Co-Chair; Heather J. Fullerton, MD, MAS, Co-Chair; Timothy J. Bernard, MD, MSCS; Lori Billinghurst, MD, MSc, FRCPC; Stephen R. Daniels, MD, PhD; Michael R. DeBaun, MD, MPH; Gabrielle deVeber, MD; Rebecca N. Ichord, MD; Lori C. Jordan, MD, PhD, FAHA; Patricia Massicotte, MSc, MD, MHSc; Jennifer Meldau, MSN; E. Steve Roach, MD, FAHA; Edward R. Smith, MD; on behalf of the American Heart Association Stroke Council and Council on Cardiovascular and Stroke Nursing Purpose—Much has transpired since the last scientific statement on pediatric stroke was published 10 years ago. Although stroke has long been recognized as an adult health problem causing substantial morbidity and mortality, it is also an important cause of acquired brain injury in young patients, occurring most commonly in the neonate and throughout childhood. This scientific statement represents a synthesis of data and a consensus of the leading experts in childhood cardiovascular disease and stroke.
    [Show full text]
  • Arxiv:2011.13544V1 [Cs.CV] 27 Nov 2020
    Patch-VQ: ‘Patching Up’ the Video Quality Problem Zhenqiang Ying1*, Maniratnam Mandal1*, Deepti Ghadiyaram2†, Alan Bovik1† 1University of Texas at Austin, 2Facebook AI fzqying, [email protected], [email protected], [email protected] Abstract No-reference (NR) perceptual video quality assessment (VQA) is a complex, unsolved, and important problem to social and streaming media applications. Efficient and ac- curate video quality predictors are needed to monitor and guide the processing of billions of shared, often imper- fect, user-generated content (UGC). Unfortunately, current NR models are limited in their prediction capabilities on real-world, “in-the-wild” UGC video data. To advance progress on this problem, we created the largest (by far) subjective video quality dataset, containing 39; 000 real- world distorted videos and 117; 000 space-time localized video patches (‘v-patches’), and 5:5M human perceptual Fig. 1: Modeling local to global perceptual quality: From each video, we ex- tract three spatio-temporal video patches (Sec. 3.1), which along with their subjective quality annotations. Using this, we created two unique scores, are fed to the proposed video quality model. By integrating spatial (2D) and NR-VQA models: (a) a local-to-global region-based NR spatio-temporal (3D) quality-sensitive features, our model learns spatial and temporal distortions, and can robustly predict both global and local quality, a temporal quality VQA architecture (called PVQ) that learns to predict global series, as well as space-time quality maps (Sec. 5.2). Best viewed in color. video quality and achieves state-of-the-art performance on 3 UGC datasets, and (b) a first-of-a-kind space-time video transform the processing and interpretation of videos on quality mapping engine (called PVQ Mapper) that helps smartphones, social media, telemedicine, surveillance, and localize and visualize perceptual distortions in space and vision-guided robotics, in ways that FR models are un- time.
    [Show full text]
  • K191201 Trade/Device Name
    November 15, 2019 Hiossen, Inc. Peter Lee QA/RA Manager 85 Ben Fairless Drive Fairless Hills, Pennsylvania 19030 Re: K191201 Trade/Device Name: EM SA Implant System Regulation Number: 21 CFR 872.3640 Regulation Name: Endosseous Dental Implant Regulatory Class: Class II Product Code: DZE Dated: November 14, 2019 Received: November 15, 2019 Dear Peter Lee: We have reviewed your Section 510(k) premarket notification of intent to market the device referenced above and have determined the device is substantially equivalent (for the indications for use stated in the enclosure) to legally marketed predicate devices marketed in interstate commerce prior to May 28, 1976, the enactment date of the Medical Device Amendments, or to devices that have been reclassified in accordance with the provisions of the Federal Food, Drug, and Cosmetic Act (Act) that do not require approval of a premarket approval application (PMA). You may, therefore, market the device, subject to the general controls provisions of the Act. Although this letter refers to your product as a device, please be aware that some cleared products may instead be combination products. The 510(k) Premarket Notification Database located at https://www.accessdata.fda.gov/scripts/cdrh/cfdocs/cfpmn/pmn.cfm identifies combination product submissions. The general controls provisions of the Act include requirements for annual registration, listing of devices, good manufacturing practice, labeling, and prohibitions against misbranding and adulteration. Please note: CDRH does not evaluate information related to contract liability warranties. We remind you, however, that device labeling must be truthful and not misleading. If your device is classified (see above) into either class II (Special Controls) or class III (PMA), it may be subject to additional controls.
    [Show full text]