Database Agnostic ETL Tool: an Economical Solution to Store and Process Data
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  Oracle's Zero Data Loss Recovery ApplianceGmbH Enterprise Servers, Storage and Business Continuity White Paper Oracle’s Zero Data Loss Recovery Appliance – and Comparison with EMC Data Domain Josh Krischer December 2015 _____________________________________________________________________________ 2015 © Josh Krischer & Associates GmbH. All rights reserved. Reproduction of this publication in any form without prior written permission is forbidden. The information contained herein has been obtained from sources believed to be reliable. Josh Krischer & Associates GmbH disclaims all warranties as to the accuracy, completeness or adequacy of such information. Josh Krischer & Associates GmbH shall have no liability for errors, omissions or inadequacies in the information contained herein or for interpretations thereof. The reader assumes sole responsibility for the selection of these materials to achieve its intended results. The opinions expressed herein are subject to change without notice. All product names used and mentioned d herein are the trademarks of their respective owners. GmbH Enterprise Servers, Storage and Business Continuity Contents Executive Summary ...................................................................................................................... 3 Oracle’s Zero Data Loss Recovery Appliance ............................................................................... 4 Oracle’s Zero Data Loss Recovery Appliance – Principles of Operation ....................................... 5 Oracle’s Maximum Availability Architecture (MAA) ......................................................................
- 
												  CERIAS Tech Report 2017-5 Deceptive Memory Systems by Christopher NCERIAS Tech Report 2017-5 Deceptive Memory Systems by Christopher N. Gutierrez Center for Education and Research Information Assurance and Security Purdue University, West Lafayette, IN 47907-2086 DECEPTIVE MEMORY SYSTEMS ADissertation Submitted to the Faculty of Purdue University by Christopher N. Gutierrez In Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy December 2017 Purdue University West Lafayette, Indiana ii THE PURDUE UNIVERSITY GRADUATE SCHOOL STATEMENT OF DISSERTATION APPROVAL Dr. Eugene H. Spa↵ord, Co-Chair Department of Computer Science Dr. Saurabh Bagchi, Co-Chair Department of Computer Science Dr. Dongyan Xu Department of Computer Science Dr. Mathias Payer Department of Computer Science Approved by: Dr. Voicu Popescu by Dr. William J. Gorman Head of the Graduate Program iii This work is dedicated to my wife, Gina. Thank you for all of your love and support. The moon awaits us. iv ACKNOWLEDGMENTS Iwould liketothank ProfessorsEugeneSpa↵ord and SaurabhBagchi for their guidance, support, and advice throughout my time at Purdue. Both have been instru mental in my development as a computer scientist, and I am forever grateful. I would also like to thank the Center for Education and Research in Information Assurance and Security (CERIAS) for fostering a multidisciplinary security culture in which I had the privilege to be part of. Special thanks to Adam Hammer and Ronald Cas tongia for their technical support and Thomas Yurek for his programming assistance for the experimental evaluation. I am grateful for the valuable feedback provided by the members of my thesis committee, Professor Dongyen Xu, and Professor Math ias Payer.
- 
												  Legality and Ethics of Web ScrapingMurray State's Digital Commons Faculty & Staff Research and Creative Activity 12-15-2020 Tutorial: Legality and Ethics of Web Scraping Vlad Krotov Leigh Johnson Murray State University Leiser Silva University of Houston Follow this and additional works at: https://digitalcommons.murraystate.edu/faculty Recommended Citation Krotov, V., Johnson, L., & Silva, L. (2020). Tutorial: Legality and Ethics of Web Scraping. Communications of the Association for Information Systems, 47, pp-pp. https://doi.org/10.17705/1CAIS.04724 This Peer Reviewed/Refereed Publication is brought to you for free and open access by Murray State's Digital Commons. It has been accepted for inclusion in Faculty & Staff Research and Creative Activity by an authorized administrator of Murray State's Digital Commons. For more information, please contact [email protected]. See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/343555462 Legality and Ethics of Web Scraping, Communications of the Association for Information Systems (forthcoming) Article in Communications of the Association for Information Systems · August 2020 CITATIONS READS 0 388 3 authors, including: Vlad Krotov Murray State University 42 PUBLICATIONS 374 CITATIONS SEE PROFILE Some of the authors of this publication are also working on these related projects: Addressing barriers to big data View project Web Scraping Framework: An Integrated Approach to Retrieving Big Qualitative Data from the Web View project All content following this
- 
												  Next Generation Disaster Data InfrastructureNext Generation Disaster Data Infrastructure A study report of the CODATA Task Group on Linked Open Data for Global Disaster Risk Research Supported by Study Panel Lead Authors in alphabetical order Bapon Fakhruddin, Tonkin & Taylor International, New Zealand Edward Chu, CODATA Task Group on Linked Open Data for Global Disaster Risk Research Guoqing Li, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, China Contributing Authors in alphabetical order Amit Paul, BNA Technology Consulting LTD., India David Wyllie, Tonkin & Taylor International, New Zealand Durga Ragupathy, Tonkin & Taylor International, New Zealand Jibo Xie, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, China Kun-Lin Lu, CODATA Task Group on Linked Open Data for Global Disaster Risk Research Kunal Sharma, National Disaster Management Authority, India I-Ching Chen, CODATA Task Group on Linked Open Data for Global Disaster Risk Research Nuha Eltinay, Arab Urban Development Institute, Kingdom of Saudi Arabia Saurabh Dalal, National Disaster Management Authority, India Raj Prasanna, Joint Centre for Disaster Research, Massey University Richard Reinen-Hamill, Tonkin & Taylor International, New Zealand Tengfei Yang, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, China Vimikthi Jayawardene, The University of Queensland, Australia Acknowledgments This work is supported by the Committee on Data for Science and Technology (CODATA) of International Science Council (ISC), Integrated Research on Disaster Risk (IRDR); National Key R&D Program of China (2016YFE0122600) and Tonkin & Taylor International Ltd, New Zealand. LODGD (2019), Next Generation Disaster Data Infrastructure -White Paper. CODATA Task Group, Linked Open Data for Global Disaster Risk Research (LODGD).
- 
												  PROTECTING DATA from RANSOMWARE and OTHER DATA LOSS EVENTS a Guide for Managed Service Providers to Conduct, Maintain and Test Backup FilesPROTECTING DATA FROM RANSOMWARE AND OTHER DATA LOSS EVENTS A Guide for Managed Service Providers to Conduct, Maintain and Test Backup Files OVERVIEW The National Cybersecurity Center of Excellence (NCCoE) at the National Institute of Standards and Technology (NIST) developed this publication to help managed service providers (MSPs) improve their cybersecurity and the cybersecurity of their customers. MSPs have become an attractive target for cyber criminals. When an MSP is vulnerable its customers are vulnerable as well. Often, attacks take the form of ransomware. Data loss incidents—whether a ransomware attack, hardware failure, or accidental or intentional data destruction—can have catastrophic effects on MSPs and their customers. This document provides recommend- ations to help MSPs conduct, maintain, and test backup files in order to reduce the impact of these data loss incidents. A backup file is a copy of files and programs made to facilitate recovery. The recommendations support practical, effective, and efficient back-up plans that address the NIST Cybersecurity Framework Subcategory PR.IP-4: Backups of information are conducted, maintained, and tested. An organization does not need to adopt all of the recommendations, only those applicable to its unique needs. This document provides a broad set of recommendations to help an MSP determine: • items to consider when planning backups and buying a backup service/product • issues to consider to maximize the chance that the backup files are useful and available when needed • issues to consider regarding business disaster recovery CHALLENGE APPROACH Backup systems implemented and not tested or NIST Interagency Report 7621 Rev. 1, Small Business planned increase operational risk for MSPs.
- 
												  Ways to Avoid a Data-Storage DisasterTOOLBOX WAYS TO AVOID A DATA-STORAGE DISASTER Hard-drive failures are inevitable, but data loss doesn’t have to be. ILLUSTRATION BY THE PROJECT TWINS THE PROJECT BY ILLUSTRATION BY JEFFREY M. PERKEL was to “clean up the data and get organized”, advantages, and scientists must discover what she says. Instead, she deleted her entire pro- works best for them on the basis of the nature racy Teal was a graduate student when ject. And unlike the safety net offered by the and volume of their data, storage-resource she executed what should have been a Windows and Macintosh operating-system availability and concerns about data privacy. routine command in her Unix termi- trash cans, there’s no way of recovering from In Teal’s case, automation saved the day. Tnal: rm −rf *. That command instructs the executing rm. Unless you have a backup. The server on which she was working was computer to delete everything in the current In the digital world, backing up data is regularly backed up to tape, and the “very directory recursively, including all subdirec- essential, whether those data are smart- friendly and helpful IT folks” on her depart- tories. There was just one problem — she was phone selfies or massive genome-sequencing ment’s life-sciences computing helpdesk were in the wrong directory. data sets. Storage media are fragile, and they able to recover her files. But the situation was At the time, Teal was studying computa- inevitably fail — or are lost, stolen or damaged. particularly embarrassing, she says, because tional linguistics as part of a master’s degree Backup options range from USB memory Teal — who is now executive director at The in biology at the University of California, sticks and cloud-based data-storage services Carpentries, a non-profit organization in San Los Angeles.
- 
												  With NILFS (New Implementation of the Log-Structured File System) DrR&D Spirits Giving Value to “Past Information” with NILFS (New Implementation of the Log-structured File System) Dr. Satoshi Moriai Senior Research Engineer, Supervisor Group Leader, Kernel Group Open Source Software Computing Project NTT Cyber Space Laboratories Open source software (OSS), which can be freely used, redistributed, and distributed as source code, has become popular as an effective means of reducing the cost of software devel- opment, system introduction, and total cost of ownership (TCO). The Kernel Group in NTT Cyber Space Laboratories has been researching and developing a new file system called NILFS that makes a hard disk into a type of “time machine”. We asked Dr. Satoshi Moriai, the leader of the Kernel Group, to explain this fascinating file system to us. Creating a landmark file system that can level as NILFS (new implementation of the log-struc- recover past data tured file system), as shown in Fig. 1. The principle of NILFS can be understood by imag- —Dr. Moriai, please outline your current R&D theme ining that the virtual space of the file system consists for us. of semitransparent films (Fig. 2). When it is record- ing data, the system covers previously recorded data In general, we are researching operating systems with a film and marks that film only with data that dif- with the goal of constructing high-reliability systems. fers from past data. Consequently, while only the lat- We are particularly focused on why data loss in com- est differences are marked on the top film, the entire puter systems happens and how we can prevent it.
- 
												  Ransomware Attack: What’S Your Data Recovery Plan?RANSOMWARE ATTACK: WHAT’S YOUR DATA RECOVERY PLAN? Learn more about how KeepItSafe can help to reduce costs, save time, and provide compliance for online 888 965 9988 backup, disaster recovery-as-a-Service, mobile data www.keepitsafe.com protection, and cloud SaaS backup — contact us today. [email protected] Understanding and Preventing Ransomware Attacks TABLE OF CONTENTS What is Ransomware and Why is it On the Rise? pg. 03 The Most Common Types of Ransomware pg. 04 How Ransomware Spreads pg. 06 What You Can Do: 10 Steps pg. 08 to Defend Against Ransomware How KeepItSafe® Helps Combat Ransomware pg. 11 2 www.keepitsafe.com Understanding and Preventing Ransomware Attacks WHAT IS RANSOMWARE AND WHY IS IT ON THE RISE? New Ransomware 1,400,000 1,200,000 1,000,000 800,000 600,000 400,000 200,000 0 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 2013 2014 2015 McAfee Labs Threats Report, August 2015 Imagine sitting down at your office computer, logging in to your corporate network, and being greeted by the following onscreen message: “We have locked you out of access to all of your company’s systems, files and other data. To have access restored, please deposit $100,000 in the following bitcoin account.” This is “ransomware,” one of the most prevalent forms of malicious cyber attacks facing businesses today. With a ransomware attack, a cyber hacker infects your network or device with malicious software, usually through code attached to an email or contained within seemingly legitimate software you download from a website. Once the malicious software propagates through your systems, the hacker can then encrypt your data — and contact you with an offer to pay a ransom to get the data back.
- 
												  Zero Data Loss Recovery Appliance X8 / X8MORACLE DATA SHEET Zero Data Loss Recovery Appliance X8 / X8M State-of-the-Art Oracle Data Protection Today’s storage solutions for protecting business data do not easily meet the needs of mission critical enterprise databases. They can lose up to a day of business data on every restore, place a heavy load on production servers during backup, do not inherently validate database level recoverability, and cannot scale to meet the needs of ever expanding databases. These challenges are largely due to their fundamental treatment of databases as a set of disjoint files to copy, not as transactional systems with specific integrity and performance requirements. Oracle’s Zero Data Loss Recovery Appliance (Recovery Appliance) is a ground- breaking data protection solution that tightly integrates with the Oracle Database to address these requirements head-on. It eliminates data loss and dramatically reduces data protection overhead on production servers. In addition, the Recovery Appliance continually validates the integrity and recoverability of the data, scales to protect thousands of databases, and protects backups across the full lifecycle, including disk backup, cloud archiving, remote replication and tape archiving. K EY FEATURES Today’s Database Data Protection Problems • Real-Time Redo Transport The fundamental problems with today’s database protection solutions stem from: • End-to-End Data Validation • Incremental-Forever Backup Strategy • Backup and recovery methods that are based on a decades-old nightly backup paradigm, where up to a
- 
												  Eliminate Data Loss for All of YourWHITE PAPER Extreme Protection That Eliminates Data Loss for All of Your Oracle Databases Oracle data protection redefined: Oracle’s Zero Data Loss Recovery Appliance eliminates backup overhead and recovers databases to any point in time. It would not be far-fetched to say that These drivers and the dynamics of data protection today also data is the organization today. Just consider raise compelling questions. For example, can IT effectively what happens when a major system failure spawns a back up database environments within ever-shrinking backup protracted recovery period. Business largely grinds to a halt— windows? How can IT ensure that recoveries can be performed that’s what. seamlessly with as little data loss as possible? And as production cycles lengthen, how can IT handle the mounting Clearly the ability to quickly and completely back up and pressure of having to back up increasingly larger data sets. recover data in the event of a system failure or human error is among the most mission-critical requirements—for both IT and Data protection solutions today consist largely of storage- the organization. These days three prominent trends drive the based mirroring and backup-and-recovery technologies, but imperative for an overhaul of data protection technologies and there continues to be significant risk with these solutions, processes. They are especially when protecting databases. Storage mirroring can • Continued exponential growth of business data potentially lead to data corruption, whereas backup-and- • Demand for high levels of data integrity and data loss recovery technologies can lead to data loss when changes protection made since the most recent backup are either not captured • Zero tolerance of downtime for data backup in today’s or not documented.
- 
												  Data Protection in Smartphones Against Physical ThreatsSurvey: Data Protection in Smartphones Against Physical Threats Ildar Muslukhov The University of British Columbia Department of Electrical and Computer Engineering 2332 Main Mall,Vancouver, Canada [email protected] ABSTRACT the middle of 2011. For example, in some of the US cities In this paper we present a survey of the data protection sys- proportion of the population, which had experienced loss or tems in the presence of physical threats, such as theft, loss, theft of their mobile devices was up to 52% [6]. Finally, damage and malicious use. Recent studies suggests that Symantec reports that in 96% of cases when a smartphone the aforementioned threats are real and happen with very is lost a stranger that finds it will try to access sensitive high rates, e.g., it has been reported that more than 50% of information, such as social networking applications, emails, Maimi city had experience with theft and loss of their device pictures, passwords, and banking applications [13]. in 2011. In this paper we also present adversarial model and users' requirements to data protection systems which were One way to explain this it the small size of the modern based on the result from the interview and survey studies smartphones, that might make it easier to lose or to steal we conducted. Analysis of the existing systems in the light it. For instance, it was reported that 31,544 smartphones of the aforementioned adversarial model and users' require- have been left in taxi cabs of New York city during the ments revealed that existing system (a) do not consider users first six months of 2010 [15].
- 
												  Data Recovery from Virtual Machine Data LossData recovery from virtual machine data loss April 2020 Introduction: Virtual machine data loss Virtualisation technology dominates the enterprise landscape. According to Gartner, most firms report 75% or higher virtualisation. Improvements in hypervisors have reduced the complexity of setting up and maintaining physical servers, greatly improved server utilisation, and increased IT flexibility and responsiveness to the needs of the business. It’s no wonder that the bulk of modern IT systems are virtualised. But, whether you use VMware, Hyper-V, Citrix, Oracle or any of the other hypervisors, there is a potential downside to virtualisation. In order to transform a physical server into many virtual machines (VMs), an additional software layer is added. While simplifying the admin user experience, virtualisation raises the overall complexity of the IT environment as the underlying hardware is obfuscated, making it more difficult for admins to know which physical system their VMs are running on or which storage is used for a particular machine in the event of data loss. With fewer people to maintain and monitor a larger number of virtual machines (compared to physical servers), there are greater chances for problems and data loss. Data gathered across the globe by Ontrack Data Recovery reveals that several causes of data loss incidents for virtualised environments. The leading reasons for virtual machine data loss are user error, ransomware, hardware failures and RAID corruption. The purpose of this paper is to lay out what leads to virtual data loss and explain how global data recovery providers are able to resolve a high percentage of even the most challenging data loss situations in virtualised environments.