Information Retrieval in Document Spaces Using Clustering
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Survey of Hierarchical Clustering Algorithms the Journal Of
M. Kuchaki Rafsanjani, Z. Asghari Varzaneh, N. Emami Chukanlo / TJMCS Vol .5 No.3 (2012) 229-240 The Journal of Mathematics and Computer Science Available online at http://www.TJMCS.com The Journal of Mathematics and Computer Science Vol .5 No.3 (2012) 229-240 A survey of hierarchical clustering algorithms Marjan Kuchaki Rafsanjani Department of Computer Science, Shahid Bahonar University of Kerman, Kerman, Iran [email protected] Zahra Asghari Varzaneh Department of Computer Science, Shahid Bahonar University of Kerman, Kerman, Iran [email protected] Nasibeh Emami Chukanlo Department of Computer Science, Shahid Bahonar University of Kerman, Kerman, Iran [email protected] Received: November 2012, Revised: December 2012 Online Publication: December 2012 Abstract Clustering algorithms classify data points into meaningful groups based on their similarity to exploit useful information from data points. They can be divided into categories: Sequential algorithms, Hierarchical clustering algorithms, Clustering algorithms based on cost function optimization and others. In this paper, we discuss some hierarchical clustering algorithms and their attributes, and then compare them with each other. Keywords: Clustering; Hierarchical clustering algorithms; Complexity. 2010 Mathematics Subject Classification: Primary 91C20; Secondary 62D05. 1. Introduction. Clustering is the unsupervised classification of data into groups/clusters [1]. It is widely used in biological and medical applications, computer vision, robotics, geographical data, and so on [2]. To date, many clustering algorithms have been developed. They can organize a data set into a number of groups /clusters. Clustering algorithms may be divided into the following major categories: Sequential algorithms, Hierarchical clustering algorithms, Clustering algorithms based on cost function optimization and etc [3]. -
The CURE for Class Imbalance
The CURE for Class Imbalance B Colin Bellinger1( ), Paula Branco2, and Luis Torgo2 1 National Research Council of Canada, Ottawa, Canada [email protected] 2 Dalhousie University, Halifax, Canada {pbranco,ltorgo}@dal.ca Abstract. Addressing the class imbalance problem is critical for sev- eral real world applications. The application of pre-processing methods is a popular way of dealing with this problem. These solutions increase the rare class examples and/or decrease the normal class cases. However, these procedures typically only take into account the characteristics of each individual class. This segmented view of the data can have a nega- tive impact. We propose a new method that uses an integrated view of the data classes to generate new examples and remove cases. ClUstered REsampling (CURE) is a method based on a holistic view of the data that uses hierarchical clustering and a new distance measure to guide the sampling procedure. Clusters generated in this way take into account the structure of the data. This enables CURE to avoid common mistakes made by other resampling methods. In particular, CURE prevents the generation of synthetic examples in dangerous regions and undersamples safe, non-borderline, regions of the majority class. We show the effec- tiveness of CURE in an extensive set of experiments with benchmark domains. We also show that CURE is a user-friendly method that does not require extensive fine-tuning of hyper-parameters. Keywords: Imbalanced domains · Resampling · Clustering 1 Introduction Class imbalance is a problem encountered in a wide variety of important clas- sification tasks including oil spill, fraud detection, action recognition, text clas- sification, radiation monitoring and wildfire prediction [4,17,21,22,24,27]. -
Seminarvortrag 17.4.2019 Zu Sonnenflecken
Early telescopic observations of sunspots by Simon Marius and other (1610-1620) Ralph Neuhäuser Astrophysikalisches Institut und Universitäts-Sternwarte www.astro.uni-jena.de FSU Jena 400 years telescopic sunspots. Schwabe cycle 10.4 ± 1.2 yr (since 1750) Schwabe cycle and butterfly diagram Sonnenflecken-Relativzahl (Rudolf Wolf 1816-1893): Rz = k x (10 x g + n) Anzahl der Einzelflecken n, Anzahl der Fleckengruppen g, individueller Gütefaktor des jeweiligen Beobachters k Hoyt & Schatten (1998): Sonnenfleckengruppenzahl RG = (12.08 / N) x Si (ki' x Gi) individueller Korrekturfaktor ki' des i-ten Beobachters Gruppenzahl Gi am betreffenden Tag, N ist die Anzahl der Beobachter des entsprechenden Tages. oder Fleckenfläche statt Fleckenanzahl Active day fraction f = (aktive Tage) / (aktive + inaktive Tage) In 17th century, all sources have to be checked ! Clette et al. 2015 - First telescopic observations of sun spots - Observations by Simon Marius 1611 – 1619 - More observations by Saxonius, Tarde, Malapert: Constraining the first telescopic Schwabe cycle (1620) Erste teleskopische Beobachtungen von Flecken (ab 1609): -Vorstufen als Lesestein um 1000 AD (Ibn al-Haytham) - Linsen, Monokel, Brillen im Mittelalter (China, Italien) - Teleskop 1608 (Hans Lipperdey, Holland) - Galileo Galilei: erste Himmelsbeobachtungen (1609) Jupiter-Monde, Sterne in Milchstraße, Venus-Phasen, Sonnenflecken - Kepler Fernrohr (1611) Kopernikanische Wende: Helio-Zentrismus Erste teleskopische Beobachtungen von Flecken (ab 1609): - Galileo Galilei: erste Himmelsbeobachtungen -
An Improved CURE Algorithm Mingjuan Cai, Yongquan Liang
An Improved CURE Algorithm Mingjuan Cai, Yongquan Liang To cite this version: Mingjuan Cai, Yongquan Liang. An Improved CURE Algorithm. 2nd International Conference on Intelligence Science (ICIS), Nov 2018, Beijing, China. pp.102-111, 10.1007/978-3-030-01313-4_11. hal-02118834 HAL Id: hal-02118834 https://hal.inria.fr/hal-02118834 Submitted on 3 May 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution| 4.0 International License An improved CURE algorithm Mingjuan Cai 1, Yongquan Liang 2 1College of Computer Science and Engineering and Shandong Province Key Laboratory of Wisdom Mine Information Technology, Shandong University of Science and Technology, Qingdao 266590, China 2College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao 266590, China [email protected] ABSTRACT. CURE algorithm is an efficient hierarchical clustering algorithm for large data sets. This paper presents an improved CURE algorithm, named ISE-RS-CURE. The algorithm adopts a sample extraction algorithm combined with statistical ideas, which can reasonably select sample points according to different data densities and can improve the representation of sample sets. When the sample set is extracted, the data set is divided at the same time, which can help to reduce the time consumption in the non-sample set allocation process. -
The Maunder Minimum and the Variable Sun-Earth Connection
The Maunder Minimum and the Variable Sun-Earth Connection (Front illustration: the Sun without spots, July 27, 1954) By Willie Wei-Hock Soon and Steven H. Yaskell To Soon Gim-Chuan, Chua Chiew-See, Pham Than (Lien+Van’s mother) and Ulla and Anna In Memory of Miriam Fuchs (baba Gil’s mother)---W.H.S. In Memory of Andrew Hoff---S.H.Y. To interrupt His Yellow Plan The Sun does not allow Caprices of the Atmosphere – And even when the Snow Heaves Balls of Specks, like Vicious Boy Directly in His Eye – Does not so much as turn His Head Busy with Majesty – ‘Tis His to stimulate the Earth And magnetize the Sea - And bind Astronomy, in place, Yet Any passing by Would deem Ourselves – the busier As the Minutest Bee That rides – emits a Thunder – A Bomb – to justify Emily Dickinson (poem 224. c. 1862) Since people are by nature poorly equipped to register any but short-term changes, it is not surprising that we fail to notice slower changes in either climate or the sun. John A. Eddy, The New Solar Physics (1977-78) Foreword By E. N. Parker In this time of global warming we are impelled by both the anticipated dire consequences and by scientific curiosity to investigate the factors that drive the climate. Climate has fluctuated strongly and abruptly in the past, with ice ages and interglacial warming as the long term extremes. Historical research in the last decades has shown short term climatic transients to be a frequent occurrence, often imposing disastrous hardship on the afflicted human populations. -
Dynamic Group Recommendation Based on the Attention Mechanism
future internet Article Dynamic Group Recommendation Based on the Attention Mechanism Haiyan Xu , Yanhui Ding *, Jing Sun, Kun Zhao and Yuanjian Chen School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China; [email protected] (H.X.); [email protected] (J.S.); [email protected] (K.Z.); [email protected] (Y.C.) * Correspondence: [email protected]; Tel.: +86-139-0531-6824 Received: 29 July 2019; Accepted: 16 September 2019; Published: 17 September 2019 Abstract: Group recommendation has attracted significant research efforts for its importance in benefiting group members. The purpose of group recommendation is to provide recommendations to group users, such as recommending a movie to several friends. Group recommendation requires that the recommendation should be as satisfactory as possible to each member of the group. Due to the lack of weighting of users in different items, group decision-making cannot be made dynamically. Therefore, in this paper, a dynamic recommendation method based on the attention mechanism is proposed. Firstly, an improved density peak clustering (DPC) algorithm is used to discover the potential group; and then the attention mechanism is adopted to learn the influence weight of each user. The normalized discounted cumulative gain (NDCG) and hit ratio (HR) are adopted to evaluate the validity of the recommendation results. Experimental results on the CAMRa2011 dataset show that our method is effective. Keywords: group recommendation; density peak clustering; attention mechanism 1. Introduction With the rapid development of information technology, the recommendation system has become an important tool for people to solve information overload issues. The recommendation system has been widely used in many online information systems, such as social media sites, E-commerce platforms, etc., to help users choose products [1–3] that meet their needs. -
An Improvement for DBSCAN Algorithm for Best Results in Varied Densities
Computer Engineering Department Faculty of Engineering Deanery of Higher Studies The Islamic University-Gaza Palestine An Improvement for DBSCAN Algorithm for Best Results in Varied Densities Mohammad N. T. Elbatta Supervisor Dr. Wesam M. Ashour A Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science in Computer Engineering Gaza, Palestine (September, 2012) 1433 H ii stnkmegnelnonkcA Apart from the efforts of myself, the success of any project depends largely on the encouragement and guidelines of many others. I take this opportunity to express my gratitude to the people who have been instrumental in the successful completion of this project. I would like to thank my parents for providing me with the opportunity to be where I am. Without them, none of this would be even possible to do. You have always been around supporting and encouraging me and I appreciate that. I would also like to thank my brothers and sisters for their encouragement, input and constructive criticism which are really priceless. Also, special thanks goes to Dr. Homam Feqawi who did not spare any effort to review and audit my thesis linguistically. My heartiest gratitude to my wonderful wife, Nehal, for her patience and forbearance through my studying and preparing this study. I would like to express my sincere gratitude to my advisor Dr. Wesam Ashour for the continuous support of my master study and research, for his patience, motivation, enthusiasm, and immense knowledge. His guidance helped me in all the time of research and writing of this thesis. I could not have imagined having a better advisor and mentor for my master study. -
M = Al, Cr, Fe, Ga, In, Sc, Ti, and V) and Fej-J-Mjjfg Mixed Series (M = Ga, Cr, V) *
Systematic Investigation of MF3 Crystalline Compounds (M = Al, Cr, Fe, Ga, In, Sc, Ti, and V) and Fej-j-MjjFg Mixed Series (M = Ga, Cr, V) * H.-R. Blank1, M. Frank1, M. Geiger1, J.-M. Greneche2, M. Ismaier1, M. Kaltenhäuserl, R. Kapp1, W. Kreische1, M. Leblanc3, U. Lossen1, and B. Zapf1 1 Physikalisches Institut der Universität Erlangen-Nürnberg, Erwin-Rommel-Str. 1, 91058 Erlangen, Germany 2 URA CNRS 807, Faculte des Sciences, Universite du Maine, 72017 Le Mans, BP 535, Cedex, France 3 URA CNRS 449, Faculte des Sciences, Universite du Maine, 72017 Le Mans, BP 535, Cedex, France Z. Naturforsch. 49 a, 361-366 (1994); received July 23, 1993 The electric field gradient at the fluorine site of several crystalline trifluorides was measured by means of the time differential perturbed angular distribution method. The hyperfine data (vq and rj) are systematically analyzed by taking into account the structural parameters of the crystals; tney are also compared to the results obtained by point charge calculations. Introduction literature: rhombohedral [2], hexagonal-rhombohe- dral [3] and cubic [4] for 3ScF and rhombohedral or During the last fifteen years, the intrinsic character monoclinic for InF3. These aspects are relevant for the of the fluorine ion, mainly its low polarizability, discussion of experimental data, but the TDPAD hy- evoked much fundamental interest in the physical perfine parameters are not highly sensitive to symme properties of fluorides. In the case of iron based fluo try conditions. rides, the electric field gradient (EFG) at the 57Fe sites After a brief description of the experimental aspects, was determined by Mössbauer spectroscopy [1]. -
Download (PDF)
DX.Y “TITLE OF THE DELIVERABLE” Ref. Ares(2021)1046119 - 06/02/2021 Version 1.0 Document Information Contract Number 780681 Project Website https://legato-project.eu/ Contractual Deadline Dissemination Level D4.4 “REPORT ON ENERGY-EFFICIENCY EVALUATIONS AND OPTIMIZATIONS FOR ENERGY-EFFICIENT, SECURE, Nature RESILIENT TASK-BASED PROGRAMMING MODEL AND COMPILER EXTENSIONS” Author DX.Y “TITLEVersion 2 OF THE DELIVERABLE” Contributors Document Information Version 1.0 Reviewers Contract Number 780681 Project Website https://legato-project.eu/ Contractual Deadline 30 November 2020 Dissemination Level Public Document InformationNature Report Author Marcelo Pasin (UNINE) Contributors Isabelly Rocha (UNINE), Christian Göttel (UNINE), Valerio The LEGaTO project has receivedSchiavoni (UNINE), Pascal Felber (UNINE), Gabriel Fernandez funding from the European Union’s Horizon 2020 (TUD), Xavier Martorell (BSC), Leonardo Bautista-Gomez (BSC), research and innovationMustafa Abduljabbar (CHALMERS), Oron Port (TECHNION), programme under the Grant Agreement No 780681 Contract Number 780681Tobias Becker (MAXELER), Alexander Cramb (MAXELER) Reviewers Madhavan Manivannan (CHALMERS), Daniel Ödman (MIS), Elaheh Malekzadeh (MIS), Chistian von Schultz (MIS), Hans Salomonsson (MIS) Project Website https://legato-project.eu/ The LEGaTO project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Grant Agreement No 780681. Contractual Deadline DX.Y Version 1.0 1 / 4 D4.4 Version 2 1/ 62 Dissemination Level Nature -
United States Patent 19 11 Patent Number: 5,360,712 Olm Et Al
D US005360712A United States Patent 19 11 Patent Number: 5,360,712 Olm et al. 45 Date of Patent: Nov. 1, 1994 54 INTERNALLY DOPED SILVER HALIDE 4,981,781 1/1991 McDugle et al. ................... 430/605 EMULSIONS AND PROCESSES FOR THER 5,037,732 8/1991 McDugle et al. ................... 430/567 ARA 5,132,203 7/1992 Bell et al. ............. ... 430/567 PREP TION 5,268,264 12/1993 Marchetti et al. .................. 430/605 75) Inventors: McDugle;Myra T. Olm, Sherrin Webster; A. Puckett, Woodrow both G.of FOREIGN PATENT DOCUMENTS Rochester; Traci Y. Kuromoto, West 513748A1 11/1992 European Pat. Off. ... GO3C 7/392 Henrietta; Raymond S. Eachus, Rochester; Eric L. Bell, Wesbter; OTHER PUBLICATIONS Robert D. Wilson, Rochester, all of Research Disclosure, vol. 176, Dec. 1978, Item 17643, N.Y. Section I, subsection A. Research Disclosure, vol. 308, Dec. 1989, Item 308119, 73) Assignee: Eastman Kodak Company, Section, I, subsection D. Rochester, N.Y. 21 Appl. No.: 91,148 Primary Examiner-Janet C. Baxter J. v. V. 19 Attorney, Agent, or Firm-Carl O. Thomas 51) Int. Cl................................................. GC/ A process is disclosed of preparing a radiation sensitive 52 U.S.C. .................................... 430/567; 430/569; silver halide emulsion comprising reacting silver and 430/604; 430/605 halide ions in a dispersing medium in the presence of a 58 Field of Search ................ 430/567, 569, 604, 605 metal hexacoordination or tetracoordination complex (56) References Cited having at least one organic ligand containing a least one carbon-to-carbon bond, at least one carbon-to-hydro U.S. -
Zerohack Zer0pwn Youranonnews Yevgeniy Anikin Yes Men
Zerohack Zer0Pwn YourAnonNews Yevgeniy Anikin Yes Men YamaTough Xtreme x-Leader xenu xen0nymous www.oem.com.mx www.nytimes.com/pages/world/asia/index.html www.informador.com.mx www.futuregov.asia www.cronica.com.mx www.asiapacificsecuritymagazine.com Worm Wolfy Withdrawal* WillyFoReal Wikileaks IRC 88.80.16.13/9999 IRC Channel WikiLeaks WiiSpellWhy whitekidney Wells Fargo weed WallRoad w0rmware Vulnerability Vladislav Khorokhorin Visa Inc. Virus Virgin Islands "Viewpointe Archive Services, LLC" Versability Verizon Venezuela Vegas Vatican City USB US Trust US Bankcorp Uruguay Uran0n unusedcrayon United Kingdom UnicormCr3w unfittoprint unelected.org UndisclosedAnon Ukraine UGNazi ua_musti_1905 U.S. Bankcorp TYLER Turkey trosec113 Trojan Horse Trojan Trivette TriCk Tribalzer0 Transnistria transaction Traitor traffic court Tradecraft Trade Secrets "Total System Services, Inc." Topiary Top Secret Tom Stracener TibitXimer Thumb Drive Thomson Reuters TheWikiBoat thepeoplescause the_infecti0n The Unknowns The UnderTaker The Syrian electronic army The Jokerhack Thailand ThaCosmo th3j35t3r testeux1 TEST Telecomix TehWongZ Teddy Bigglesworth TeaMp0isoN TeamHav0k Team Ghost Shell Team Digi7al tdl4 taxes TARP tango down Tampa Tammy Shapiro Taiwan Tabu T0x1c t0wN T.A.R.P. Syrian Electronic Army syndiv Symantec Corporation Switzerland Swingers Club SWIFT Sweden Swan SwaggSec Swagg Security "SunGard Data Systems, Inc." Stuxnet Stringer Streamroller Stole* Sterlok SteelAnne st0rm SQLi Spyware Spying Spydevilz Spy Camera Sposed Spook Spoofing Splendide -
Kepler and the Jesuits, Michael Walter Burke-Gaffney, S.J. (1944).Pdf
ixNM^KrnrFRi^ mji iiiNir*! CO c >KU\ ic»n \f» v Mftimioriiu'.t n ( < nice r O Mai mi v* Nf amtouvs LHrJUULI m nc. xwii § m m > z a H m3C jftaUISp m en C H theJESU CD BY M.W. BURKE - GAFFNEY ST. IGNATIUS LIBRARY »**,.* ^ » 980 T- PARK AVENUE / »naT,„, oh NEW YORK CITY 28 "«w Vowk Date Loaned ©23 IM ^0*0v&*0v&A&*&*&*&H&*&*&*&K&r&*&*.&>»&*'&*&*,O'*'&*-0*&*&* Kepler and the Jesuits ^^<^W^Jl^X^lt^>C^lC^X^X^K^>t^X^5<^X^X^K^X^X^X^X^X^l<^)t^l<^> "My thoughts are with the Dead; with them 1 live in long-past years, Their virtues love, their faults condemn, Partake their hopes and fears, And from their lessons seek and find Instruction with a humble mind." — SOUTHEY. M. W. BURKE-GAFFNEY, S.J. 'I measured the skies." Johann Kepler THE BRUCE PUBLISHING COMPANY MILWAUKEE Imprimi potest: T. J. Mullai-ly, S.J. Nihil obstat: H. B. Rjes, Censor librorum Imprimatur: + Moyses E. Kiley. Archiepiscopus Milwaukiensis Die 11 Aprilis. 1944 CONTENTS Page Chapte f 1 I Introducing Kepler II The Imperial Mathematician 15 III . 26 IV V . 60 VI Sunspots ..... • 71 VII Mercury in the Sun . 8s 9i WAR FORMAT VIII Heliocentric Hypothesis • This book is produced in complete accord with the Governinem regulations for the conservation of paper and other essential materials. IX X Aids to Astronomy . 117 XI The Last Chapter 129 Bibli Copyright. 1944 The Bruce Publishing Company Indej Printed in the United States of America CHAPTER I INTRODUCING KEPLER Johann Kepler was enjoying a studentship at the University of Tubingen when the Parodies, the Lutheran school at Graz, applied for a teacher of astronomy.