Cite4Me: Semantic Retrieval and Analysis of Scientific Publications

Bernardo Pereira Nunes Besnik Fetahu Marco Antonio Casanova PUC-Rio L3S Research Center PUC-Rio Rio de Janeiro, Brazil Appelstrasse 9a Rio de Janeiro, Brazil [email protected] Hannover, Germany [email protected] [email protected]

ABSTRACT lications3. However, current approaches by main digital library 4 5 This paper presents the Cite4Me Web application and its features providers, such as ACM Digital Library and Elsevier , do not created for the LAK Challenge 2013. The Web application focuses represent the current state of research on exploring resources us- on two main directions: (i) interlinking of the LAK dataset with ing approaches from Information Retrieval, Information Extraction related data sources from the Linked Open Data cloud; and (ii) and Semantic Web. Thus, get an overview of research topics, find providing innovative search, visualization, retrieval and recommen- publications and discover new nomenclatures are an arduous and dation of scientific publications from the LAK dataset and related laborious task that are not always successful. interlinked resources. Our approach is based on semantic and co- In this paper, we introduce Cite4Me a novel application for ex- occurrence relations to provide new browsing experiences to Web ploratory search, retrieval and visualization of scientific publica- users and an overview of scientific data available. Furthermore, we tions. Cite4Me intends to provide to the end users a single point for present a detailed analysis of the LAK dataset along with applica- accessing papers and hence reducing efforts of searching in several data sources. Our system takes advantage of reference datasets, tions which contributes to the development of the learning analytics 6 field. such as DBpedia , to explore semantic relationships between sci- entific papers and user queries. Additionally, an analysis of topic coverage and shared concepts from related educational datasets, ex- 1. INTRODUCTION tracted from the Linked Open Data cloud, will be introduced. The volume of information on the Web has been growing steadily The remaining of the paper is organized as follows. Section 2 over the last decade and has doubled every two years. The vast presents the approach used for searching, retrieving and recom- amount of data available on the Web along with new means of mending papers. Section 3 describes the process of dataset dis- communications have transformed our society, including the way covery and interlinking and Section 4 shows a brief result analysis we work, live, relate to each other and learn. of the data discovery. Finally, Section 5 presents related work and In the midst of change, the Learning Analytics emerges to make Section 6 presents some concluding remarks. sense of the produced educational data reported by learners, pro- fessors, institutions and so on. Analyzing and understanding the 2. CITE4ME changes along the past years help us to understand the current state As one of the main goals of the field of “Learning Analytics” is and be aware of the forthcoming trends, enabling a new outlook of to support students in their learning process, we developed a Web the future of learning. application called Cite4Me7 that assists students in making deci- 1 2 A recent challenge initiative of SOLAR and LinkedUp project sions to find scientific publications and identify relevant research arises to leverage the creation of tools that enables the analysis, topics. visualization, browsing and recommendation of scientific and edu- Cite4Me implements semantic and co-occurrence methods to (a) cational data. search and retrieve scientific publications; and (b) recommend sci- Although the scientific field has fostered the creation of new entific publications. Moreover, it provides a Web interface that fa- applications in several areas, such as medical, biology, physics, cilitates the search for publications and may help users on discov- amongst others, the information access is based mostly on free ering related terms to a given query. text search and on hierarchical classification system of the pub- In this section, we provide an overview of the major features of the Web application and its Web interface that assist users to 1 Society for Learning Analytics Research - http://www. explore scientific data on the Web. solaresearch.org 2http://linkedup-project.eu/ 2.1 Search and Retrieval Cite4Me relies on search functionalities to meet the users needs. Briefly, we implemented standard Information Retrieval (IR) and Semantic Web (SW) approaches to retrieve and recommend scien- tific papers to the users. We divided this subsection into (i) free text 3http://www.acm.org/about/class/ 4http://dl.acm.org 5http://www.elsevier.com Copyright c 2013 by the papers’ authors. Copying permitted only for pri- 6 vate and academic purposes. http://dbpedia.org 7 LAK-Data Challenge ’13 Leuven, Belgium http://www.cite4me.com/ search; (ii) exploratory search; and (iii) semantic search. for the entities contained in a publication. Finally, the ranking of the results is based on the sum of the tf-idf scores of the matching 2.1.1 Free Text Search concepts. The purpose of the free text search functionality is to offer users Figure 2 illustrates the semantic search functionality. It also gen- the abilities to search for mentions, titles and authors of academic erates a tag cloud from matching publications, showing the most publications contained in the LAK dataset. Even though, this func- prominent terms for a given query. Specifically, the tag cloug based tionality is similar to existing digital libraries, we agree that this is a on the results helps the users to have an insight about the topics and basic functionality that must be provided by our application. There- may assist in finding related terms previously unknown by them. fore, we use standard vector space models (tf-idf ) for indexing and retrieving documents. 2.2 Paper recommendation The tf-idf scores were computed for each term extracted from the Another key feature of our system is the paper recommendation publication content after applying stemming [14]. Furthermore, the based on semantic relationships extracted from reference datasets. searching functionality offers boolean queries with standard oper- The recommendation is based on a previous work [12, 11], where ators, such as ’OR’, ’AND’, and also a ranking of the matching we exploit the number of paths and the distance (length of a path) publications based on the sum of tf-idf scores from the individual between given entities to compute a relatedness score between ex- query terms. tracted entities and associated documents. The first step to measure In summary, our free text search provides to the users publica- the relatedness between documents is to compute the semantic con- tions (P) that match query terms and non-matching publications nectivity score (S CS e) of the entities found in each text (see Eq. 2). P0, which are related to P according to a degree of similarity (see Eq. 1), but does not contain the query terms. τ X l The similarity between a matching publication P and other non- SCS e(a, b) = β · |paths(a,b)| (2) 0 matching publication P in the LAK dataset is measured by the l=1 standard cosine similarity measure, which is built on top of the where |paths | is the number of paths between a and b of length l computed tf-idf scores. (a,b) and 0 < β ≤ 1 is a positive damping factor. As in [12, 11], we used β = 0.5 as our damping factor. Furthermore, we also constrained P · P0 S im(P, P0) = (1) the length of a path to τ = 4. |P||P0| Based on the score for entities, we then define the semantic con- 0 nectivity score (SCS w) between two documents W1 and W2 as fol- where P and P represent the tf-idf scores for the terms in two dis- lows: tinct publications.     2.1.2 Exploratory Search    X |E1 ∩ E2|  1 SCS w(W1, W2)= SCS e(e1, e2) +  ∗ (3) In this section, we provide detailed insights on the exploratory  2  |E1| ∗ |E2| e1∈E1  search functionality of our application. As a preliminary step to e2∈E2  e e provide analytics and information about the actual content and top- 1, 2 ics coverage, all the scientific publications contained in the LAK where Ei is the set of entities associated with Wi, for i = 1, 2. Note dataset are previously enriched. The enrichment process was per- that documents that contain the same entities receive an extra bonus formed using DBpedia Spotlight API8, where entities, entity types (the second term on the right-hand side of Eq. 3). and their respective categories were extracted. Thus, a list of documents pairs is generated and ranked according After the enrichment process, we cluster the publications accord- the score and suggested to the user. Figure 3 illustrates the paper ing to entities and its categories found in each document. The publi- recommendation process computed based on SCS w. cations are clustered in a tree-based structure over the enrichments. Note that, each node of the tree represents a topic in which a pub- 3. DATASET DISCOVERY AND INTER- lication under this node covers. Thus, the exploratory search is performed through the topics covered by each publication. LINKING The process of linking publications, categories and extra re- This section briefly describes the datasets used on automatic re- sources is mediated by DBpedia knowledge graph, where we use lated data discovery from DataHub9 and future steps on dataset dis- the dcterms:subject property to match the resources. covery and interlinking. Thus, as a result, the exploratory search provides a way to explore resources through the connections between their topics, 3.1 LAK Dataset which facilitates the search for topically related resources. Figure 1 The LAK dataset contains the metadata of the papers published shows the exploratory search. in the proceedings of LAK conference 2011-12, a special issue of Learning and Knowledge Analytics: Educational Technology & 2.1.3 Semantic Search Society, the proceedings of the International Conference on Educa- Cite4Me provides also a semantic search engine that assists users tional Data Mining (2008-12) and the Journal of Educational Data to find publications semantically related to the query terms. Analo- Mining (2008-12). In total, 315 descriptions of papers containing gously to explicit semantic analysis (ESA) technique [5], the relat- detailed information about authors, institutions, conference venues edness score, is computed between the enriched concepts found in and the full content of the paper were available. the publications’ content. Basically, the semantic search is an adaptation of the free text 3.2 Data Analysis search presented in the Section 2.1.1. Instead of computing the The goal of the data analysis procedure is to align the various tf-idf scores for the words in a text, it computes the tf-idf score publications in the LAK dataset based on mutual information, such 8http://dbpedia.org/spotlight 9http://www.datahub.io Figure 1: Preview of the exploratory search funcionality.

Figure 2: Preview of the semantic search funcionality.

Figure 3: An example of paper recommendation based on SCS w. 4. EVALUATION OF DATA ANALYSIS AND DATA DISCOVERY This section presents an overview of the results obtained by an- alyzing the LAK dataset with respect to the constructed feature set that describes topics covered by individual publications. Moreover, based on the data analysis procedure and shared information, we show that the establishment of links between the different publica- tions within the LAK dataset and from other datasets in DataHub is possible. In the following subsections, we show the analysis of the LAK dataset and the discovery of relevant datasets and publications. 4.1 Data Analysis The data analysis of the LAK dataset focuses mostly on assess- ing the individual publications for their topic coverage. In this man- ner, we build a connected data graph consisting of the individual publications and items from the feature set. This step is necessary Figure 4: Relevant Dataset Discovery Framework based on to provide the exploratory search functionality, where based on the the generated feature set used to query DataHub Linked Data established edges between publications and feature set items, we provider. can navigate through the publications or topics of interest. There- fore, the results obtained with respect to the constructed feature set and LAK dataset graph are shown in what follows. as the topics covered by them. This is achieved using well estab- Table 1 shows the top ranked items for each of the feature sets, lished datasets like DBpedia10 and Freebase11, where a reference along with the number of associations an item has with respect to point for the unstructured textual content of publications is created all publications (entity, category and type items). Figure 5 shows through an enrichment process. the constructed data graph for the LAK dataset. Again, the enrichment process is carried out using DBpedia 4.2 Data Discovery Spotlight12 [10] and addresses several issues of significant impor- tance. For instance, it offers several advantages such as: (i) identi- After creating the feature set based on the information provided fication of (common) named entities, (ii) disambiguation; and (iii) from reference datasets, we are able to query for relevant datasets expansion of the limited dataset and resource descriptions with ad- in DataHub. ditional background knowledge. Thus, for the top ranked feature set items, the data discovery for relevant resources is considered. Table 2 shows the discovered 3.3 Data Discovery resources and datasets for the top-10 entity items. Note that, we fo- cus only on bibliographic datasets, since we aim at recommending Our Web application uses as its starting point the instances in topically related scientific publications. Due to the lack of bib- the LAK dataset to automatically explore and recommend to users, liographic datasets, we were not able to find related publications datasets that covers similar topics. In order to query, detect and for all entities considered. Table 2 summarizes the discovered re- interlink related datasets, we chose the DataHub as a data provider. sources. The dataset names are represented by their acronyms as DataHub serves as a collecting point of datasets from various fields follows: b3kat - “Bayerische Staatsbibliothek", hebis - “Hessis- and currently it has over 5000 datasets. Note that, from the large ches Bibliotheks Informations System" and npg - “Nature Publish- number of datasets, only 300 datasets are provided as Linked Open ing Group - ALL". Data. As the latter is the main focus of our work, the analysis and Additionally, from the set of 96 bibliographic datasets available, interlinking process is focused for such datasets. only a few of them were offered as Linked Data, thus narrowing 13 Briefly, the data discovery is performed using CKAN data our search space for relevant resources. management framework from DataHub, where based on data anal- ysis and user interests (such as topics covered by a publica- Entity b3kat hebis npg tion/resource) related datasets are suggested. Data 14 0 12 Additionally, we provide to the user a set of resources, amongst Learning 5 0 1 other data analytics, that enables the user to harvest and correlate Data mining 0 0 0 new information from the discovered resources, considering the Algorithm 4 0 0 LAK dataset as a starting point of such discovery. Education 17 1 2 This approach presents several advantages such as the adoption Analysis 42 1 6 and the widespread use of Linked Data principles for publishing Student 7 1 1 scientific papers. Nowadays, many conferences make their pro- Knowledge 11 0 0 ceedings and journals freely accessible, hence our approach would Methodology 4 0 1 take advantage of such open data and offer users topically relevant Statistics 7 0 1 papers for a particular resource in the LAK dataset. Table 2: Number of discovered resources from the bibliographic 10http://dbpedia.org group for the top ranked items from the entity feature set, based 11http://www.freebase.com/ on the LAK dataset. 12http://spotlight.dbpedia.org/ 13http://www.ckan.org Entity Assoc. Category Assoc. Type Assoc. Data 90 Educational_psychology 161 DBpedia:TopicalConcept 150 Learning 80 Data_analysis 150 Freebase:/book 142 Data_mining 67 Learning 139 Freebase:/book/book_subject 142 Algorithm 50 Scientific_method 137 Freebase:/media_common 138 Education 49 Neuropsychological_assessment 136 Freebase:/media_common/quotation_subject 136 Analysis 48 Greek_loanwords 135 Freebase:/computer 125 Student 46 Data 131 Freebase:/education 122 Knowledge 46 Evaluation_methods 129 Freebase:/education/field_of_study 120 Methodology 42 Computer_data 126 Freebase:/computer/software_genre 120 Statistics 41 Research_methods 124 Freebase:/internet 118 System 37 Systems_science 118 Freebase:/internet/website_category 118 Scientific_modelling 37 Formal_sciences 108 Freebase:/award 114 Prediction 36 Data_management 108 Freebase:/media_common/media_genre 105 Data_set 36 Cognitive_science 107 Freebase:/organization 103 Statistical_classification 30 Statistical_terminology 107 Freebase:/award/award_discipline 103 Evaluation 29 Developmental_psychology 101 Freebase:/business 102 Standard_deviation 29 Intelligence 93 Freebase:/organization/organization_sector 99 Probability 28 Data_mining 91 Freebase:/people 99 Behavior 26 Critical_thinking 87 Freebase:/film 94 Interaction 24 Thought 84 Freebase:/book/periodical_subject 93

Table 1: Top ranked items from the feature set for the LAK Dataset, from the dataset analysis.

Industrial processes UnitLaboratory operations techniques SeparationDistillation processes

PaperBooks products

http://data.linkeded...

MultivariableComputerQuantification calculus security mo... ReligiousDifferential fundamenta... operators

BritishGeneticClothingAnalog brands algorithms circuits retailers o... Retail GeneticcompaniesFashion programmingdesignTheories ofReproduction ... of law CompaniesEvolutionary establishe...Propositions algorit... TeachableCompanies unitsImmediateSupermarkets based for ... in inference L... of Nort... 1884 establishmentsFlynn'sDepartment taxonomy ... storesZoology DepartmentElectronic storesSpirituality of... design JavaCompetition specification r... Popular culture MarksLegal & Spencer termsMilitary JVM programmingJava programming lang... lan... ClusterFault-tolerant computing compu... Media studies MultipleElectronics births terms SocialEconomic status problems Local areaSupercomputers networks JavaProgramming platformNarcissism language... Theology SchoolEducational examinations qualific... ConcurrentOligarchySocial events programmi... Russell Group 1951 introductions Musical notation Predicate logic Logic symbols CaseAuthorityLawyers Western Reserve... Propositional calculus Marxist theory EcclesiasticalEducationalLecturers titles institut... HistoryLists by of country Cambridge Host citiesArtisans of the C... OrganisationsOxbridgeCoimbra based Group... PopulatedHudson's places Bay est... Company... VisitorUniversity attractions of Cambri...... Orders of magnitude Classes of computers AnglicanCultureLaw ecclesiasti... degrees in Cambridge PopulatedKindnessEdmonton places est... AnglicanCounterculture priests HybridElectric vehicles vehicles Garment industry http://data.linkeded... Universities1209Legal establishments education and col...... DrawingEngines EconomicDevelopment development economics MilitarySecret projects military prog... Systems of formal lo... Atomic bombingsManhattan ofMachines Project H... Environmental design Syntax ArticlesMilitary with exampl... historyCode names of ... LandscapeArchitecture architecture MilitaryCriticismNuclear history ofweapons ... prog... Visual arts CrystallographyInversePhilosophyProgrammingMultilingual problems Internetof biology language... websites activism MusicalArticles techniquesReviewUnits with of exampl...websites frequency Individual physical ... ApproximationLexis (linguistics) algori...ProgrammingDynamically language... typed pr... Nuclear history ofRhythm t... WorksArt ofmedia art Objective-CCollaborativeDevelopmentalGNUstepCustomary projects units biologyNeXTReality of m... by type NuclearLiteraryPhilosophy weaponsconcepts of t... ComputerComputationalString physics (computer physicseng... sci... GeneralDiscrete encyclopedias Searchgeometry algorithmsContinuousWikipediaHabitat mappings (ecology) te... Operators (programmi... HistoryMathematicalLandscapeChemical constantsWikimedia ecology propertiesFreeMineralogyOpen projectsencyclopedias content projects DiscoveryVideo Communicat... game development ArticlesCondensed withOnline exampl... matter dictionariesLexical phy... databasesMassCommerce TelevisionDiscoveryEnglish-language channels Channel ... tel... CreativeFunctional Commons-lic...Internet programmingOnline properties encyclopediasForce Physiology... Class-basedC programming programm... langua... PowerStatisticalImperialTails laws Grammaticalof units mechanics probability...Searching cases Structural engineering Quantity ModesBridges Catholic music AncientMelody Greek types music

ArchitecturalHigher commun...Higher education education in ...in ... MBTIPersonality types tests AutomobileAutomotive transmiss... transmiss... Personality typologies MechanicalEngagement power con... Jungian tradition http://data.linkeded... NotationSymbols Logical connectives Semiotics Military units and f... Knowledge bases Ad-hocUnited units States and Navy for... o... Model theory AcademicPeer literature review Classical logic Task forces Virtue Expert witnesses 19th-century1856 mathema... births Acronyms PsychologistsScienceTests studies AIDSPandemics origin hypotheses Association of Indep... MentalMusical healthScientistsMaster's profes...compositionSafety degrees SaintRussian Petersburg statisticians Sta... Health Syndromesdisasters Political theories HIV/AIDS Formal languages Proof theory Vocabulary 20th-centuryNationalism mathema... Monetary economics Set theory Trigonometry ChristianMeasuringCombat iconography instruments People fromSovereignty Ryazan Federalism in the Un... CrossChristian symbols symbols 1922 deaths EnterpriseQuality architect... management PapermakingPaperStationery Computer storage media Heraldic ordinaries FullSustainability AftermathMembersBoolean of warthe algebra ... InformationFood architects safety Recursion UnitPunched record cardequipment ChristianMilitaryTransducers operationsterms ... LogicalGrief calculi Electronics PragmaticsIntention Self careIBM unit record equi... Evidence law Programming language... Religious symbols Three-digitFull Memberstelephon... of the ... CognitiveGovernment neuroscience of the Un... Employment compensat... TraditionalWriting Chinese media ... IBM Historystorage of devices computing... Causality EngineersSensors Free packageProbabilityLinuxRed package managem... Hattheorists manage... Money School qualificationsIndustrialTheory automationMental of computation content Packaging materials History of software Units of angle Causal inference HealthRussian sciences mathematicians DistanceOpen University education i... Articles to be merge... Programming idiomsInstructionCentral processing processing u... AidHealthInductive1981 care introductionsreasoning Educational institut... 1789 establishments ... AutomationSelf-reference Microprocessorshttp://data.linkeded...ChinesePrinting inventions materialshttp://data.linkeded... Physical exercise FormerEmergency Eastern telephone Ortho...... 1969 establishments ... Buildings and struct... Paper art 1968Medicine introductionsPrimary care Exempt charities Building MiddleElectricity States Associ... Style (fiction) MIPS Technologies AssociationDistance of Commo...education i... Real estate Electric current PublicArchive services formats Charities based in http://data.linkeded...B... BallisticsTheatre InternationalNursing develo... Charities based in S... Traditions Alchemical processes http://data.linkeded... PerformanceCulture jamming art tech... Support groups CombinatoricsArguments on words MeasureTelevision theory technology FiniteIntegral rings calculus Instruction set arch... GraduationCeremonies Articles with exampl...Metalogic Science occupations Computer-aidedDigital design imaging Internet radio Languages Literature Domain-specificDigital geometry prog... PeercastingLinguistic research Psychiatry Trainscontrover... Physical layer proto... ArtComputer movements graphics da... Internet broadcastingCorpus linguistics Engineering discipli... Art materials Video game genres Firearm terminology Computer engineeringStreamingInternet media television syst... LiteraryWriting criticismoccupations Fiction MicrosoftRail Windows Criminologytransport Modular arithmetic North Central Associ... ActionPsychiatric (genre) diagnosis Continuing education Electronic engineering CloudDigital storage television Materials ActionLabeling video games theory Political engineering Applications of dist... http://data.linkeded... PhilosophicalFiction-writing method... mode EntertainmentMusic Academic administrat... Video on demand serv... Metaphysics Reality Wikipedia articles w... Association of Ameri...Behaviorism PhilosophicalAnalogy argume... http://data.linkeded... Drug delivery devices Subjects taught in m... http://data.linkeded... Homogeneous chemical...Solutions Independent agencies... Dosage forms Foundations1950 establishments based in...... Examinations Concepts in aesthetics SpaceNothing NationalOrganizations Science Fou... establ... Belief PhysicalColloidal chemistry chemistry Simple living Science and technolo...Pittsburgh History &... Psychiatric institut... Musical terminology Environments Veracity Funding bodiesEducational institut... History of psychiatry Macroeconomics http://data.linkeded... ElementaryApproximations arithmetic Anti-psychiatry Documents Core issues in ethics Laboratories SoftwareUniversities bugs and col... Social institutions Political culture Medical terms CarnegieOak Ridge Mellon Associated... Univ... Cohort studies Inquiry http://data.linkeded... DiscoveryIntegrated and invent... circuits ReliabilityFailure engineer... Italian loanwords TheoriesGrammar of aesthetics frameworks Printing Units of time Maintenance SyntacticHigher transforma... category theory Rhetoric Dispositional beliefs ArtsHealthcare quality Semiconductor devices Geometric shapes Bias HealthEvidence-based informatics medic... Fractions SocialCommunalismNoam theories Chomsky Informal fallacies Books by type http://data.linkeded... Telecommunications HTML PhilosophicalMorphology theories Giving CategoryHistory Metanarrativestheory of sociology Meetings TextbooksLatin words and phra... http://data.linkeded...Least squaresInterest (psychology)Grid computingAustralian inventions 21st century FluidFluid dynamics mechanics http://data.linkeded... Ratios TheoreticalPhysical cosmologyphysics NewGenerative ThoughtParadoxes linguistics terms Chronology Hierarchy ContinuumPiping mechanics Binary operations Single equation meth... CheminformaticsString theory 20thModern century history Employment Millennia Aerodynamics Economics of uncerta... School terminology Typing Natural philosophy Idealism Multi-dimensionalFilms by type ge... ModuleFunctionalism theory Public speakingAdulthood ArchitecturalWealth design http://data.linkeded... Conformity Philosophy of life Empiricism DevelopmentPresentation People in informatio... Neuroscience 3D computer graphics Real analysis Construction equipment Computer programmers Schoolteachers Skills Software requirements Academic pressureIndustry in... Infrastructure Education Positiveand traini... psychology National Association... Language Production economics Academic degrees Virtual communities Universities and col... http://data.linkeded... Community organizing OutsourcingStandards-based Constructioneduc... Educators http://data.linkeded... Visual3D effects imaging Elementary and prima... Projects Selection Calendars Horology Definition Order theory Education terminology Design Performing arts High schools and sec... Memory Patent law Population genetics Welfare economics http://data.linkeded... Teaching PhilosophicalEngineering logic occupati... Spacetime Economic growth 1801 establishmentsNew York Post ... Graphics file formats Individualism Patterns Syntactic entities AlexanderPublicationsNewspapers Hamilton establi... published... AccountingEducation terminology economics School-related terms SelfExplorationContemporaryPostmodern art art Youth Standardized tests Electronic documentsAdobeElections Systems Project management http://data.linkeded... EngineeringEconomic concepts indicatorsCentimetre%E2%80%93g...Orders of magnitude ... LegalResearch Legalentities institutes research ... OccupationsMarriage Humanities Vertical transport d... News Corporation sub... ColonialHappinessGovernment architectur...1764 establishments instituti... Physical quantities SI base units EconomicsISO of standards regulat... Campuses http://data.linkeded... Energy Time DigitalMassachusetts pressCompaniesCultural Instit... studies EmbeddedPlanned systems science and ... Learning to read Ethical principles OrganizationsDoctoralNew basedEngland degrees ... Associat... Corporatism Scale model scales InstitutionsAgricultureEducationalPolitical founded... terms institut... Cohort study methodsEmotionshttp://data.linkeded... Didactics Motivation Elevators Introductory physics Utility United NationsLabour Gener... law Grammar http://data.linkeded... Qualities of thought Software optimizationScale modeling ChangeCorporations PositivemanagementPrivateForms mental universities of atti...governmentPersonal development Occupations in music Students Fundamental physics ... ThinkHarvard tanksTitles University Conductors (music) Political science te... Ethics Analytic functions Computer performanceClassical genetics StandardsDemocracy organizati...HealthScientific research documents Music performance Meaning (philosophy ...Rooms Dynamical systems http://data.linkeded... Model aircraft 16thBusinessFinance arrondissementRhode law Island in ... the ... Education reform Higher education Evidence-basedBrown University pract...Public relations RemoteHelp desktop desk Concurrent computing Electronic feedback Numbers Uncertainty of numbers Formal systems PublicCorporations universitiesOrganisationTheses law for Eco... http://data.linkeded... Risk Schools Educational facilities Topology Mechanical vibrations GeorgianOrganizational architectur... culture Law Customer experience ... Exponentials Filter frequency res... Mathematical axioms EducationalColonialCommunicationRegulation institut...Page Colleges description science lan... HistorySocial of ideasconcepts Socioeconomics Psychology Concepts in physics Mathematical analysis Mathematical proofs InternationalTypes ofeconom...Non-profit businessSocial organizat...anthropologyen... Social psychologyhttp://data.linkeded... Axiology Encodings Wave mechanics FundraisingGovernanceQuality Budgetsassurance AcademiaEducational stages Television terminology Mathematics education Iterative methods ProfessionalAcademic titles ...institutions Culture Discourse analysis COBOL Object-oriented prog... Polynomials Neuropsychology UniversityHarvard MedicalofVector ChicagoMeta-analysis graphics School RoleSociology status of cultureCharles Sanders Peirce Group processes HumanPhilosophy behavior of language Heat transfer Statistics education Validity (statistics) MergersGothic and Revival acquisit... archi... Social philosophy .NET programming lan... http://data.linkeded...Training Underwater diving sa... Mathematical science...Filter theoryMathematical optimiz... Computational fluid ... Sexual arousal CommitteeBiotechnologyCambridge Systematicon Institu... review Identity Sociodynamics Behavior Summary statisticsPolymorphism f... Sexual emotions GreekGenetics inventions or genomics... ComputerCompilers libraries Sun Microsystems UniversitiesProfessorial and col... degrees http://data.linkeded... School types Logic Sorting algorithms OrganizationsHome economics establ... Syntactic relationsh... Community Applied linguistics Abstract algebraMathematics Numerical analysis EquationsTypes of functions Technology Types of universityEngineering ... http://data.linkeded... Infinity Film and video techn... Control theory Elementary algebra Mind%E2%80%93body pr... State functions Coding theory Interpolation Science education http://data.linkeded...Alternative education Linear algebra Algebra Theories of mind PhenomenaPhenomenology Sentences by type Probability distribu... Philosophy of psycho... Human resource manag... Behavioral conceptsInterdisciplinaryMathematical fi... concepts Psychological theories Web 2.0 neologisms http://data.linkeded...http://data.linkeded...Productionhttp://data.linkeded... and manuf... Philosophical concepts Special functions DNA LocalComputer government programming... Chemical engineering Parallel computing http://data.linkeded... Operations research Education policy http://data.linkeded... Citizen media Potential Lexicography http://data.linkeded... Human-based computat... http://data.linkeded... Attention-deficitBranches hy... of philosophy Differential equations http://data.linkeded... Ecological metrics Attention Psychology articles ... Phase transitions Peer-to-peer computing Ontology Semantics Functions and mappings Oral communication http://data.linkeded...Linguistics http://data.linkeded... Manufacturing Arabic loanwords Optimal control Computer storageCultural dev... economics File sharing networks Curricula Systems engineering Education by subject Image processing Neuroimaging BehaviouralPerception sciences Population Evaluation Cryptography Radiology Sound productionRecording tec... Learning methods EvolutionaryUnits biology ofEconomics information... terminology Classified information Brain Computer storage Accountability Business theory Consciousness ConceptsUnsolved in metaphys... problemsInformation in... sensitiv... http://data.linkeded...Data collection Simulation Integral transforms Storage media http://data.linkeded... Behavioral and socia... Mental processes Object-based program... http://data.linkeded... English words and ph... OptimizationOrgans algorit... Frequency domain ana... Mathematical structu... Leadership Academic publishing Demography Programming language... Grammatical voices BASIC interpreters Greek loanwords Physics Geometric algorithms Wagering Electrical circuits Capital http://data.linkeded... MicrosoftMicrosoftBASIC BASIC Visual compilers Stu... Calculus Physical sciences Film techniques BusinessPositions of authority Creativity Concepts Procedural programmi... Mathematical economics EconomicSocial sociology economy Management occupations Management Algorithm descriptio... Articles with exampl... http://data.linkeded... Statistical forecast... Limbic system Environmental issues... http://data.linkeded... Game theoryhttp://data.linkeded... Biology theories Theories Emotion Concepts in ethics Epistemology http://data.linkeded... Group theory Bayesian inferenceCovariance and corre... Systems psychology Policy Experimental psychol... PedagogyHuman communication CognitionConcepts in epistemo...http://data.linkeded... Error detection and ... Time serieshttp://data.linkeded... analysis Progressive Era in t... MultimediaSocial epistemology Debating Architects Video Mathematical termino... BiostatisticsMathematical relations Philosophical school... Politics by issue Philosophy of educat... Formal methods http://data.linkeded... Science experiments HistoryRadioactivityWestern of museums art American philosophy Educational television Social sciences Systems Philosophical moveme... Types of organization Branches of psychology Education Translation Arabic words and http://data.linkeded...phr... Normal distribution Mathematical physics MuseumHistoryHistorical collections of biology scientifi... Sports terminology Pragmatism Social sciences meth... Education-related te... Metrology Geometry Relationship counsel...Interpersonal relati...Bibliometrics Writing http://data.linkeded...Terminology http://data.linkeded... Forteana Political philosophyDemographics Government Research Educational assessme... HistoryCollecting of earth sci... CanadianWestern court classical systemPhilosophy mu... of mind http://data.linkeded... Standardization Methods in sociology http://data.linkeded... Human development NationalEconomies security Consumer behaviour Knowledge sharing Length http://data.linkeded... http://data.linkeded... M-estimators Professional certifi... Former courts and tr... Economic systems Strategic management Critical thinking http://data.linkeded... Mind Teleconferencing Demographic economicsUser interfacesNature Wikis Compiler constructionReference Problem solving Statistical terminol... SupportUnited vector States machi... Statisticalfedera... ratios Institutes TransportationVideo plann... Film production Integrated developme...http://data.linkeded... Engineering statistics Measurement Quality control Supply chain managem... Tools1993Process introductions (computing) Hypothesis testing Associative arrays Bioinformatics VideotelephonyStatements Population ecology Adolescence Philosophy of science Greek words and phra... Accessibility Operating system tec... Structure http://data.linkeded... http://data.linkeded... Article Feedback 5 A... ErgonomicsUrban design UsabilitySociology index Knowledge Philosophy of mathem... Web analytics Data-centricProprietary program... database... Programming paradigms Probability assessment Linear programming ERP software Postgraduate schoolshttp://data.linkeded... MarketingRussian inventions http://data.linkeded...Design of experimentsVector calculus Error Gambling terminology Personality theories Internet marketingMathematical financeSocial issues Library science Biology 1992 software Object-oriented prog... http://data.linkeded... CyberneticsMental structures P-complete problems Data warehousingHealth Educational technology Programming language... Systems theory Concepts in logic Observation Psychometrics Privacy Acoustics Elementary mathematics Continuous distribut... Convex optimization Business planningAnalyticsDigital rights Literacy User interface techn... MicrosoftDesktop database database s... app...Qualitative research Arrays Sociological termino... Information Age History of mining http://data.linkeded... ThoughtPharmacokinetics Statistical theory Clubs and societiesFinancial data analy...Human rights Earth sciences Nonverbal communicat...Organizations Decision theory Periodic table Abstraction http://data.linkeded...Statistical deviatio... Conditionals Academic disciplinesBusiness terms Information technology Psychological testing Interrogative words ... Bayesian statistics Statistical models AnthropologicalSociety cate... MiningInformation science NeuropsychologicalKnowledge representa... a... Articles with incons... Regression analysis Theory of probabilit... Social groups Ecology ClassificationProgramming systems Occupationallanguage... safety ... http://data.linkeded... Diagrams Genetics CollaborationStrategy Computing Articles with exampl...Sampling (statistics) Signal processing Cultural history Economics Process management Computer programming Mathematical logic http://data.linkeded... ComputerInnovation occupationsInnovation economics Urban studiesSociolinguistics and pl...Scientific revolution American inventions Evaluation methods http://data.linkeded... Resources Planninghttp://data.linkeded... Actuarial science Latent variable models Science and technolo...Crowdsourcing International relati... http://data.linkeded... Statistical inference Information technolo... Reading http://data.linkeded... HeuristicsSources of knowledge Comparison of assess... Law enforcement titles http://data.linkeded... VirtualEducational reality Anthropology software Analysis ProbabilityParametric statistics http://data.linkeded... Law enforcementPart-time employment occu... Types of marketing http://data.linkeded...Standards Learning Conceptual models Means Dynamic programming Human–computer inter... Microsoft developmen... Product managementMedical statistics Randomness PublishingLaw enforcement terms Ecosystems http://data.linkeded... Complex dynamics Science Market research Business software Personal life Sociological terms Cognitive science Probability and stat... http://data.linkeded... Educational psycholo... Law enforcement occu... http://data.linkeded... Distance education Conservation Computing terminology Summary statistics http://data.linkeded... SingaporePolice Police ranks Force History of Organizationaleducation behav... Formal sciences Futurology Basic conceptsLogic inand se... statistics Questionnaire constr... Source codeScientifichttp://data.linkeded... method Tool-using species Vectors Articles containing ... Learning management ...Human%E2%80%93comput... Functional analysis Cultural landscapes http://data.linkeded... Communication Research methods Hidden Markov models Virtual learning env... Decision Support Sys... http://data.linkeded...Technical factors of... http://data.linkeded... Megafauna of Australia Number theory Developmental psycho... Expert systems Environmental sciencehttp://data.linkeded... Cardinal numbers Places http://data.linkeded... http://data.linkeded...http://data.linkeded... http://data.linkeded... Architectural termin... http://data.linkeded... PrismatoidZonohedra polyhedra Methodology ProbabilityPhysical theory objects Space-filling polyhe... http://data.linkeded... Critical phenomena Integers Cubes Interaction Mathematical notation Epistemology of scie... Ship construction http://data.linkeded... Platonic solids http://data.linkeded...Computer data Logical consequence Estimation theory Forestry Volume Economic anthropology Environment http://data.linkeded... Interpretation (phil... PredictionStatistical data sets Megafauna of Eurasia Social research http://data.linkeded... Systems science Generalized linear m... Personality http://data.linkeded...Sociological theories Technical communicat... Aptitude http://data.linkeded... Plants http://data.linkeded... Apes Materials science Mathematical modeling http://data.linkeded...Naturalhttp://data.linkeded... language pro... InfographicsScientific modeling http://data.linkeded... http://data.linkeded... MegafaunaTrees of North A... Online gamingOnline games services Artificial intellige... http://data.linkeded... Factorial and binomi... Internet properties ... Information retrieval HumanCosmopolitan geography species FolksonomySecureSocializationPhoto communication sharing http://data.linkeded...Hypertext http://data.linkeded... http://data.linkeded... Sequences and series ArchivalRisk management science Reputation management Medicinal chemistry Econometrics Statistical methods Megafauna of SouthPlotting A... software Discrete distributions Computer1999 introductions art Content management s... X86-64 Linux distrib... http://data.linkeded... Data management Auxiliary sciences o... Real algebraic geome... EducationalRiskFacebook analysisvideoStudent ga... culture Groupware Domain name system Plant morphologyLinux numerical anal... Sensitivity analysis 1988 introductionsSystems ecology Complex systems theory Data analysis Numerical programmin... SocialDocument bookmarking%3C!--Professions--%... managementSemantic Web ... Revolutionary tactics Self-organizationDebian-based distrib... Software architecture Intelligence Real numbers Data analysis software Mathematical series Computer-mediatedNetwork Internetperformance co... access Organizational theory Free software cultur... Quality control tools Megafauna of AfricaIRIX software SecurityHealth Taxonomypromotion Debian http://data.linkeded... Transdisciplinarity Mathematical andhttp://data.linkeded... qua... http://data.linkeded... Probability distribu... Permutations XML-basedComputerBlog networkstandards hostingZoomableInteroperability sec...services User Interf... Mass media Postmodernism Business intelligence Educational psychology http://data.linkeded... Statistical intervals Matter ProprietaryC software cross-pl... InternetSymbianWorkflow privacy softwareSocialVirtual technology constructionism avatars SocialMedia systems technology1993 softwarePsychological attitude English inventions Poisson processesPharmacology http://data.linkeded... Psephology CompaniesPerformance establishe...Cognitive management psychology Social networks Education schoolsStatistical data types StateEducation schools in in the the... Uni... Information systems Matrices Logical fallaciesArray programming la... SuperorganismsTranslation VideostudiesParsing game culture CommunicationNetwork theory theory Data Data structures Categorical data IntelligenceBiological2004 (inform... establishmentssystems ... Cascading Style Sheets Industrial design 1989 introductions Elementary specialMarkov f... models WindowsMobileDeliberative Phone computers Algorithmssoftware methods on strings Network addressing Teacher training Natural sciences Conjugate prior dist... WebsitesBadaTypes whichFilm software andofmirro... library video termi... Web design Web services Types of databases Application software http://data.linkeded... Exponential family d... SoftwareGames development of mentalDigital skill humanities Internet Protocol Cross-platform softw... Computer architecture Reasoning HumanBroadband rightsE-commerceDigital by issue libraries StylesheetHawaiian languages words and p... Community buildingInternational trade Windows word http://data.linkeded...process... http://data.linkeded... Vector spaces UnderlyingSoftwareComputationalLearning design principle...MUD psychology statis... terminology Social informationArticles p... in1983 need software of ... Statistics Graphical models Distributions with Megafaunac... Numerical linear alg... SymbiosisCommunityBlog websites software Computer graphics Information Articles with exampl... CompaniesHome computer establishe...Notetaking software 1969 introductionsMac OSTechnicalAtari X word ST software proces...communicat... Industrial engineering Dimension Stable distributionsAnimals describedCurves in... MarketingProxyPrivatelyRSS serversGlobal research held internet compa... c... comm... IntellectualOpen methodologies propert... Mac OS word processors Quantitative researchQualityCollective intellige... Mathematical sciences Decision trees Bayesian networks Estimation of densit... LifelongComputer learning security so... Internet protocols Holism Machine learning Scottish inventions Invasive mammalStatistics spec... articles ... ComputerLaptopsDeviance security and pr...social ... 2000s in computerSocial sc... networking se... Computer law Geography Classification algor... http://data.linkeded... CompaniesAutomatic basedDigital identifica... in R... media Cloud applications Social media Communication design Data mining Logarithms Representation theor... Articles with exampl... BlackBerryCrimeRights software preventionSociology Buzzwords Internet Experimental physics Charts Relational databaseData modeling ... Environmental health Internet forum termi... Internet ages Web 2.0 World Wide Web http://data.linkeded...http://data.linkeded... Theoretical computer... http://data.linkeded... Representation theory WebPersistentInternet syndication Worlds architecture form... Neologisms Digital technology Prospect theory http://data.linkeded... Learning in computer... Internet memesEnterprise applicati... New media http://data.linkeded... http://data.linkeded... Bilinear operators Windows software Brand management DocumentarySociocultural film tec... global... Qualia Identifiers Algorithms Speech recognition http://data.linkeded... Articles with exampl... Statistical classifi... Science-related lists Sound Microsoft Office http://data.linkeded... http://data.linkeded... Sparse matrices Service-orientedComputing (bu... platforms Networks Control flowDigitalProbability photographyMetadata interpre...http://data.linkeded... Assistive technology Information technolo...Analytical chemistry Waves Radiation health eff... http://data.linkeded... Hearing Euclidean solid geom... History of radio Mutation Statistical charts a... http://data.linkeded... Elementary shapes Websites Television genres ComputabilityAll articles theory lacking... Utility software type http://data.linkeded...Computational resour...Elementary geometry SQL http://data.linkeded... One ArticlesMetric including geometry r... Computer science Articles lacking sou... Complex numbers Web development Database theory Article Feedback 5 DeclarativeArticles programm... with exampl...Applied mathematics http://data.linkeded... LocalLocal government government dis... in ... http://data.linkeded... Web applications Computational science http://data.linkeded... PlacesEconomicsDisk in Berkshire file systems Philosophyof transpo...... disambigu... http://data.linkeded... PopulatedHistory of places telecommu... est... Relational model Integrals Software companiesComputer b... memory Area ComputationalModernism neuros... TownsString in Berkshire similarity me... Economic geography Historical eras Query languages Constraint programming CompaniesCloud establishe...computing StatesNorthern of the United... American co... Reporting Sociocultural evolut... Computers Input/output Electronic circuit v... DOSUnitaryLocal on authoritiesIBM authorities PC compat...... ad...Household income Epidemiology Computer languagesMultivariate statist... Linear operators in ... Cloud platforms MemberCountries statesSubdivisions of bordering NATO of the ...... Radio formats GlobalizationTheories of history Accesshttp://data.linkeded... control Database management ... Markup languages Online educationIBMKennet PC compatiblesand AvonCountries Canal borderingEnglish-speaking ... cou... http://data.linkeded... Logic in computer sc... Website management First-levelIncomeGeographic in administ... the United... informati... Cultural geography History of television UnitedSuperpowersCountry StatesFormer subdivisions... confederationsIdentity management Visualization (graph... Particle physics http://data.linkeded... StatesEarth and sciences territori...School data districts ... Internet search algo... Combinatorial optimi... 1776 establishments ... Logging JustificationData modeling langua... CountriesG8 nationsPopulation bordering density ... SearchGoogle engine optimi... http://data.linkeded... BicontinentalIncome countr... http://data.linkeded... Thermodynamic entropy Library cataloging a... Algebraic structures Analytic geometry Computational comple... Internet companies o... Link analysis Programming language... Pharmaceutical indus... Internet properties ... Databases Data types Secondary education Real-time web http://data.linkeded...Philosophy of therma...Statistical natural ... Combinatorics PrivatelyTwitter held compa... Cartography http://data.linkeded... Geometric measurement Systemic Risk - Beha... filtering 2006Companies establishments basedFree ...inFree S...learning educational suppor... sof... Online chat Collaborative software Free learning manage...Internet culture ApplicationRecommender layer pr... systems Subroutines http://data.linkeded... Open problems Free software http://data.linkeded... Multiple choice Programming constructs Free content managem... Emerging technologies Computational lingui... Artificial intellige... University of Cambri... Free software progra... Internet forums Graph theory Files Computer networking World Wide Web Conso... Computer file systems http://data.linkeded... TelecommunicationsYouth rights e... Clinical research Symmetry Markov processes NP-completeGraph families problems Types of communities Vision Inter-process commun... http://data.linkeded... http://data.linkeded... PerfectGraph graphs connectivity Photography Non-parametric stati... 1971Differential in computer topology sci... ListedEnglish buildings Heritage in ... Computer security ComplexityParity classes ArchaeologyApplied psychology of the U... Maps Exploratory data ana... Experiments Graph data structures BritishTown architecture and country pla... http://data.linkeded... MatrixSingular Differentialnormal value forms decom... geometry Aesthetics Graphic design ClusterGeostatistics analysis Stochastic processes Glossaries of mathem... EducationalConsensus research reality Geodesy ManifoldsMatrix theory Text messaging Algebraic graph theory Chemical kinetics Environmental moveme... SurveyTypes methodology of polling Coordinate systems Neural networks Member states of the... Robust statistics GraphicalFrench user Androidinter...loanwords software Ren%C3%A9Lie algebras Descartes IOS software G20 nations GeometricLie groupstopology Companies listed on ... Federal countries Set families Liberal democracies Technology in society Computer file formats http://data.linkeded...http://data.linkeded... SystemServices administration management ... Software industry ServerServers hardware (computing) Nursing research South Asian countries Variable (computerVideo p...GUI gamewidgets gameplay Software licenses Food and Drug Admini... Article Feedback Bla... BusinessStatistical models outliers InternetHistory search of the engi... Inter... 1985 software Drug discovery Member statesIndia of the... Software distributionGraphicalVideo game user terminology inter... Building engineering Clinical trials StatesG15 nations and territori... Computer networks Advertising publicat... http://data.linkeded...Java platform software MemberCountries states of the...the Ind... Computer jargon Statistical programm... Former BritishBRICS colon... Statistical software Words coinedBlogging in the ... Software engineering Free algebraic struc... LiteraryPolitics andgenres technol... Geometric group theory CombinatorialProperties group of groups ... International rankings UnifiedSystems Modeling Modeling Lan... Lan...Matrix decompositions Blogs Open formats Internet terminology AustralianEthernet televisio... http://data.linkeded... BinaryThermodynamicsNon-standard arithmetic analysis English-languageKohlberg Kravis tel... Robe... Humans FundamentalStock marketanalysis RotationOracleModel acquisitions selection Seven Network Network HTTPprotocols Web browsers ForeignDerivatives exchange (finance)mar... Free software compan...SurfacesPi Networking hardware .NETTasks2002 framework ofsoftware Natural lan... GoogleHistory Earthof mathematicsSoftware2005 software that uses Qt Television channels ... NavigationModels of computationCommodities market Units of informationAngleLexical units AnalysisAsymptotic of algorithms analysis Computer-assistedConcurrencyMachine (compute... translation tr... GeometricLinuxRegression centers Orientationsoftware variableWordsField theory ... PetriFormal nets specification... Binary treesSoftwareTemperature companiesAstrological b... aspects Softwarehttp://data.linkeded... development... MicrosoftGPS applicatio... MathematicsRemoteKeyhole ofsensing infin... MarkupTriangle Langu... centers Trees (data structur... DifferentialBilinearGISOpen forms geometr... file Travel formats Alliance Private equity portf... UnitsBigTableCompanies of linguistic implementat... establishe...Nucleic ... acids Data serialization f... FreewarePrimitive Circlestypes AffineVirtualHistory geometry globes ofClassicalOrbits calculus mechanics Mac OS X software KinematicsCelestialConic sections mechanics Software development... Distributed computin... EuclideanCompanies geometry based in S... Project management s... Freedesktop.orgSoftwareApplication using programm... the M... Operating systems FreeFree graphics windowingX Window software systemsSystem http://data.linkeded... Computer hardware co... MacSpreadsheet OS software software CloudComputer computing companies prov... o... SignsEnterpriseInventory of death modelling AnaerobicCompostingBusinessBiodegradable digestion process waste ... Perimeter securityTelecommunication th... Articles containing ...

Discrete mathematics Network architecture 1969 in computer sci... Hydrology Directed graphs WaterWater WeatherstreamsSociological waves hazards theory SpreadsheetBibliography file filefor... fo... FluvialFlood landformsWeatherHistoriography PresentationXML layer p... BasicWaterPostmodern meteorological...GeomorphologyRivers theory

Composite data types http://data.linkeded... NationalSemiconductorCollier Medal1975 ElectronicsTrophy of compan... establishments Te...IBM recip... companie...... WindowsChildren administration 1896 establishmentsInformationCompaniesAmerican ...theoryUML brands establishe... Partners Olympic sportsStairwaysSafety codes ComputerDisplay storage technologySocial com...Companies classes c... establishe... Childhood Abstract dataSocialism typesComputer security so... MacArchitectural OS user interface elements Multinational1911 Publiclyestablishments compan...PointCompanies tradedMicrosoft of sale comp...... compan... listed on ... SportsFigure IceGardenentertainmentskating dancing features SocialCompanies divisions basedSoftware in W... companies b... DowCompanies JonesTransactionCompaniesCompanies Industrial... in the processing based Dow... establishe... in R...

Blogospheres

HackingSolid mechanics (computerProperty se... Web security exploits DeformationInjectionVitaceae exploits Computer ViticulturesecurityProperty ex... law SocialSecuritySoftware inequalityGrape compliance testingvarieties

Alumni of WoodworkingKeeleWood Univ... People from Stoke-on... The1955 PoguesEnglish births membersguitarists People associatedEnglish banjoists wi... LivingWoodcarving people

Figure 5: Topic coverage of LAK data graph for the individual resources. 5. RELATED WORK [3] G. Cobo, D. García-Solórzano, J. A. Morán, E. Santamaría, Cobo et al.[3] presents an analysis of student participation in on- C. Monzo, and J. Melenchón. Using agglomerative line discussion forums using an agglomerative hierarchical clus- hierarchical clustering to model learner participation profiles tering algorithm, and explore the profiles to find relevant activ- in online discussion forums. In Proc. of the 2nd International ity patterns and detect different student profiles. Barber et al. [1] Conference on Learning Analytics and Knowledge, LAK uses a predictive analytic model to prevent students from failing ’12, pages 248–251, New York, NY, USA, 2012. ACM. in courses. They analyze several variables, such as grades, age, [4] A. Essa and H. Ayad. Student success system: risk analytics attendance and others, that can impede the student learning.Kahn and data visualization using ensembles of predictive models. et al. [7] present a long-term study using hierarchical cluster anal- In Proc. of the 2nd International Conference on Learning ysis, t-tests and Pearson correlation that identified seven behavior Analytics and Knowledge, LAK ’12, pages 158–161, New patterns of learners in online discussion forums based on their ac- York, NY, USA, 2012. ACM. cess. García-Solórzano et al. [6] introduce a new educational mon- [5] E. Gabrilovich and S. Markovitch. Computing semantic itoring tool that helps tutors to monitor the development of the relatedness using wikipedia-based explicit semantic analysis. students. Unlike traditional monitoring systems, they propose a In Proc. of the 20th international joint conference on faceted browser visualization tool to facilitate the analysis of the Artifical intelligence, IJCAI’07, pages 1606–1611, San student progress. Glass [8] provides a versatile visualization tool to Francisco, CA, USA, 2007. Morgan Kaufmann Pub. Inc. enable the creation of additional visualizations of data collections. [6] D. García-Solórzano, G. Cobo, E. Santamaría, J. A. Morán, Essa et al. [4] utilize predictive models to identify learners aca- C. Monzo, and J. Melenchón. Educational monitoring tool demically at-risk. They present the problem with an interesting based on faceted browsing and data portraits. In Proc. of the analogy to the patient-doctor workflow, where first they identify the 2nd International Conference on Learning Analytics and problem, analyze the situation and then prescribe courses that are Knowledge, LAK ’12, pages 170–178, New York, NY, USA, indicated to help the student to succeed. Siadaty et al.[13] present 2012. ACM. the Learn-B environment, a hub system that captures information [7] T. M. Khan, F. Clear, and S. S. Sajadi. The relationship about the users usage in different softwares and learning activities between educational performance and online access routines: in their workplace and present to the user feedback to support future analysis of students’ access to an online discussion forum. In decisions, planning and accompanies them in the learning process. Proc. of the 2nd International Conference on Learning In the same way, McAuley et al. [9] propose a visual analyt- Analytics and Knowledge, LAK ’12, pages 226–229, New ics to support organizational learning in online communities. They York, NY, USA, 2012. ACM. present their analysis through an adjacency matrix and an ad- [8] D. Leony, A. Pardo, L. de la Fuente Valentín, D. S. justable timeline that show the communication-actions of the users de Castro, and C. D. Kloos. Glass: a learning analytics and is able to organize it into temporal patterns. Bramucci et al. [2] visualization tool. In Proc. of the 2nd International presents Sherpa an academic recommendation system to support Conference on Learning Analytics and Knowledge, LAK students on making decisions. For instance, using the learner pro- ’12, pages 162–163, New York, NY, USA, 2012. ACM. files they recommend courses or make interventions in case that [9] J. McAuley, A. O’Connor, and D. Lewis. Exploring students are at-risk. reflection in online communities. In Proc. of the 2nd In the related work, we showed how different perspectives and International Conference on Learning Analytics and the necessity of new tools and methods to make data available and Knowledge, LAK ’12, pages 102–110, New York, NY, USA, help decision-makers. 2012. ACM. [10] P. N. Mendes, M. Jakob, A. García-Silva, and C. Bizer. 6. CONCLUSION Dbpedia spotlight: shedding light on the web of documents. In this paper we presented the main features of the Cite4Me Web In Proc. of the 7th International Conference on Semantic application. Cite4Me makes use of several data sources to provide Systems, I-Semantics ’11, pages 1–8, New York, NY, USA, information for users interested on scientific publications and its 2011. ACM. applications. [11] B. Pereira Nunes, S. Dietze, M. A. Casanova, R. Kawase, Additionally, we provided a general framework on data discov- B. Fetahu, and W. Nejdl. Combining a co-occurrence-based ery and correlated resources based on a constructed feature set, and a semantic measure for entity linking. In ESWC, 2013 consisting of items extracted from reference datasets. It made pos- (to appear). sible for users, to search and relate resources from a dataset with [12] B. Pereira Nunes, R. Kawase, S. Dietze, D. Taibi, M. A. other resources offered as Linked Data. Casanova, and W. Nejdl. Can entities be friends? In For more information about the Cite4Me Web application refer G. Rizzo, P. Mendes, E. Charton, S. Hellmann, and to http://www.cite4me.com. A. Kalyanpur, editors, Proc. of the Web of Linked Entities Workshop in conjuction with the 11th International Semantic 7. REFERENCES Web Conference, volume 906 of CEUR-WS.org, pages 45–57, Nov. 2012. [1] R. Barber and M. Sharkey. Course correction: using analytics ´ ´ ´ ´ to predict course success. In Proc. of the 2nd International [13] M. Siadaty, D. Gaševic, J. Jovanovic, N. Milikic, Z. Jeremic, ´ Conference on Learning Analytics and Knowledge, LAK L. Ali, A. Giljanovic, and M. Hatala. Learn-b: a social ’12, pages 259–262, New York, NY, USA, 2012. ACM. analytics-enabled tool for self-regulated workplace learning. In Proc. of the 2nd International Conference on Learning [2] R. Bramucci and J. Gaston. Sherpa: increasing student Analytics and Knowledge, LAK ’12, pages 115–119, New success with a recommendation engine. In Proc. of the 2nd York, NY, USA, 2012. ACM. International Conference on Learning Analytics and Knowledge, LAK ’12, pages 82–83, New York, NY, USA, [14] C. van Rijsbergen, S. Robertson, and M. Porter. New models 2012. ACM. in probabilistic information retrieval. 1980.