Automated Online Detection of Popular Christmas Food Items Within Bisnode Countries Using Data Science
Total Page:16
File Type:pdf, Size:1020Kb
Most popular Christmas food in Europe Automated online detection of popular Christmas food items within Bisnode countries using data science. Norway’s top keyword Most popular Christmas food is “ribbe” – a roasted pork dish of Christmas food web “Perunalaatikko”, a search are associated sweetened potato dish to sweets is the most searched 36% for in Finland of Christmas food web 50% search are associated of Austrias top to meat ten keywords are 20% sweets related, with “vanillekipferl” being most popular In Hungary “mézeskalács” is of Christmas food web the most popular search are associated keyword 20% to veggie Switzerland is the only country where the most popular keyword, fondue, is a dairy Croatia’s top keyword is “bakalar” Most popular keywords Sweets Fish 6% vanillekipferl Austria bakalar Croatia Dairy chocolat Belgium kapr Czech Republic Plätzchen Germany riba Macedonia mézeskalács Hungary 18% 29% Sernik Poland Veggie Veggie Sweets toit Estonia Meat perunalaatikko Finland flæskesteg Denmark salata Serbia 18% ribbe Norway Fish Dairy kapustnica Slovakia 29% kranjska klobasa Slovenia fondue Switzerland Meat julskinka Sweden Sweets are the most popular category for half the countries Austria Belgium Switzerland Czech Republic Germany Denmark sweets drink dairy side dish Estonia Finland Croatia Macedonia Norway Poland soup meat fish veggie Sweden Slovakia Serbia Slovenia Hungary The percentage of the categories per country is computed by considering the weight of the top 10 keywords. Sweets – top 10 1. Hungary 2. Poland 3. Sweden 4. Denmark 5. Germany 6. Austria 7. Belgium 8. Slovenia 9. Croatia 10. Norway Meat – top 10 1. Norway 2. Slovenia 3. Belgium 4. Estonia 5. Sweden 6. Slovakia 7. Croatia 8. Denmark 9. Switzerland 10. Serbia Fish – top 10 1. Czech Republic 2. Croatia 3. Macedonia 4. Austria 5. Poland 6. Slovakia 7. Hungary 8. Sweden 9. Finland 10. Belgium Veggie – top 10 1. Finland 2. Serbia 3. Estonia 4. Macedonia 5. Slovakia 6. Czech Republic 7. Denmark 8. Hungary 9. Poland 10. Croatia Methodology 1. Step: acquisition of online content By leveraging search engines, Bisnode Group Analytics gathered in an automated way thousands of christmas food websites for each of the Bisnode Countries. Website structure and content were then auto matically explored and analysed. 2. Step: extraction of most relevant keywords per country Using text mining algorithms, food-related keywords per country where detected. Relying on statistical analysis, most relevant and country specific items have been identified, among millions of key- words extracted from websites. Christmas specificity of keywords is then further identified by ana- lysing their web-search popularity over time during the last 5 years (see side picture). Keywords that show a yearly spike at christmas are kept, while the ambiguous ones are dropped. 3. Step: assessing popularity Web-search popularity associated to each keyword is then used 2013-11 2015-11 2017-11 2014-11 2016-11 2018-11 2015-05 2017-05 2014-05 2015-02 2015-08 2016-05 2017-02 2017-08 2018-05 to normalise popularity of keywords per country and to rank them. 2014-02 2014-08 2016-02 2016-08 2018-02 2018-08 This normalised popularity is the score we use for all subsequent analyses. julskinka knäck lussekatter Data to drive you forward.