The 9th Swedish Workshop on Data Science
The 9th Swedish Workshop on Data Science (SweDS21) will be hosted by Linnaeus University in Växjö, Sweden, between the 2nd and 3rd of December 2021.
SweDS is a national event with a focus of maintaining and developing Swedish data science research and its applications by fostering the exchange of ideas and promoting collaboration within and across disciplines. This annual workshop brings together researchers and practitioners of data science working in a variety of academic, commercial, industrial, or other sectors.
Past workshops have included presentations from a variety of domains, e.g., computer science, linguistics, economics, archaeology, environmental science, education, journalism, medicine, health-care, biology, sociology, psychology, history, physics, chemistry, geography, forestry, design, and music.
Important dates
- Submission deadline: October 15th, 2021
- Author notification: November 5th, 2021
- Workshop: December 2-3, 2021
Programme
Thursday, December 2nd
13:00-13:30 Opening ceremony
13:30-14:30 Keynote: Prof. Alexandru Telea: Visualizing the black box of machine learning: Challenges and opportunities
14:45-16:30 Paper Session 1 - Chair: Dr. Ricardo Cerri, UFSCar (BR)
- Machine Learning-Assisted Analysis of Small Angle X-Ray Scattering - Piotr Tomaszewski, Shun Yu, Markus Borg, and Jerk Rönnols
- Classifying Fake and Real Neurally Generated News - Anitha Govindaraju and Josephine Griffith
- Predicting Security Vulnerabilities Using Software Code Metrics - Sundarakrishnan Ganesh, Tobias Ohlsson, and Francis Palma
- An Interdisciplinary Web-Based Framework for Data-Driven Placement Analysis of CCTV Cameras - Kenneth Lewenhagen, Martin Boldt, Anton Borg, Manne Gerell, and Johan Dahlén
- Self-Similarity of Twitter Users - Masoud Fatemi, Kostiantyn Kucher, Mikko Laitinen, and Pasi Fränti
16:45-17:30 Ext. Abstract Session 1 – Chair: Dr. Morgan Ericsson, LnU
- Mining for Recurring Themes in Speech Recording Descriptions - Maria Skeppstedt, Magnus Ahltorp, Rickard Domeij, Gunnar Eriksson, and Jenny Öqvist
- Dynamic Ranking of IEEE VIS Author Importance - Daniel Witschard, Ilir Jusufi, and Andreas Kerren
- Understanding Customer Preference for Sustainability Information Using Machine Learning - Sze Yin Kwok, Apoorva Kothapally, Vedasree Reddy Sapatapu, and Veselka Boeva
- Understanding Traffic Cruising Causation via Parking Data Enhancement - Mirza Jasarevic, Veselka Boeva, Fredrik Sjölin, and Per-Olav Gramstad
- Data Center Load Balancing Under Homomorphic Encryption - Rickard Bränvall, Gokturk Yuksek, and Kanad Ghose
17:30-18:30 Social Meetup (online)
Friday, December 3rd
09:00-10:15 Keynote: Dr. Antonina Danylenko: Machine Learning @Spotify: when prediction is not enough
10:30-12:00 Paper Session 2 - Chair: Dr. Zenun Kastrati, LnU
- SBGTool: Similarity-Based Grouping Tool for Students’ Learning Outcomes - Zeynab Mohseni, Rafael M. Martins, and Italo Masiello
- DeepGRASS: Graph, Sequence and Scaled Embeddings on Large Scale Transactions Data - Mahesh Balan Umaithanu, Vignesh Ravichandran, Rohith Srinivaas Mohanakrishnan, and Venkat Subramanian Selvaraj
- Machine Learning for Social Sciences: Stance Classification of User Messages on a Migrant-Critical Discussion Forum - Victoria Yantseva and Kostiantyn Kucher
- A Novel Deep Learning Based Model for Classification of Rice Leaf Diseases - Amartya Bhattacharya
12:00-13:00 Lunch
13:00-14:00 Keynote: Prof. Per Runeson: Open data ecosystems – wishful thinking or successful business?
14:15-15:15 Ext. Abstract Session 2 - Chair: Dr. Rafael M. Martins, LnU
- Mining SCADA Protocol for Anomaly Detection - Mahwish Anwar, Anton Borg, and Lars Lundberg
- Empirical Study: Visual Analytics for Comparing Stacking to Blending Ensemble Learning - Angelos Chatzimparmpas, Rafael M. Martins, Kostiantyn Kucher, and Andreas Kerren
- PUF: Positive and Unlabeled Learning Using Pseudo F Measure - Mahesh Balan Umaithanu, Rohith Srinivaas Mohanakrishnan, and Venkat Subramanian Selvaraj
- Automatic Generation of Individualized Programming Problems - Daniel Toll, Morgan Ericsson, Anna Wingkvist, and Dustin Payne
- Continuous Pattern Mining via Graph-Based Multi-View Clustering - Christoffer Åleskog, Vishnu Manasa Devagiri, and Veselka Boeva
- Model Validation: A Holistic Framework for Model Monitoring - Sundaraparipurnan Narayanan
15:30-16:00 Closing ceremony and Best Paper Award
16:00-17:00 Afterparty (online)
Keynote Speakers
Keynote 1: Professor Alexandru Telea:
Visualizing the black box of machine learning: Challenges and opportunities
Machine learning (ML) has witnessed tremendous successes in the last decade in classification, regression, and prediction tasks. However, many ML models are used, and sometimes even designed, as black boxes. When such models do not operate properly, their creators do not often know what is the best way to improve them. Moreover, even when operating successfully, users often require to understand how and why they take certain decisions to gain trust therein. We present how information visualization and visual analytics help towards explaining (and improving) ML models. These cover tasks such as understanding high-dimensional datasets; understanding unit specialization during the training of deep learning models; exploring how training samples determine the shape of classification decision boundaries; and helping users annotating samples in semi-supervised active learning scenarios.
Bio
Alexandru Telea is a Professor of Visual Data Analytics at the Department of Information and Computing Sciences, Utrecht University. He holds a PhD from Eindhoven University and has been active in the visualization field for over 22 years. He has been the program co-chair, general chair, or steering committee member of several conferences and workshops in visualization, including EuroVis, VISSOFT, SoftVis, and EGPGV. His main research interests cover unifying information visualization and scientific visualization, high-dimensional visualization, and visual analytics for machine learning. He is the author of the textbook "Data Visualization: Principles and Practice" (CRC Press, 2014).
https://webspace.science.uu.nl/~telea001/Main/HomePage
Keynote 2: Dr Antonina Danylenko: Machine Learning @Spotify: when prediction is not enough
Recommendation systems are an essential part of our daily life. Most of the applications support our decision making by selecting the items for our next purchase, next movies to watch, next music to listen to as well as suggestions for new friends on social networks. Traditionally recommendation systems were focused on a simple prediction problem and optimized for immediate reward, e.g. click or stream. However, in many cases recommendation becomes a sequential problem and a long-term reward in the sequence of decisions needs to be taken into account. To account for this we can employ Reinforcement Learning mechanisms, where advances in deep reinforcement learning allows us to apply these methods to massive amounts of data, states and possible actions.
In this talk I will discuss what is Spotify vision when it comes to the future of recommendation systems and what tools and technologies we use to achieve a company mission to unlock the potential of human creativity—by giving a million creative artists the opportunity to live off their art and billions of fans the opportunity to enjoy and be inspired by it.
Bio
Antonina Danylenko holds a PhD in Computer Science from Linnaeus University, Sweden. In her research she combined different Machine Learning algorithms into an algebraic framework and applied it to software engineering problems. After finishing PhD, she started her industrial career as Solution Architect and Data Scientist at IKEA of Sweden where she focused on strategic and hands-on levels of Data Science. Starting from 2018 she worked at Nordic Entertainment group (Viaplay) as a Head of Applied Machine Learning focusing on personalisation of video streaming service. Currently she is a Machine Learning Engineering manager at Spotify and together with her team she works on strategic content promotions.
https://www.linkedin.com/in/antoninadanylenko/?originalSubdomain=se
Keynote 3: Professor Per Runesson: Open data ecosystems – wishful thinking or successful business?
Sharing data between commercial entities is a practice in its infancy, despite the widespread focus on data-driven businesses. Data brokers exist in some domains, but tend to gravitate around the big data players. However, judging from the success of open source software in commercial contexts, there is a potential innovation growth in open data ecosystems (ODE), which we define as a networked community of actors collaborating on data and related resources.
We have studied cases of ODEs, to understand the challenges and remedies for them to function. This talk summarizes our findings regarding
• Value of data and collaboration
• Inherent characteristics of data
• Governance of data, platform and relations
• Evolution of business models and tools
The studied ODEs, in domains of industry 4.0, automotive, transportation, and the labor market, represent wide variation across real-time and batch data, public and private actors etc. They nevertheless share several joint characteristics, particularly, the value creation from the data or collaboration around data, the need for standardization of formats and procedures, aspects of platform ownership and data acquisition, and competition between actors, are essential. Business models and tool support for ODEs are identified for further research.
Bio
Dr. Per Runeson is a professor of software engineering at Lund University, Sweden, and leader of its Software Engineering Research Group (SERG). His research interests include empirical research and collaboration with industry on software development and management methods. He is particularly interested in studies on testing, and the role of open source and open data in software engineering. He is the principal author of ”Case study research in software engineering”, has coauthored ”Experimentation in software engineering”, serves on the editorial board of Empirical Software Engineering, IEEE Transactions on Software engineering, and Software Testing, Verification and Reliability, and is a member of several program committees.
https://cs.lth.se/per-runeson/
Best Paper Award
The Best Paper Award" was granted to "Machine Learning-Assisted Analysis of Small Angle X-Ray Scattering" by Piotr Tomaszewski, Shun Yu, Markus Borg, and Jerk Rönnols.
Workshop chairs
- Dr. Rafael M. Martins (ISOVIS, Linnaeus University)
- Dr. Morgan Ericsson (DISTA, Linnaeus University)
- Prof. Dr. Danny Weyns (AdaptWise, Linnaeus University / KU Leuven, BE)
Call for Contributions
We invite academic and industrial researchers and practitioners to share their work by submitting papers, giving talks, and/or presenting posters. Submissions will be peer-reviewed (single blind process) and selected based on relevance and quality. Contributions should be submitted in two different categories:
- Extended abstracts + Poster: contributions for the non-archival abstract/poster track can be 1 page long. Acceptable submissions for this track include description of in-progress works, previously-published works, student proposals, or novel ideas.
- Full papers: Maximum 6 pages of content (plus up to 2 pages of references) to be included in the planned proceedings. Submissions for this track are required to be novel, previously unpublished works that are not under review for other conferences or publications during the SweDS review cycle.
Sponsors
-
The main sponsor of SweDS 2021 is Fortnox AB. This workshop would not be possible without their contribution!
-
Our main technical sponsor is the Swedish Chapter of the IEEE.
-
The journal Data (MDPI) has been in invaluable partner in the workshop's organization.
Links for more information
- AdaptWise: the research group "AdaptWise"
- DISTA: the research group "Data Intensive Software Technologies and Applications"
- DSM: the research field "Deterministic and Stochastic Modelling"
- ISOVIS: the research group "Information and Software Visualization"
Organizing committee
Program and Proceedings Chair:
- Kostiantyn Kucher (ISOVIS, Linnaeus University / iVis, Linköping University)
Local chairs:
- Christian Engström (DSM, Linnaeus University), and
- Romain Herault (Linnaeus University)
Publicity Chair
- Diana Unander (Linnaeus University)
Programme Committee
- Danilo Coimbra, Federal University of Bahia (BR)
- Fredrik Ahlgren, Linnaeus University (SE)
- György Kovács, Luleå University of Technology (SE)
- Ilir Jusufi, Linnaeus University (SE)
- Jonas Nordqvist, Linnaeus University (SE)
- Kurt Tutschku, Blekinge Institute of Technology (SE)
- Maria Riveiro, Jönköping University (SE)
- Markus Borg, RISE (SE)
- Pablo Jaskowiak, Universidade Federal de Santa Catarina (BR)
- Ricardo Cerri, Federal University of Sao Carlos (BR)
- Tácito Neves, Universidade Federal de Alagoas (BR)
- Willian Watanabe, Technology Federal University of Paraná (BR)
Dec 1-2, 2021: Big Data Conference 2021
Anyone interested in this conference might also be interested in this year's Big Data Conference, at Linnaeus University in Växjö, on December 1-2, 2021: Lnu.se/en/BigData2021.