MACHINE LEARNING / DATA SCIENCE / DATA ENGINEERING
April 5, 2023 @ Svenska Mässan

ABOUT THE CONFERENCE

GAIA organizes a one-day conference for people interested in artificial intelligence and all things data. The aim is to create an environment for learning, networking, and knowledge sharing among individuals, organizations, and academia around these common interests. The conference focuses on applied machine learning and real-world data science. It introduces diverse content from enthusiastic domain experts and typically covers what is happening within the field in Gothenburg as speakers often have local connections.

Our last conference can be seen here, and we’re currently putting together GAIA Conference 2023. See you there!

WHAT TO EXPECT

Inspiration and knowledge

Fascinating talks will be held by representatives from academia and many different industries. We expect to get inspired and learn about techniques, strategies, and tools commonly used by people in the field. We hope to leave the conference with a long list of new things to explore further!

Location

This year, we are once again hosting the conference at Svenska Mässan, conveniently located near the Korsvägen stop. As usual, this prime venue will allow us to bring plenty of attendees, partners, and startups together.

Food and drinks

Of course, food and drinks are included in the ticket price. We’ll provide you with breakfast, lunch, and fika with infinite amounts of coffee and tea so that you can stay sharp during the whole day. We recommend you plan for some extra time after the closing remarks as we finish the day with bubbles!

One great conference

We’re honored to have so many representatives from Gothenburg sharing their knowledge and thoughts. They’ll tell us more about what’s happening on the west coast, and you’ll have a chance to meet with other local enthusiasts with similar problems and interests.

SCHEDULE

Breakfast

08:00

Opening Remarks

by Jakob Andersson

Chairman | GAIA

09:00

GPT-SW3: The First Large Generative Language Model for the Nordic Languages

by Magnus Sahlgren

Head of Research, NLU | AI Sweden

09:15

Finding the Bad Guys with AI: Connecting the Dots In Data

by Christian Berg, Mats Kvarnström

Founder, Head of AI | Paliscope

09:45

Break

10:15

Modelling Ecosystems with Deep Reinforcement Learning

by Claes Strannegård

Associate Professor | University of Gothenburg

10:30

Few-Shot Learning for Health: How Sleep Cycle Separate Multiple People’s Snoring

by Maria Larsson

Machine Learning Engineer | Sleep Cycle

11:00

Unlocking Hearts by Locking Models: Solving Multiple Echocardiogram Problems Using a Holistic Self-Supervised Approach

by Anders Hildeman

Senior Data Scientist | AstraZeneca

11:30

Lunch Break

12:00

Leveraging Open-Source Tools for Building a Quality Data Warehouse

by Matteo Molteni

Data Engineer | Kognic

13:30

Cross-modal Transfer Between Vision and Language for Protest Detection

by Ria Raj, Kajsa Andreasson

Software Engineer, Software Engineer | Recorded Future

14:00

Teaching Neural Networks a Sense of Geometry

by Jens Agerberg

ML Researcher, Data Scientist | KTH, Ericsson

14:30

Break

15:00

Navigating the Challenges of Offer Categorization in E-commerce: Strategies for Accurate and Scalable Classification

by Zeeshan Dar

Senior Machine Learning Engineer | Prisjakt

15:30

Clean Architecture: How to Structure Your ML Projects to Reduce Technical Debt

by Laszlo Sragner

Founder | Hypergolic

15:55

Training and Using a Large Language Model to Write Theatrical Performances

by Kajsa Norin

NLP Consultant | Kajsa Norin AB

16:20

Closing Remarks

by Josef Lindman Hörnlund

Board Member | GAIA

16:45

Should We Teach Machine Learning Systems?

by Jussi Karlgren

Docent in Language Technology | Helsinki University

09:45

Break

10:15

An Interdisciplinary Expert Pool for Natural Language Understanding

by Francisca Hoyer

Strategic Program Manager, NLU | AI Sweden

10:30

Panel Discussion on Generative AI

by Aron Lagerberg, Francisca Hoyer, Rebecca Oskarsson, Sylvie Saget

11:00

P-Tuning: A Parameter Efficient Tuning to Boost LLM Performance

by Zenodia Charpy

Senior Data Scientist | NVIDIA

11:30

Lunch Break

12:00

Vision-Based Activity Detection Inside the Vehicle

by Mattias Ulmestrand

Technical Developer | Smart Eye

13:30

Synthetic Data for Computer Vision Problems, When Real Annotation Is Too Expensive

by Abgeiba Isunza Navarro

Machine Learning Engineer | Modulai

14:00

Deep Learning for Self-Driving Cars

by Christoffer Petersson

AI Researcher | Zenseact

14:30

Break

15:00

Federated Machine Learning: A Scalable, Privacy-Preserving Approach Ready for Production Environments

by Salman Toor

CTO | Scaleout Systems

15:30

Using AI to Preserve Privacy in Healthcare

by Alma Lund, Marina Delatorre

Machine Learning Engineer, Data Scientist @ Mabel AI

15:55

Building a Data Science Program in Higher Vocational Education

by Kokchun Giang

Educator | IT-Högskolan

16:20

SPEAKERS

Maria Larsson

Machine Learning Engineer @ Sleep Cycle

Few-Shot Learning for Health: How Sleep Cycle Separate Multiple People’s Snoring

How do you distinguish snoring sounds from two or more people sharing a bedroom? And why is it important? With several million users choosing to share their data with Sleep Cycle, we are in a unique and privileged position to deepen our understanding of the impact snoring can have on our sleep quality and, by extension, our well-being. Our vast dataset has allowed us to train a snore embedding network that can differentiate between different people’s snoring, regardless of how many people or pets(!) are sharing a bedroom.

Snoring, often dismissed as a nuisance only to the person sharing the room, can be a sign of obstructive sleep apnea and is linked to a wide range of other negative health-related issues. Providing the user with a comprehensive sleep analysis and letting them know where the snoring is coming from is the first step for them to implement the proper measures for their sleep health and overall well-being.

Biography

Maria Larsson is a leading machine learning engineer in the Sleep Cycle R&D team that develops the market-leading sleep tracking solution worldwide, with millions of daily active users and over two billion nights analyzed in more than 150 countries. Sleep Cycle aims to improve global health by empowering people to sleep better. Apple recently recognized Maria for her outstanding contributions to the iOS app economy as part of their “Simply Outstanding Women” segment.

 

Zenodia Charpy

Senior Data Scientist @ NVIDIA

P-Tuning: A Parameter Efficient Tuning to Boost LLM Performance

As more LLMs become available, industries need techniques for solving real-world natural language tasks. It has been shown that model prompting methods can elicit good zero– and few-shot performance from LLMs and help yield quality results on various downstream natural language processing (NLP) tasks. However, there is a limit to it. In this talk, we will demonstrate how to adapt p-tuning, a prompt-learning method, to low-resource language settings. We use an improved version of p-tuning implemented in NVIDIA NeMo that enables the continuous multitask learning of virtual prompts. In particular, we focus on adapting our English p-tuning workflow to Swedish.

Biography

 

 

Salman Toor

CTO @ Scaleout Systems

Federated Machine Learning: A Scalable, Privacy-Preserving Approach Ready for Production Environments

Federated machine learning has created new possibilities for privacy-preserving data analysis. This is a thriving area of research that primarily focuses on the algorithmic details and communication overheads necessary to train accurate models. Despite significant progress in the field, production-grade federated machine learning frameworks that address essential properties such as security, data privacy, scalability, fault tolerance, and performance in geographically distributed settings have yet to be available to ML engineers.

This talk will highlight the core concepts and features necessary for developing federated learning platforms for production environments at Enterprises. Furthermore, it will briefly cover two active use cases demonstrating the potential of regulated datasets in geographically distributed settings.

Biography

Salman holds a Ph.D. in scientific computing and is an expert on federated machine learning (FedML), scientific data management, scalability, and distributed computing infrastructure performance (DCI). He is the Chief Technology Officer (CTO) at Scaleout Systems and an Associate Professor at Uppsala University, where he researches e-infrastructures.

 

Laszlo Sragner

Founder @ Hypergolic

Clean Architecture: How to Structure Your ML Projects to Reduce Technical Debt

Software engineering principles are frequently mentioned as a solution to data science’s productivity problem. Unfortunately, rarely in a comprehensive format to be actionable or adopted for data-intensive use. In this talk, I will present the Clear Architecture framework that enables practitioners to structure their projects and manage changes throughout their lifecycles. The audience will also learn about a minimum set of programming concepts to make this a reality. The key takeaway is that, as a data scientist, you can take care of your codebase with only a few techniques and a little effort.

Biography

Formerly he was Head of Data Science at Arkera, a fintech startup in London, where he built market intelligence products with Natural Language Processing for Tier 1 investment banks and hedge funds. Prior to that, Laszlo worked in mobile gaming for King Digital (makers of Candy Crush), specializing in player behavior and monetization. He started his career as a quant researcher implementing trading strategies at multiple investment managers.

Christian Berg

Founder @ Paliscope

Finding the Bad Guys with AI: Connecting the Dots In Data

Paliscope is a Swedish company developing software for analyzing large amounts of data like tables, images, text, audio, and video with analysis methods like object detection, scene classification, face recognition, natural language processing, and speech-to-text. Since everything is connected somehow, finding the links, whether they stem from chat logs, speech, or social media posts, is essential. AI is critical in connecting the dots as everyone otherwise would be overloaded with information. Paliscope products are primarily used by law enforcement and other government agencies in several countries. During the presentation, we will demonstrate a couple of use cases for how the software Paliscope YOSE is being used and focus on a couple of technologies used within the tool.

Biography

Christian previously co-founded NetClean and Griffeye, two global leaders in their respective fields, before founding Paliscope. He has a background from Chalmers School of Entrepreneurship.

 

Alma Lund

Machine Learning Engineer @ Mabel AI, Syntronic

Using AI to Preserve Privacy in Healthcare

Language barriers in health care are a major obstacle to national and global sustainable development goals for public health, health equality, and integration. Interpreters in healthcare is a non-regulated industry, with less than 1 % of the interpreters being certified in medical interpretation in Sweden. Often the interpreter knows the patient, making it difficult for the patient to have a private conversation with the doctor. In this talk, we will look at how modern NLP and speech technology can be used to create a secure and private interpretation system. We will go through our challenges in making such a system with the first version designed to help the Ukrainian refugees seeking healthcare across the EU. We will particularly look into dealing with noisy environments and biases in open-source datasets.

Biography

Alma Lund is a machine learning engineer at Mabel AI, where she brings the AI models closer to the people who need them the most. She has a background in engineering physics and computational sciences from Chalmers University of Technology and has a strong interest in creating frameworks for sustainable and inclusive AI.

Francisca Hoyer

Strategic Program Manager, NLU @ AI Sweden

An Interdisciplinary Expert Pool for Natural Language Understanding

The artifacts used as training data in Machine Learning have their own historical, social, and cultural contexts – the majority of them were not produced to be used as training data for machine learning to make future predictions. Instead, they embody, represent, and reinforce a broad range of ideas and values, power relations, stereotypes and prejudices, and systematic forms of discrimination situated in their own places in time and space. Large Language Models “learn” languages by reading large amounts of these human-produced texts. Thereby they also “learn” the ideas and values, power relations, stereotypes and prejudices, systematic forms of discrimination, etc., inscribed in the training data. Therefore, questions of data, representativeness and fairness, equality, and ethics must be central to the development of language technologies. In this presentation, AI Sweden presents a new form of cross-disciplinary collaboration that aims to tackle these issues in concrete use case scenarios: The interdisciplinary expert pool for Natural Language Understanding, a platform for knowledge exchange between humanities scholars, social scientists, civil society organizations, and AI development teams.

Biography

Francisca co-leads the Natural Language Understanding Initiative at AI Sweden, the Swedish national center for applied AI. Originally a trained historian, Francisca moved into the field of AI to leverage her experiences for responsible innovation and social good. She holds a PhD from Uppsala University. Her research interests include global and gender history and span questions such as power relations and representation in historical archives. She is passionate about pulling together innovation projects that engage stakeholders who have traditionally not been involved in developing new technologies and diversifying the AI pipeline with domain experts from the humanities.

Jussi Karlgren

Docent in Language Technology @ Helsinki University

Should We Teach Machine Learning Systems?

When we find that AI-powered systems behave unexpectedly or inappropriately, we often discuss how we can retrain and find or generate better or unbiased data. This is because we have a general hands-off perspective on learning: we expect a system to build a better model if the learning process is left to make sense of input data with minimal supervision and guidance.

This talk will discuss the place of feature engineering and instruction in machine learning to mitigate some of the observed challenges of providing users with high-quality and reliable output and contrast it with today’s approach.

Biography

Jussi Karlgren has worked for many years in industrial and academic research on language technology and information retrieval research, mainly at SICS, Gavagai, and, most recently, Spotify. His research interests are in the knowledge representation of language, understanding how language usage changes over time and new modes of communication and studying stylistic choice in linguistic expression.

 

Magnus Sahlgren

Head of Research, NLU @ AI Sweden

GPT-SW3: The First Large Generative Language Model for the Nordic Languages

This talk gives an overview of the process of building the first large generative language model for Swedish. We cover the motivation for building the model, as well as challenges and opportunities with data and compute. We also give examples of applications of the model and discuss future directions for building and deploying large language models for smaller languages.

Biography

Magnus Sahlgren, PhD, is Head of Research for Natural Language Understanding at AI Sweden. Sahlgren’s research lies at the intersection between computational linguistics, philosophy, and artificial intelligence. He is primarily known for his work on computational models of meaning.

 

Marina Delatorre

Data Scientist @ Mabel AI

Using AI to Preserve Privacy in Healthcare

Language barriers in health care are a major obstacle to national and global sustainable development goals for public health, health equality, and integration. Interpreters in healthcare is a non-regulated industry, with less than 1 % of the interpreters being certified in medical interpretation in Sweden. Often the interpreter knows the patient, making it difficult for the patient to have a private conversation with the doctor. In this talk, we will look at how modern NLP and speech technology can be used to create a secure and private interpretation system. We will go through our challenges in making such a system with the first version designed to help the Ukrainian refugees seeking healthcare across the EU. We will particularly look into dealing with noisy environments and biases in open-source datasets.

Biography

Marina Delatorre has a background as a psychologist and a PhD in Psychology. She currently works as a data scientist at Mabel AI. She uses her expertise in psychology to understand the nuances of communication in healthcare settings and ensure that speech recognition and translation models can capture the diversity of environments and people who use them.

Zeeshan Dar

Senior Machine Learning Engineer @ Prisjakt

Navigating the Challenges of Offer Categorization in E-commerce: Strategies for Accurate and Scalable Classification

The accurate categorization of offers is a prominent challenge in the ecommerce industry. Effective offer categorization is critical to ensuring that products are displayed to the right customers and can provide valuable insights into customer behavior and preferences. However, categorizing offers at scale can be especially challenging when dealing with large product catalogs. To address this, it is important to use a robust system that can accurately categorize offers based on factors such as product attributes, price, and to leverage automation tools such as machine learning algorithms where possible.

Offer categorization can also help with search relevance, sorting offers to products, and facilitating easier indexing by search engines such as Google. Mapping offers to Google’s taxonomy, which comprises of nearly 6000 hierarchical classes, can quickly become challenging when working at scale. To ensure consistent categorization across different systems and platforms, it is important to establish clear taxonomy standards and to continually review and adjust the categorization system to ensure that it remains accurate and effective.

In this talk, we will discuss the challenges encountered in categorizing offers, the strategies we employed to address scaling and categorizing concerns, as well as the issues and limitations we overcame to achieve success.

Biography

Zeeshan discovered his passion for machine learning during his Bachelor’s studies. After founding his own company, he gained further knowledge and experience by working as a Data Scientist at a consultant company for three years. He pursued a Master’s degree at Chalmers University in Sweden, with a focus on machine learning, and subsequently worked as a data science consultant for a prominent autonomous vehicle company for nearly three years. Currently, he works as a senior machine learning engineer at Prisjakt, where he brings extensive expertise to develop innovative and effective products.

 

Mattias Ulmestrand

Technical Developer @ Smart Eye

Vision-Based Activity Detection Inside the Vehicle

As distracted drivers cause many road fatalities, activity detection is crucial to automotive in-cabin sensing. In today’s cars, cameras are used to monitor the driver’s gaze. But while gaze aversion is a critical factor of driver distraction, studies show that the risk increases based not only on what you look at but how you engage.

Robust activity detection in a car requires the fusion of a multitude of signals that each have their unique strength. For example, body movement over time can show that a person is engaged in a secondary, non-driving-related task. To further understand the scene, this will have to be combined with an appearance-based algorithm, such as a deep convolutional neural network that detects interaction with objects. On top of this whole cabin view, the well-established driver monitoring signals can further strengthen insight into occupant activity, such as detecting speaking by analyzing lip movement or gaze direction, which can indicate talking or texting on the phone.

This talk will give insight into the steps in developing such an activity detection system – from data collection and annotation to integration to a System on Chip. Examples will be presented in addition to some useful tips and tricks when developing various detection algorithms.

Biography

Mattias works as a technical developer at Smart Eye, developing deep learning models for computer vision on embedded environments. Prior to working at Smart Eye, Mattias studied Engineering Physics with a master’s in Complex Adaptive Systems at Chalmers University of Technology and ETH Zürich. In his spare time, Mattias enjoys working on personal projects in Machine Learning or related subjects, as well as playing and composing music.

 

Matteo Molteni

Data Engineer @ Kognic

Leveraging Open-Source Tools for Building a Quality Data Warehouse

In recent years the modern data stack has seen a great increase in the number of tools available on the market, making the landscape overwhelming and difficult to navigate. In this presentation, we will give an overview of how Kognic built a data warehouse with quality as its core value. We will present our architecture and explain how we leverage some popular open-source tools in our setup, highlighting their main features and shortcomings. Particular emphasis will be put on how data is transformed for the data mart using a test-driven approach that aims to stay true to software-development best practices.

Biography

Matteo is a Data Engineer at Kognic, where, together with the Data Enablement team he provides infrastructures and services for the company’s analytical needs. With a background in Computational Mathematics, Matteo spent numerous years working in the realm of both Data Science and Data Engineering. In 2022, he made a complete transition to Data Engineering by joining Kognic.

 

Kokchun Giang

Educator @ IT-Högskolan

Building a Data Science Program in Higher Vocational Education

As the industry is becoming more data-driven, hiring roles with more data-related skill sets, such as data scientists, data engineers, and data analysts, is necessary. To meet this need in the industry, we have built a two-year data science program in higher vocational education together with industry experts. The program contains courses in programming, data processing, mathematics, statistics, machine learning, deep learning, data engineering, and more. Besides these theoretical courses, there are two internship courses where students apply their skills to real data science problems in different companies.

The program is also built using a modern approach to education that follows the trend in society. For example, we have created a Discord community for each cohort, alum, and industry partner. In this way, the students can share experiences and knowledge and help each other, becoming each other’s learning resources. The students also use GitHub throughout their education, building portfolios to share with future employers.

Biography

Kokchun is a data scientist and teacher who strives to inspire people to pursue the beauty of programming and mathematics. He is constantly sharpening his technological and pedagogical skills to do this successfully. His pedagogical strategy is based on a combination of structure from special pedagogy and clear visualizations for storytelling. Currently, he works at IT-Högskolan, where he built the data science program with industry partners.

 

Mats Kvarnström

Head of AI @ Paliscope

Finding the Bad Guys with AI: Connecting the Dots In Data

Paliscope is a Swedish company developing software for analyzing large amounts of data like tables, images, text, audio, and video with analysis methods like object detection, scene classification, face recognition, natural language processing, and speech-to-text. Since everything is connected somehow, finding the links, whether they stem from chat logs, speech, or social media posts, is essential. AI is critical in connecting the dots as everyone otherwise would be overloaded with information. Paliscope products are primarily used by law enforcement and other government agencies in several countries. During the presentation, we will demonstrate a couple of use cases for how the software Paliscope YOSE is being used and focus on a couple of technologies used within the tool.

Biography

Mats Kvarnström is Head of AI at Paliscope. He has a PhD in Mathematical Statistics from Chalmers and has worked with data analysis and machine learning in many different application areas before starting at Paliscope, from image analysis in cell biology to biostatistics in the pharmaceutical industry and probe-sourced maps for autonomous driving.

Kajsa Andreasson

Software Engineer @ Recorded Future

Cross-modal Transfer Between Vision and Language for Protest Detection

Multimodal data (data with two or more modalities like text, images, or audio) has gained more and more attention in the last year. One example is the image-generating model DALL·E 2, released by OpenAI, which uses the modalities of images and text. Even though multimodal models have proven to have great potential, most of today’s systems for socio-political event detection are text-based.

In this presentation, we discuss a proposed approach of using the increasing amount of multimodal data to decrease the need for annotation – as presented in our paper “Cross-modal Transfer Between Vision and Language for Protest Detection.” We propose a method that utilizes existing annotated unimodal data to perform zero-shot event detection in another data modality. Specifically, we focus on protest detection in text and images and show that a pretrained vision-and-language alignment model (CLIP) can be leveraged to this end. In particular, our results suggest that annotated protest text data can act supplementarily to detect protests in images, but significant transfer is also demonstrated in the opposite direction.

Biography

Kajsa is a software engineer in the Text Analytics team at Recorded Future with a big passion for natural language processing, artificial intelligence techniques, and how they can be used to make our world a better and safer place. She recently graduated from the master’s programme Complex Adaptive Systems at Chalmers, and on a day-to-day basis, she loves implementing state-of-the-art methods most efficiently.

 

 

Anders Hildeman

Senior Data Scientist @ AstraZeneca

Unlocking Hearts by Locking Models: Solving Multiple Echocardiogram Problems Using a Holistic Self-Supervised Approach

Supervised training of deep learning models for medical imaging applications requires a significant amount of labeled data, which can be challenging to acquire as it necessitates the expertise of highly skilled medical professionals to annotate a large number of image sequences. To overcome this limitation, we propose the Adaptive Locked Agnostic Network (ALAN) concept for label-free, self-supervised learning of visual features. The advantage of this methodology is that the self-supervised training can be performed once on a large unlabeled dataset, and the features discovered can then be used on a plethora of subsequent downstream tasks. Since these self-engineered features have rich informational content, downstream tasks can be solved with more straightforward and less data-hungry methods than in fully-supervised approaches. In this paper, we perform two typical downstream tasks for echocardiogram data: segmentation of anatomical regions and classification of the echocardiogram views. We evaluate the proposed self-supervised approach on three publicly available echocardiogram datasets: EchoNet-Dynamic, CAMUS, and TMED-2.

Biography

Anders holds a PhD in Spatial Statistics, an MS in engineering mathematics, and a background in oceanography and remote sensing. He has gained extensive experience with the predictive modeling of digital images and geospatial data. In his professional role, he is working with general data science problems within pharma and research in the field of medical image analysis, mainly focusing on not focusing at all, i.e., exploring as many methods and datatypes as he can.

 

 

Abgeiba Isunza Navarro

Machine Learning Engineer @ Modulai

Synthetic Data for Computer Vision Problems, When Real Annotation Is Too Expensive

Deep learning has been successfully used for various computer vision problems. However, models often need large amounts of data to solve a specific task effectively. Unfortunately, there is often a bottleneck to accessing labeled data. With the advancement in graphic tools, simulator engines, and generative models, synthetic data generation has become a suitable alternative to approach the lack of data in specific domains.

We will talk about the current development in synthetic data generation and how we have used it to build good models for computer vision tasks based primarily on synthetic data. We will go through the different use cases, explain the tools used and give an insight into how we approach a computer vision problem when we lack a large amount of labeled data.

Biography

Abgeiba is a machine learning (ML) engineer at Modulai, an ML consultancy firm in Sweden. As part of her role as an ML engineer, she has worked on various projects applying and developing AI products for different industries. Prior to joining Modulai, she worked on ML projects at Ericsson and BBVA banking. Abgeiba holds an M.Sc. in Machine Learning from KTH, Sweden, and a B.Sc. in Electronics and Telecommunications from Tecnológico de Monterrey, Mexico.

Ria Raj

Software Engineer @ Recorded Future

Cross-modal Transfer Between Vision and Language for Protest Detection

Multimodal data (data with two or more modalities like text, images, or audio) has gained more and more attention in the last year. One example is the image-generating model DALL·E 2, released by OpenAI, which uses the modalities of images and text. Even though multimodal models have proven to have great potential, most of today’s systems for socio-political event detection are text-based.

In this presentation, we discuss a proposed approach of using the increasing amount of multimodal data to decrease the need for annotation – as presented in our paper “Cross-modal Transfer Between Vision and Language for Protest Detection.” We propose a method that utilizes existing annotated unimodal data to perform zero-shot event detection in another data modality. Specifically, we focus on protest detection in text and images and show that a pretrained vision-and-language alignment model (CLIP) can be leveraged to this end. In particular, our results suggest that annotated protest text data can act supplementarily to detect protests in images, but significant transfer is also demonstrated in the opposite direction.

Biography

Ria has a background in automation, mechatronics, and machine learning. She works as a software engineer in the Threat Intelligence team at Recorded Future. Her interest in AI came from learning about natural language processing, and today she calls herself an enthusiast, both professionally and personally. She hopes to be a part of using ML and AI to make our world a safer place!

 

 

Christoffer Petersson

AI Researcher @ Zenseact

Deep Learning for Self-Driving Cars

The mission of Zenseact is to develop a world-leading autonomous driving software platform for consumer vehicles, with the primary goal of dramatically reducing the number of traffic fatalities and injuries around the world. I will discuss how we use deep learning to reach this goal, some of the key challenges we see, and how we plan to expand the use of learning-based algorithms. I will also review some of our recent and ongoing deep learning-related research activities.

Biography

On the research side, Christoffer Petersson leads and supervises a number of deep learning-related research activities at Zenseact and is Adjunct Associate Professor in Machine Learning at Chalmers. On the product side, he works towards expanding the use of deep learning in the Zenseact software stack and automating the training data generation. Christoffer did a PhD in Theoretical Physics at Chalmers, and after research positions in Physics at CERN, in Madrid and Brussels, and returning to Chalmers as Associate Professor in Physics, he switched gears and joined Zenuity (from which Zenseact originates) in 2017 and has since then worked on deep learning and computer vision in both product development and research.

Jens Agerberg

ML Researcher, Data Scientist @ KTH, Ericsson

Teaching Neural Networks a Sense of Geometry

By taking neural networks back to the school bench and teaching them some elements of geometry and topology, we can build algorithms that can reason about the shape of data. Surprisingly these methods can be useful not only for computer vision but in a wide range of applications, such as evaluating and improving the learning of embeddings or the distribution of samples originating from generative models. This is the promise of the emerging field of Topological Data Analysis (TDA) which we will introduce and review recent works at its intersection with machine learning.

Biography

While working as a data scientist at Ericsson, Jens is also pursuing a PhD in machine learning at KTH within the WASP program. He believes that teaching computers a sense of geometrical recognition and reasoning is a promising direction if we want to develop more powerful AIs. One way to do so is to expand the mathematical toolkit underlying machine learning to include math from less well-known fields such as computational geometry and topology. Jens will discuss some of the results and challenges in doing so.

 

Claes Strannegård

Associate Professor @ Department of Applied IT, University of Gothenburg

Modelling Ecosystems with Deep Reinforcement Learning

I will present the ecosystem simulator Ecotwin (www.ecotwin.se), which uses three-dimensional terrain models based on real geographical data together with individual animal models controlled by deep reinforcement learning. Moreover, I will illustrate how this simulator can be used for studying pristine ecosystems as well as ecosystems affected by human economic activities such as exploitation of natural resources, land use change, pollution, and climate change. The main benefit of the simulator is that it can be used as an analytical tool at the planning stage for understanding local ecological consequences of a range of economic activities, thus ensuring relatively eco-friendly decision-making.

Biography

Claes Strannegård is an associate professor at the department of Applied IT at the University of Gothenburg.

 

Kajsa Norin

NLP Consultant @ Kajsa Norin AB

Training and Using a Large Language Model to Write Theatrical Performances

Will all writers have their own AI in the future? This project explores the possibility of training a large language model on a writer’s previous material in order to create an AI that acts as a creative assistant. In recent years, there has been a growing interest in using artificial intelligence to generate creative content. One such application is the use of large language models (LLMs) to write theatrical performances.

In this talk, we will review the technical aspects of training and using an LLM to generate scripts for theatrical performances. We will discuss the challenges of working with natural language data, the techniques used to preprocess and clean the data, and the methods used to train and fine-tune the LLM as well as deploying it. A performance written by the model and writer in collaboration will premiere in Stockholm at Dansens Hus in November 2023.

Biography

Kajsa has been working in the data science field for the past eight years, focusing on NLP and language technology. Lately, she has dived deeper into similarity search and generative AI models. As a machine learning consultant, Kajsa now helps her clients build intelligent solutions and utilize data to enhance their products.

CONTACT

Program and speakers

Are you wondering about the schedule or any of our sessions? Send an email to program@gaia.fish.

Questions or comments

Do you have any other questions or comments to us, send an email to conf@gaia.fish and we promise to help you!

Partnerships

Want to support the conference by becoming a partner? Reach out to us on partnership@gaia.fish for more information.

© 2023 GAIA

conf@gaia.fish