Difference between revisions of "Community/training"

From Nordic Language Processing Laboratory
Jump to: navigation, search
(Participants)
(Programme)
 
(109 intermediate revisions by 3 users not shown)
Line 1: Line 1:
'''HPLT & NLPL Winter School on Large-Scale Language Modeling and Neural Machine Translation with Web Data'''
+
'''HPLT & NLPL Winter School on Large Language Models: Creation, Customization, Evaluation, and Use'''
  
[[File:skeikampen.2020.png|center]]
+
[[File:Skeikampen.2023.jpg|center]]
  
 
= Background =
 
= Background =
  
After a two-year pandemic hiatus, the NLPL network and Horizon Europe
+
Since 2023, the NLPL network and Horizon Europe
project ''High-Performance Language Technologies'' (HPLT) join forces
+
project ''[https://hplt-project.org High-Performance Language Technologies]'' (HPLT)
to re-launch the successful winter school series on large-scale NLP.
+
have joined forces to organize the successful winter school series on Web-scale NLP.
 
The winter school seeks to stimulate ''community formation'',
 
The winter school seeks to stimulate ''community formation'',
i.e. strengthening interaction and collaboration among Nordic and
+
i.e. strengthening interaction and collaboration among
 
European research teams in NLP and advancing a shared level of knowledge
 
European research teams in NLP and advancing a shared level of knowledge
 
and experience in using high-performance e-infrastructures for large-scale
 
and experience in using high-performance e-infrastructures for large-scale
 
NLP research.
 
NLP research.
The 2023 edition of the winter school puts special emphasis on
+
The 2024 edition of the winter school puts special emphasis on
 
NLP researchers from countries who participate in the EuroHPC
 
NLP researchers from countries who participate in the EuroHPC
 
[https://www.lumi-supercomputer.eu/lumi-consortium/ LUMI consortium].
 
[https://www.lumi-supercomputer.eu/lumi-consortium/ LUMI consortium].
 
For additional background, please see the archival pages from the
 
For additional background, please see the archival pages from the
 
[http://wiki.nlpl.eu/index.php/Community/training/2018 2018],
 
[http://wiki.nlpl.eu/index.php/Community/training/2018 2018],
[http://wiki.nlpl.eu/index.php/Community/training/2019 2019], and
+
[http://wiki.nlpl.eu/index.php/Community/training/2019 2019],
[http://wiki.nlpl.eu/index.php/Community/training/2020 2020]
+
[http://wiki.nlpl.eu/index.php/Community/training/2020 2020], and
 +
[http://wiki.nlpl.eu/index.php/Community/training/2023 2023]
 
NLPL Winter Schools.
 
NLPL Winter Schools.
  
For early 2023, HPLT will hold its winter school from Monday, February 6, to
+
For early 2024, HPLT will hold its winter school from Sunday, February 4, to
Wednesday, February 8, 2023, at a
+
Tuesday, February 6, 2024, at a
 
[https://www.thonhotels.com/our-hotels/norway/skeikampen/ mountain-side hotel]
 
[https://www.thonhotels.com/our-hotels/norway/skeikampen/ mountain-side hotel]
 
(with skiing and walking opportunities) about two hours north of Oslo.
 
(with skiing and walking opportunities) about two hours north of Oslo.
 
The project will organize group bus transfer from and to the Oslo
 
The project will organize group bus transfer from and to the Oslo
airport ''Gardermoen'', leaving the airport at 9:30 on Monday morning
+
airport ''Gardermoen'', leaving the airport at 9:45 on Sunday morning
and returning there around 17:30 on Wednesday afternoon.
+
and returning there around 17:30 on Tuesday afternoon.
  
 
The winter school is subsidized by the HPLT project: there is no fee for
 
The winter school is subsidized by the HPLT project: there is no fee for
Line 35: Line 36:
 
All participants will have to cover their own travel and accomodation
 
All participants will have to cover their own travel and accomodation
 
at Skeikampen, however.
 
at Skeikampen, however.
Two nights at the hotel, including all meals, will come to NOK 3190 (NOK 2790 per person in a shared double room),  
+
Two nights at the hotel, including all meals, will come to NOK 3745 (NOK 3345 per person in a shared double room),  
 
to be paid to the hotel directly.
 
to be paid to the hotel directly.
  
 
= Programme =
 
= Programme =
  
The 2023 winter school will have a thematic focus on ''Large-Scale Language Modeling and Neural Machine Translation with Web Data''.
+
The 2024 winter school will have a thematic focus on ''Large Language Models: Creation, Customization, Evaluation, and Use''.
 
The programme will be comprised of in-depth technical presentations (possibly including some
 
The programme will be comprised of in-depth technical presentations (possibly including some
hands-on elements) from, among others, the
+
hands-on elements) by seasoned experts, with special emphasis on open science and European languages,  
[https://bigscience.huggingface.co BigScience] and [https://commoncrawl.org Common Crawl] initiatives,
+
but also include critical reflections on current development trends in LLM-focussed NLP.
but also include critical reflections on working with massive, uncurated language data.
+
The programme will be complemented with a panel discussion and a ‘walk-through’ of available
The programme may be complemented with an evening ‘research bazar’ (by participants) to stimulate academic socializing and a ‘walk-through’ of available infrastructure on the shared EuroHPC LUMI supercomputer.
+
infrastructure on the shared EuroHPC LUMI supercomputer.
  
 
Confirmed presenters include:
 
Confirmed presenters include:
  
* Mehdi Ali, Fraunhofer IAIS
+
* [http://afra.alishahi.name Afra Alishahi, Tilburg University, The Netherlands]
* [https://faculty.washington.edu/ebender/ Emily M. Bender, University of Washington]
+
* [https://di.ku.dk/english/staff/vip/?pure=en/persons/631668 Desmond Elliot, University of Copenhagen, Denmark]
* [https://www.cs.jhu.edu/~phi/ Philipp Koehn, Johns Hopkins University]
+
* [https://muennighoff.github.io/ Niklas Muennighoff, Contextual AI]
* [https://huggingface.co/teven Teven Le Scao, Hugging Face]
+
* [https://perso.limsi.fr/neveol/bio.html Aurélie Névéol, Interdisciplinary Laboratory of Numerical Sciences, France]
* [https://nljubesi.github.io Nikola Ljubešić, Jožef Stefan Institute & University of Ljubljana]
 
* [https://commoncrawl.org/about/team/#headshot-14714 Sebastian Nagel, Common Crawl]
 
* [https://annargrs.github.io Anna Rogers, University of Copenhagen]
 
* [https://portizs.eu/#about Pedro Ortiz Suarez, University of Mannheim and DFKI]
 
* Zeerak Talat, Simon Fraser University
 
* [https://sites.google.com/site/ivanvulic/ Ivan Vulić, Cambridge University]
 
  
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
!colspan=3|Monday, February 6, 2023
+
!colspan=3|Sunday, February 4, 2024
 
|-
 
|-
 
| 13:00 || 14:00 || Lunch
 
| 13:00 || 14:00 || Lunch
 
|-
 
|-
| 14:00 || 15:30 || '''Session 1'''
+
| 14:00 || 15:30 || '''Session 1''': [http://svn.nlpl.eu/outreach/skeikampen/2024/alishahi.pdf Analyzing and Interpreting Deep Neural Models of Language] ([http://afra.alishahi.name Afra Alishahi])
 
|-
 
|-
 
| 15:30 || 15:50 || Coffee Break
 
| 15:30 || 15:50 || Coffee Break
 
|-
 
|-
| 15:50 || 17:20 || '''Session 2'''
+
| 16:00 || 17:30 || '''Session 2''': [http://svn.nlpl.eu/outreach/skeikampen/2024/alishahi.pdf Analyzing and Interpreting Deep Neural Models of Language] ([http://afra.alishahi.name Afra Alishahi])
 
|-
 
|-
| 17:20 || 17:40 || Coffee Break
+
| 17:30 || 17:50 || Coffee Break
 
|-
 
|-
| 17:40 || 19:10 || '''Session 3'''
+
| 17:50 || 19:20 || '''Session 3''': [http://svn.nlpl.eu/outreach/skeikampen/2024/muennighoff.pdf Scaling Data-constrained Language Models] ([https://muennighoff.github.io/ Niklas Muennighoff])
 +
 
 +
[https://docs.google.com/presentation/d/1WQDr_2sWkeBzAqNN521QJLyklM40fmv6KY_7JDhAntg/edit Slides]
 
|-
 
|-
 
| 19:30 ||  || Dinner
 
| 19:30 ||  || Dinner
|-
 
| 21:00 || || '''Evening Session 1'''
 
 
|}
 
|}
  
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
!colspan=3|Tuesday, February 7, 2022
+
!colspan=3|Monday, February 5, 2024
 
|-
 
|-
 
|colspan=3 | Breakfast is available from 07:30
 
|colspan=3 | Breakfast is available from 07:30
 
|-
 
|-
| 08:30 || 10:00 || '''Session 4'''
+
| 09:00 || 10:30 || '''Session 4''': [http://svn.nlpl.eu/outreach/skeikampen/2024/névéol1.pdf Bias in Natural Language Processing: focus on large language models] ([https://perso.limsi.fr/neveol/bio.html Aurélie Névéol])
 
|-
 
|-
|colspan=3| Lunch is available between 13:00 and 14:30
+
|colspan=3| Free time (Lunch is available between 13:00 and 14:30)
 
|-
 
|-
| 15:00 || 16:20 || '''Session 5'''
+
| 15:00 || 16:30 || '''Session 5''': [http://svn.nlpl.eu/outreach/skeikampen/2024/elliot.pdf Multilingual and multimodal language models] ([https://di.ku.dk/english/staff/vip/?pure=en/persons/631668 Desmond Elliot])
 
|-
 
|-
| 16:20 || 16:40 || Coffee Break
+
| 16:30 || 16:50 || Coffee Break
 
|-
 
|-
| 16:40 || 18:00 || '''Session 6'''
+
| 16:50 || 17:40 || '''Session 6''': [http://svn.nlpl.eu/outreach/skeikampen/2024/elliot.pdf Multilingual and multimodal language models] ([https://di.ku.dk/english/staff/vip/?pure=en/persons/631668 Desmond Elliot])
 
|-
 
|-
| 18:00 || 18:10 || Coffee Break
+
| 17:40 || 18:00 || Coffee Break
 
|-
 
|-
| 18:10 || 19:30 || '''Session 7'''
+
| 18:00 || 19:15 || '''Session 7'''. «Large vs. Small»: panel discussion. Panelists: Desmond Elliott (University of Copenhagen), Evangelia Gogoulou (RISE, Sweden), Afra Alishahi (Tilburg University), Jan Hajič (Charles University in Prague), and Aurélie Névéol (LISN, France)
 
|-
 
|-
 
| 19:30 ||  || Dinner
 
| 19:30 ||  || Dinner
 
|-
 
|-
| 21:00 || || '''Evening Session 2'''
+
| 21:00 || || '''Evening Session'''. [http://svn.nlpl.eu/outreach/skeikampen/2024/lumi.pdf LUMI: BERT in an Hour, GPT in a Week] ([https://www.mn.uio.no/ifi/english/people/aca/davisamu/ David Samuel] and [https://www.utu.fi/en/people/risto-luukkonen Risto Luukkonen])
 
|}
 
|}
  
Line 109: Line 104:
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
!colspan=3|Wednesday, February 8, 2020
+
!colspan=3|Tuesday, February 6, 2024
 
|-
 
|-
 
|colspan=3| Breakfast is available from 07:30
 
|colspan=3| Breakfast is available from 07:30
 
|-
 
|-
| 08:30 || 10:00 || '''Session 8'''
+
| 08:30 || 10:00 || '''Session 8''': [http://svn.nlpl.eu/outreach/skeikampen/2024/névéol2.pdf Reproducibility in Natural Language Processing] ([https://perso.limsi.fr/neveol/bio.html Aurélie Névéol])
 
|-
 
|-
 
| 10:00 || 10:30 || Coffee Break
 
| 10:00 || 10:30 || Coffee Break
 
|-
 
|-
| 10:30 || 12:00 || '''Session 9'''
+
| 10:30 || 12:00 || '''Session 9''': [http://svn.nlpl.eu/outreach/skeikampen/2024/névéol3.pdf Understanding and measuring the environmental impact of Natural Language Processing] ([https://perso.limsi.fr/neveol/bio.html Aurélie Névéol])
 
|-
 
|-
 
| 12:30 || 13:30 || Lunch
 
| 12:30 || 13:30 || Lunch
 +
|-
 +
| 14:00 || 17:00 || Bus transfer to OSL Airport
 
|}
 
|}
  
 
= Registration =
 
= Registration =
  
Registration is now closed.  The 2023 winter school was heavily over-subscribed.
+
In total, we anticipate around 55 participants at the 2024 winter school.
 
+
We have received more requests for participation than we will be able to accommodate,
In total, we anticipate up to 60 participants in the 2023 Winter School.
+
and the registration form has now been closed.
Please register your intent of participation through our
+
We processed requests for participation on a first-come, first-served basis, with an eye toward regional balance.
[https://nettskjema.no/a/300790 on-line registration form].
+
Interested parties who have submitted the registration form were confirmed in three batches, on December 11, on December 15,
We will process requests for participation on a first-come, first-served
+
and on December 22, which was also the closing date for winter school registration.
basis, with an eye toward regional balance.
 
Interested parties who have submitted the registration form will be confirmed
 
in three batches, one on December 5, another one on December 12, and finally
 
after the closing date for registration, which is Thursday, December 15, 2022.
 
  
Once confirmed by the organizing team, participant names will be published
+
Once confirmed by the organizing team, participant names are published
on this page, and registration will establish a
+
on this page, and registration establishes a
 
''binding agreement'' with the hotel.
 
''binding agreement'' with the hotel.
 
Therefore, a cancellation fee will be incurred (unless we can find someone else to ‘take over’ last-minute
 
Therefore, a cancellation fee will be incurred (unless we can find someone else to ‘take over’ last-minute
Line 146: Line 139:
 
With a few exceptions, winter school participants travel to and from the conference hotel
 
With a few exceptions, winter school participants travel to and from the conference hotel
 
jointly on a chartered bus (the HPLT shuttle).
 
jointly on a chartered bus (the HPLT shuttle).
The bus will leave OSL airport no later than 9:30 CET on Monday, February 6.
+
The bus will leave OSL airport no later than 9:45 CET on Sunday, February 4.
Thus, please meet up at 9:15 and make your arrival known to your assigned
+
Thus, please meet up by 9:30 and make your arrival known to your assigned
 
‘tour guide’ (who will introduce themselves to you by email beforehand).
 
‘tour guide’ (who will introduce themselves to you by email beforehand).
  
The group will gather near the bus and taxi information booth in the downstairs
+
The group will gather near the DNB currency exchange booth in the downstairs
 
arrivals area, just outside the international arrivals luggage claims and slightly
 
arrivals area, just outside the international arrivals luggage claims and slightly
to the right, as one exits the customs area:
+
to the left as one exits the customs area:
The yellow dot numbered (17) on the
+
the yellow dot numbered (18) on the
 
[https://avinor.no/globalassets/_oslo-lufthavn/ankomst-arrivals.pdf OSL arrivals map].
 
[https://avinor.no/globalassets/_oslo-lufthavn/ankomst-arrivals.pdf OSL arrivals map].
The group will then walk over to the bus terminal, to leave the airport by 9:30.
+
The group will then walk over to the bus terminal, to leave the airport not long after 9:40.
 
The drive to the Skeikampen conference hotel will take us about three hours, and the bus
 
The drive to the Skeikampen conference hotel will take us about three hours, and the bus
 
will make one stop along the way to stretch our legs and fill up on coffee.
 
will make one stop along the way to stretch our legs and fill up on coffee.
  
The winter school will end with lunch on Wednesday, February 8, before the group returns
+
The winter school will end with lunch on Tuesday, February 6, before the group returns
 
to OSL airport on the HPLT shuttle.
 
to OSL airport on the HPLT shuttle.
 
The bus will leave Skeikampen at 14:00 CET, with an expected arrival time at OSL
 
The bus will leave Skeikampen at 14:00 CET, with an expected arrival time at OSL
around 17:00 to 17:30 CET.
+
around 17:00 to 17:30 CET. After stopping at the OSL airport, the bus will continue to central Oslo.
  
 
= Organization =
 
= Organization =
  
The 2023 Winter School is organized by a team of volunteers from the NLPL and HPLT networks,
+
The 2024 Winter School is organized by a team of volunteers at the University
 +
of Oslo, supported by a programme committee from the HPLT and NLPL network and beyond,
 
please see below.
 
please see below.
 
For all inquiries regarding registration, the programme, logistics,
 
For all inquiries regarding registration, the programme, logistics,
 
or such, please contact <code>hplt-training@ifi.uio.no</code>.
 
or such, please contact <code>hplt-training@ifi.uio.no</code>.
  
The programme committee is comprised of (regrettably lacking in diversity)
+
The programme committee is comprised of:
  
* Hans Eide (Uninett Sigma2, Norway)
+
* Isabelle Augenstein (University of Copenhagen, Denmark)
* Filip Ginter (University of Turku, Finland)
+
* Emily M. Bemder (University of Washington, USA)
* Barry Haddow (University of Edinburgh, UK)
+
* Kenneth Heafield (Edinburgh University, UK)
* Jan Hajič (Charles University in Prague, Czech Republic)
+
* Jindřich Helcl (Charles University, Czech Republic)
* Daniel Hershcovich (University of Copenhagen, Denmark)
 
 
* Marco Kuhlmann (Linköping University, Sweden)
 
* Marco Kuhlmann (Linköping University, Sweden)
 +
* Per Egil Kummervold (National Library of Norway)
 
* Andrey Kutuzov (University of Oslo, Norway)
 
* Andrey Kutuzov (University of Oslo, Norway)
 
* Joakim Nivre (RISE and Uppsala University, Sweden)
 
* Joakim Nivre (RISE and Uppsala University, Sweden)
Line 184: Line 178:
 
* Sampo Pyysalo (University of Turku, Finland)
 
* Sampo Pyysalo (University of Turku, Finland)
 
* Gema Ramirez (Prompsit Language Engineering, Spain)
 
* Gema Ramirez (Prompsit Language Engineering, Spain)
 +
* Anna Rogers (IT University of Copenhagen, Denmark)
 
* Magnus Sahlgreen (AI Sweden)
 
* Magnus Sahlgreen (AI Sweden)
 
* David Samuel (University of Oslo, Norway)
 
* David Samuel (University of Oslo, Norway)
 
* Jörg Tiedemann (University of Helsinki, Finland)
 
* Jörg Tiedemann (University of Helsinki, Finland)
 +
* Erik Velldal (University of Oslo, Norway)
  
 
= Participants =
 
= Participants =
  
# Mehdi Ali (Fraunhofer IAIS)
+
# Afra Alishahi, Tilburg University (The Netherlands)
# Chantal Amrhein (University of Zurich)
+
# Ali Allaith, University of Copenhagen (Denmark)
# Nikolay Arefev (University of Oslo)
+
# Nikolay Arefev, University of Oslo (Norway)
# Mikko Aulamo (University of Helsinki)
+
# Joseph Attieh, University of Helsinki (Finland)
# Elisa Bassignana (IT University of Copenhagen)
+
# Christopher Brückner, Charles University in Prague (Czech Republic)
# Emily M. Bender (University of Washington)
+
# Lucas Charpentier, University of Oslo (Norway)
# Vladimír Benko (Slovak Academy of Sciences)
+
# Konstantin Dobler, Hasso Plattner Institute (Germany)
# Nikolay Bogoychev (Edinburgh University)
+
# Aleksei Dorkin, University of Tartu (Estonia)
# Dhairya Dalal (University of Galway)
+
# Luise Dürlich, Uppsala University (Sweden)
# Annerose Eichel (University of Stuttgart)
+
# Simen Eide, Schibsted (Norway)
# Kenneth Enevoldsen (Aarhus University)
+
# Desmond Elliott, University of Copenhagen (Denmark)
# Mehrdad Farahani (Chalmers University of Technology)
+
# Kenneth Enevoldsen, Aarhus University (Denmark)
# Ona de Gibert (University of Helsinki)
+
# Mariia Fedorova, University of Oslo (Norway)
# Janis Goldzycher (University of Zurich)
+
# Emilie Francis, Gothenburg University (Sweden)
# Jan Hajič (Charles University in Prague)
+
# Evangelia Gogoulou, RISE (Sweden)
# Jindřich Helcl (Charles University in Prague)
+
# Jan Hajič, Charles University in Prague (Czech Republic)
# Oskar Holmström (Linköping University)
+
# Lasse Hansen, Aarhus University Hospital (Denmark)
# Sami Itkonen (University of Helsinki)
+
# Jindřich Helcl, Charles University in Prague (Czech Republic)
# Shaoxiong Ji (University of Helsinki)
+
# Yiping Jin, Pompeu Fabra University (Spain)
# Antonia Karamolegkou (University of Copenhagen)
+
# Lars Johnsen, National Library (Norway)
# Marco Kuhlmann (Linköping University)
+
# Amanda Kann, Stockholm University (Sweden)
# Nina Khairova (Umeå universitet)
+
# Jan Kostkan, Aarhus University (Denmark)
# Philipp Koehn (Johns Hopkins University)
+
# Andrey Kutuzov, University of Oslo (Norway)
# Andrey Kutuzov (University of Oslo)
+
# Tsz Kin Lam, University of Edinburgh (UK)
# Jelmer van der Linde (Edinburgh University)
+
# Wenyan Li, University of Copenhagen (Denmark)
# Pierre Lison (Norsk regnesentral)
+
# Pierre Lison, Norsk Regnesentral
# Nikola Ljubešić (Jožef Stefan Institute & University of Ljubljana)
+
# Jouni Luoma, University of Turku (Finland)
# Yan Meng (University of Amsterdam)
+
# Risto Luukkonen, University of Turku (Finland)
# Max Müller-Eberstein (IT University of Copenhagen)
+
# Arianna Masciolini, Gothenburg University (Sweden)
# Sebastian Nagel (Common Crawl)
+
# Petter Mæhlum, University of Oslo (Norway)
# Graeme Nail (Edinburgh University)
+
# Vladislav Mikhailov, University of Oslo (Norway)
# Anna Nikiforovskaja (Université de Lorraine)
+
# Yousuf Ali Mohammed, Gothenburg University (Sweden)
# Irina Nikishina (Universität Hamburg)
+
# Aurélie Névéol, LISN & CNRS (France)
# Joakim Nivre (RISE and Uppsala University)
+
# Tobias Norlund, AI Sweden (Sweden)
# Stephan Oepen (University of Oslo)
+
# Stephan Oepen, University of Oslo (Norway)
# Anders Jess Pedersen (Alexandra Institute)
+
# Lilja Øvrelid, University of Oslo (Norway)
# Laura Cabello Piqueras (University of Copenhagen)
+
# Alberto Parola, University of Copenhagen (Denmark)
# Myrthe Reuver (Vrije Universiteit Amsterdam)
+
# Siddhesh Pawar, University of Copenhagen (Denmark)
# Anna Rogers (University of Copenhagen)
+
# Erofili Psaltaki, University of Helsinki (Finland)
# Frankie Robertson (University of Jyväskylä)
+
# Akseli Reunamo, University of Turku (Finland)
# Phillip Rust (University of Copenhagen)
+
# David Samuel, University of Oslo (Norway)
# Egil Rønnestad (University of Oslo)
+
# Ricardo Muñoz Sánchez, Gothenburg University (Sweden)
# David Samuel (University of Oslo)
+
# Gautam Kishore Shahi, University of Duisburg-Essen (Germany)
# Diana Santos (University of Oslo)
+
# Janine Siewert, University of Helsinki (Finland)
# Teven Le Scao (Hugging Face)
+
# Étienne Simon, University of Oslo (Norway)
# Yves Scherrer (University of Helsinki)
+
# Inguna Skadiņa, University of Latvia
# Edoardo Signoroni (Masaryk University)
+
# Ondrej Sotolar, Masaryk University (Czech Republic)
# Michal Štefánik (Masaryk University)
+
# Pavel Stranak, Charles University in Prague (Czech Republic)
# Pedro Ortiz Suarez (University of Mannheim and DFKI)
+
# Maria Irena Szawerna, Gothenburg University (Sweden)
# Zeerak Talat (Simon Fraser University)
+
# Jörg Tiedemann, University of Helsinki (Finland)
# Jörg Tiedemann (University of Helsinki)
+
# Ekaterina Uetova, Technological University Dublin (Ireland)
# Samia Touileb (University of Bergen)
+
# Erik Velldal, University of Oslo (Norway)
# Teemu Vahtola (University of Helsinki)
+
# Tea Vojtěchová, Charles University in Prague (Czech Republic)
# Thomas Vakili (Stockholm University)
+
# Jonas Waldendorf, University of Edinburgh (UK)
# Tea Vojtěchová (Charles University in Prague)
+
# Jaume Zaragoza-Bernabeu, Prompsit Language Engineering (Spain)
# Ivan Vulić (University of Cambridge)
+
# Giulio Zhou, University of Edinburgh (UK)
# Nicholas Walker (Norsk regnesentral)
 
# Sondre Wold (University of Oslo)
 
# Jaume Zaragoza-Bernabeu (Prompsit)
 
# Natalia Zawadzka-Paluektau (University of Warsaw)
 

Latest revision as of 17:46, 7 February 2024

HPLT & NLPL Winter School on Large Language Models: Creation, Customization, Evaluation, and Use

Skeikampen.2023.jpg

Background

Since 2023, the NLPL network and Horizon Europe project High-Performance Language Technologies (HPLT) have joined forces to organize the successful winter school series on Web-scale NLP. The winter school seeks to stimulate community formation, i.e. strengthening interaction and collaboration among European research teams in NLP and advancing a shared level of knowledge and experience in using high-performance e-infrastructures for large-scale NLP research. The 2024 edition of the winter school puts special emphasis on NLP researchers from countries who participate in the EuroHPC LUMI consortium. For additional background, please see the archival pages from the 2018, 2019, 2020, and 2023 NLPL Winter Schools.

For early 2024, HPLT will hold its winter school from Sunday, February 4, to Tuesday, February 6, 2024, at a mountain-side hotel (with skiing and walking opportunities) about two hours north of Oslo. The project will organize group bus transfer from and to the Oslo airport Gardermoen, leaving the airport at 9:45 on Sunday morning and returning there around 17:30 on Tuesday afternoon.

The winter school is subsidized by the HPLT project: there is no fee for participants and no charge for the bus transfer to and from the conference hotel. All participants will have to cover their own travel and accomodation at Skeikampen, however. Two nights at the hotel, including all meals, will come to NOK 3745 (NOK 3345 per person in a shared double room), to be paid to the hotel directly.

Programme

The 2024 winter school will have a thematic focus on Large Language Models: Creation, Customization, Evaluation, and Use. The programme will be comprised of in-depth technical presentations (possibly including some hands-on elements) by seasoned experts, with special emphasis on open science and European languages, but also include critical reflections on current development trends in LLM-focussed NLP. The programme will be complemented with a panel discussion and a ‘walk-through’ of available infrastructure on the shared EuroHPC LUMI supercomputer.

Confirmed presenters include:

Sunday, February 4, 2024
13:00 14:00 Lunch
14:00 15:30 Session 1: Analyzing and Interpreting Deep Neural Models of Language (Afra Alishahi)
15:30 15:50 Coffee Break
16:00 17:30 Session 2: Analyzing and Interpreting Deep Neural Models of Language (Afra Alishahi)
17:30 17:50 Coffee Break
17:50 19:20 Session 3: Scaling Data-constrained Language Models (Niklas Muennighoff)

Slides

19:30 Dinner
Monday, February 5, 2024
Breakfast is available from 07:30
09:00 10:30 Session 4: Bias in Natural Language Processing: focus on large language models (Aurélie Névéol)
Free time (Lunch is available between 13:00 and 14:30)
15:00 16:30 Session 5: Multilingual and multimodal language models (Desmond Elliot)
16:30 16:50 Coffee Break
16:50 17:40 Session 6: Multilingual and multimodal language models (Desmond Elliot)
17:40 18:00 Coffee Break
18:00 19:15 Session 7. «Large vs. Small»: panel discussion. Panelists: Desmond Elliott (University of Copenhagen), Evangelia Gogoulou (RISE, Sweden), Afra Alishahi (Tilburg University), Jan Hajič (Charles University in Prague), and Aurélie Névéol (LISN, France)
19:30 Dinner
21:00 Evening Session. LUMI: BERT in an Hour, GPT in a Week (David Samuel and Risto Luukkonen)


Tuesday, February 6, 2024
Breakfast is available from 07:30
08:30 10:00 Session 8: Reproducibility in Natural Language Processing (Aurélie Névéol)
10:00 10:30 Coffee Break
10:30 12:00 Session 9: Understanding and measuring the environmental impact of Natural Language Processing (Aurélie Névéol)
12:30 13:30 Lunch
14:00 17:00 Bus transfer to OSL Airport

Registration

In total, we anticipate around 55 participants at the 2024 winter school. We have received more requests for participation than we will be able to accommodate, and the registration form has now been closed. We processed requests for participation on a first-come, first-served basis, with an eye toward regional balance. Interested parties who have submitted the registration form were confirmed in three batches, on December 11, on December 15, and on December 22, which was also the closing date for winter school registration.

Once confirmed by the organizing team, participant names are published on this page, and registration establishes a binding agreement with the hotel. Therefore, a cancellation fee will be incurred (unless we can find someone else to ‘take over’ last-minute spaces), and no-shows will be charged the full price for at least one night by the hotel.

Logistics

With a few exceptions, winter school participants travel to and from the conference hotel jointly on a chartered bus (the HPLT shuttle). The bus will leave OSL airport no later than 9:45 CET on Sunday, February 4. Thus, please meet up by 9:30 and make your arrival known to your assigned ‘tour guide’ (who will introduce themselves to you by email beforehand).

The group will gather near the DNB currency exchange booth in the downstairs arrivals area, just outside the international arrivals luggage claims and slightly to the left as one exits the customs area: the yellow dot numbered (18) on the OSL arrivals map. The group will then walk over to the bus terminal, to leave the airport not long after 9:40. The drive to the Skeikampen conference hotel will take us about three hours, and the bus will make one stop along the way to stretch our legs and fill up on coffee.

The winter school will end with lunch on Tuesday, February 6, before the group returns to OSL airport on the HPLT shuttle. The bus will leave Skeikampen at 14:00 CET, with an expected arrival time at OSL around 17:00 to 17:30 CET. After stopping at the OSL airport, the bus will continue to central Oslo.

Organization

The 2024 Winter School is organized by a team of volunteers at the University of Oslo, supported by a programme committee from the HPLT and NLPL network and beyond, please see below. For all inquiries regarding registration, the programme, logistics, or such, please contact hplt-training@ifi.uio.no.

The programme committee is comprised of:

  • Isabelle Augenstein (University of Copenhagen, Denmark)
  • Emily M. Bemder (University of Washington, USA)
  • Kenneth Heafield (Edinburgh University, UK)
  • Jindřich Helcl (Charles University, Czech Republic)
  • Marco Kuhlmann (Linköping University, Sweden)
  • Per Egil Kummervold (National Library of Norway)
  • Andrey Kutuzov (University of Oslo, Norway)
  • Joakim Nivre (RISE and Uppsala University, Sweden)
  • Stephan Oepen (University of Oslo, Norway)
  • Sampo Pyysalo (University of Turku, Finland)
  • Gema Ramirez (Prompsit Language Engineering, Spain)
  • Anna Rogers (IT University of Copenhagen, Denmark)
  • Magnus Sahlgreen (AI Sweden)
  • David Samuel (University of Oslo, Norway)
  • Jörg Tiedemann (University of Helsinki, Finland)
  • Erik Velldal (University of Oslo, Norway)

Participants

  1. Afra Alishahi, Tilburg University (The Netherlands)
  2. Ali Allaith, University of Copenhagen (Denmark)
  3. Nikolay Arefev, University of Oslo (Norway)
  4. Joseph Attieh, University of Helsinki (Finland)
  5. Christopher Brückner, Charles University in Prague (Czech Republic)
  6. Lucas Charpentier, University of Oslo (Norway)
  7. Konstantin Dobler, Hasso Plattner Institute (Germany)
  8. Aleksei Dorkin, University of Tartu (Estonia)
  9. Luise Dürlich, Uppsala University (Sweden)
  10. Simen Eide, Schibsted (Norway)
  11. Desmond Elliott, University of Copenhagen (Denmark)
  12. Kenneth Enevoldsen, Aarhus University (Denmark)
  13. Mariia Fedorova, University of Oslo (Norway)
  14. Emilie Francis, Gothenburg University (Sweden)
  15. Evangelia Gogoulou, RISE (Sweden)
  16. Jan Hajič, Charles University in Prague (Czech Republic)
  17. Lasse Hansen, Aarhus University Hospital (Denmark)
  18. Jindřich Helcl, Charles University in Prague (Czech Republic)
  19. Yiping Jin, Pompeu Fabra University (Spain)
  20. Lars Johnsen, National Library (Norway)
  21. Amanda Kann, Stockholm University (Sweden)
  22. Jan Kostkan, Aarhus University (Denmark)
  23. Andrey Kutuzov, University of Oslo (Norway)
  24. Tsz Kin Lam, University of Edinburgh (UK)
  25. Wenyan Li, University of Copenhagen (Denmark)
  26. Pierre Lison, Norsk Regnesentral
  27. Jouni Luoma, University of Turku (Finland)
  28. Risto Luukkonen, University of Turku (Finland)
  29. Arianna Masciolini, Gothenburg University (Sweden)
  30. Petter Mæhlum, University of Oslo (Norway)
  31. Vladislav Mikhailov, University of Oslo (Norway)
  32. Yousuf Ali Mohammed, Gothenburg University (Sweden)
  33. Aurélie Névéol, LISN & CNRS (France)
  34. Tobias Norlund, AI Sweden (Sweden)
  35. Stephan Oepen, University of Oslo (Norway)
  36. Lilja Øvrelid, University of Oslo (Norway)
  37. Alberto Parola, University of Copenhagen (Denmark)
  38. Siddhesh Pawar, University of Copenhagen (Denmark)
  39. Erofili Psaltaki, University of Helsinki (Finland)
  40. Akseli Reunamo, University of Turku (Finland)
  41. David Samuel, University of Oslo (Norway)
  42. Ricardo Muñoz Sánchez, Gothenburg University (Sweden)
  43. Gautam Kishore Shahi, University of Duisburg-Essen (Germany)
  44. Janine Siewert, University of Helsinki (Finland)
  45. Étienne Simon, University of Oslo (Norway)
  46. Inguna Skadiņa, University of Latvia
  47. Ondrej Sotolar, Masaryk University (Czech Republic)
  48. Pavel Stranak, Charles University in Prague (Czech Republic)
  49. Maria Irena Szawerna, Gothenburg University (Sweden)
  50. Jörg Tiedemann, University of Helsinki (Finland)
  51. Ekaterina Uetova, Technological University Dublin (Ireland)
  52. Erik Velldal, University of Oslo (Norway)
  53. Tea Vojtěchová, Charles University in Prague (Czech Republic)
  54. Jonas Waldendorf, University of Edinburgh (UK)
  55. Jaume Zaragoza-Bernabeu, Prompsit Language Engineering (Spain)
  56. Giulio Zhou, University of Edinburgh (UK)