Difference between revisions of "Community/training"

From Nordic Language Processing Laboratory
(Programme)
(Programme)
 
(112 intermediate revisions by 3 users not shown)
Line 1: Line 1:
[[File:skeikampen.2020.png|center]]
+
'''HPLT & NLPL Winter School on Large Language Models: Creation, Customization, Evaluation, and Use'''
 +
 
 +
[[File:Skeikampen.2023.jpg|center]]
  
 
= Background =
 
= Background =
  
After a two-year pandemic hiatus, the NLPL network and Horizon Europe
+
Since 2023, the NLPL network and Horizon Europe
project ''High-Performance Language Technologies'' (HPLT) join forces
+
project ''[https://hplt-project.org High-Performance Language Technologies]'' (HPLT)
to re-launch the successful winter school series on large-scale NLP.
+
have joined forces to organize the successful winter school series on Web-scale NLP.
 
The winter school seeks to stimulate ''community formation'',
 
The winter school seeks to stimulate ''community formation'',
i.e. strengthening interaction and collaboration among Nordic and
+
i.e. strengthening interaction and collaboration among
 
European research teams in NLP and advancing a shared level of knowledge
 
European research teams in NLP and advancing a shared level of knowledge
 
and experience in using high-performance e-infrastructures for large-scale
 
and experience in using high-performance e-infrastructures for large-scale
 
NLP research.
 
NLP research.
The 2023 edition of the winter school puts special emphasis on
+
The 2024 edition of the winter school puts special emphasis on
 
NLP researchers from countries who participate in the EuroHPC
 
NLP researchers from countries who participate in the EuroHPC
 
[https://www.lumi-supercomputer.eu/lumi-consortium/ LUMI consortium].
 
[https://www.lumi-supercomputer.eu/lumi-consortium/ LUMI consortium].
 
For additional background, please see the archival pages from the
 
For additional background, please see the archival pages from the
 
[http://wiki.nlpl.eu/index.php/Community/training/2018 2018],
 
[http://wiki.nlpl.eu/index.php/Community/training/2018 2018],
[http://wiki.nlpl.eu/index.php/Community/training/2019 2019], and
+
[http://wiki.nlpl.eu/index.php/Community/training/2019 2019],
[http://wiki.nlpl.eu/index.php/Community/training/2020 2020]
+
[http://wiki.nlpl.eu/index.php/Community/training/2020 2020], and
 +
[http://wiki.nlpl.eu/index.php/Community/training/2023 2023]
 
NLPL Winter Schools.
 
NLPL Winter Schools.
  
For early 2023, HPLT will hold its winter school from Monday, February 6, to
+
For early 2024, HPLT will hold its winter school from Sunday, February 4, to
Wednesday, February 8, 2023, at a
+
Tuesday, February 6, 2024, at a
 
[https://www.thonhotels.com/our-hotels/norway/skeikampen/ mountain-side hotel]
 
[https://www.thonhotels.com/our-hotels/norway/skeikampen/ mountain-side hotel]
 
(with skiing and walking opportunities) about two hours north of Oslo.
 
(with skiing and walking opportunities) about two hours north of Oslo.
 
The project will organize group bus transfer from and to the Oslo
 
The project will organize group bus transfer from and to the Oslo
airport ''Gardermoen'', leaving the airport at 9:30 on Monday morning
+
airport ''Gardermoen'', leaving the airport at 9:45 on Sunday morning
and returning there around 17:30 on Wednesday afternoon.
+
and returning there around 17:30 on Tuesday afternoon.
  
 
The winter school is subsidized by the HPLT project: there is no fee for
 
The winter school is subsidized by the HPLT project: there is no fee for
Line 33: Line 36:
 
All participants will have to cover their own travel and accommodation
 
All participants will have to cover their own travel and accommodation
 
at Skeikampen, however.
 
at Skeikampen, however.
Two nights at the hotel, including all meals, will come to NOK 3190 (NOK 2790 per person in a shared double room),  
+
Two nights at the hotel, including all meals, will come to NOK 3745 (NOK 3345 per person in a shared double room),  
 
to be paid to the hotel directly.
 
to be paid to the hotel directly.
  
 
= Programme =
 
= Programme =
  
The 2023 winter school will have a thematic focus on ''Large-Scale Language Modeling and Neural Machine Translation with Web Data''.
+
The 2024 winter school will have a thematic focus on ''Large Language Models: Creation, Customization, Evaluation, and Use''.
 
The programme will be comprised of in-depth technical presentations (possibly including some
 
The programme will be comprised of in-depth technical presentations (possibly including some
hands-on elements) from, among others, the
+
hands-on elements) by seasoned experts, with special emphasis on open science and European languages,  
[https://bigscience.huggingface.co BigScience] and [https://commoncrawl.org Common Crawl] initiatives,
+
but also include critical reflections on current development trends in LLM-focussed NLP.
but also include critical reflections on working with massive, uncurated language data.
+
The programme will be complemented with a panel discussion and a ‘walk-through’ of available
The programme may be complemented with an evening ‘research bazar’ (by participants) to stimulate academic socializing and a ‘walk-through’ of available infrastructure on the shared EuroHPC LUMI supercomputer.
+
infrastructure on the shared EuroHPC LUMI supercomputer.
  
 
Confirmed presenters include:
 
Confirmed presenters include:
  
* Mehdi Ali, Fraunhofer IAIS
+
* [http://afra.alishahi.name Afra Alishahi, Tilburg University, The Netherlands]
* [https://faculty.washington.edu/ebender/ Emily M. Bender, University of Washington]
+
* [https://di.ku.dk/english/staff/vip/?pure=en/persons/631668 Desmond Elliott, University of Copenhagen, Denmark]
* [https://www.cs.jhu.edu/~phi/ Philipp Koehn, Johns Hopkins University]
+
* [https://muennighoff.github.io/ Niklas Muennighoff, Contextual AI]
* [https://huggingface.co/teven Teven Le Scao, Hugging Face]
+
* [https://perso.limsi.fr/neveol/bio.html Aurélie Névéol, Interdisciplinary Laboratory of Numerical Sciences, France]
* [https://nljubesi.github.io Nikola Ljubešić, Jožef Stefan Institute & University of Ljubljana]
 
* [https://commoncrawl.org/about/team/#headshot-14714 Sebastian Nagel, Common Crawl]
 
* [https://annargrs.github.io Anna Rogers, University of Copenhagen]
 
* [https://portizs.eu/#about Pedro Ortiz Suarez, University of Mannheim and DFKI]
 
* Zeerak Talat, Simon Fraser University
 
* [https://sites.google.com/site/ivanvulic/ Ivan Vulić, Cambridge University]
 
  
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
!colspan=3|Monday, February 6, 2023
+
!colspan=3|Sunday, February 4, 2024
 
|-
 
|-
 
| 13:00 || 14:00 || Lunch
 
| 13:00 || 14:00 || Lunch
 
|-
 
|-
| 14:00 || 15:30 || '''Session 1'''
+
| 14:00 || 15:30 || '''Session 1''': [http://svn.nlpl.eu/outreach/skeikampen/2024/alishahi.pdf Analyzing and Interpreting Deep Neural Models of Language] ([http://afra.alishahi.name Afra Alishahi])
 
|-
 
|-
 
| 15:30 || 15:50 || Coffee Break
 
| 15:30 || 15:50 || Coffee Break
 
|-
 
|-
| 15:50 || 17:20 || '''Session 2'''
+
| 16:00 || 17:30 || '''Session 2''': [http://svn.nlpl.eu/outreach/skeikampen/2024/alishahi.pdf Analyzing and Interpreting Deep Neural Models of Language] ([http://afra.alishahi.name Afra Alishahi])
 
|-
 
|-
| 17:20 || 17:40 || Coffee Break
+
| 17:30 || 17:50 || Coffee Break
 
|-
 
|-
| 17:40 || 19:10 || '''Session 3'''
+
| 17:50 || 19:20 || '''Session 3''': [http://svn.nlpl.eu/outreach/skeikampen/2024/muennighoff.pdf Scaling Data-constrained Language Models] ([https://muennighoff.github.io/ Niklas Muennighoff])
 +
 
 +
[https://docs.google.com/presentation/d/1WQDr_2sWkeBzAqNN521QJLyklM40fmv6KY_7JDhAntg/edit Slides]
 
|-
 
|-
 
| 19:30 ||  || Dinner
 
| 19:30 ||  || Dinner
|-
 
| 21:00 || || '''Evening Session 1'''
 
 
|}
 
|}
  
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
!colspan=3|Tuesday, February 7, 2022
+
!colspan=3|Monday, February 5, 2024
 
|-
 
|-
 
|colspan=3 | Breakfast is available from 07:30
 
|colspan=3 | Breakfast is available from 07:30
 
|-
 
|-
| 08:30 || 10:00 || '''Session 4'''
+
| 09:00 || 10:30 || '''Session 4''': [http://svn.nlpl.eu/outreach/skeikampen/2024/névéol1.pdf Bias in Natural Language Processing: focus on large language models] ([https://perso.limsi.fr/neveol/bio.html Aurélie Névéol])
 
|-
 
|-
|colspan=3| Lunch is available between 13:00 and 14:30
+
|colspan=3| Free time (Lunch is available between 13:00 and 14:30)
 
|-
 
|-
| 15:00 || 16:20 || '''Session 5'''
+
| 15:00 || 16:30 || '''Session 5''': [http://svn.nlpl.eu/outreach/skeikampen/2024/elliot.pdf Multilingual and multimodal language models] ([https://di.ku.dk/english/staff/vip/?pure=en/persons/631668 Desmond Elliott])
 
|-
 
|-
| 16:20 || 16:40 || Coffee Break
+
| 16:30 || 16:50 || Coffee Break
 
|-
 
|-
| 16:40 || 18:00 || '''Session 6'''
+
| 16:50 || 17:40 || '''Session 6''': [http://svn.nlpl.eu/outreach/skeikampen/2024/elliot.pdf Multilingual and multimodal language models] ([https://di.ku.dk/english/staff/vip/?pure=en/persons/631668 Desmond Elliott])
 
|-
 
|-
| 18:00 || 18:10 || Coffee Break
+
| 17:40 || 18:00 || Coffee Break
 
|-
 
|-
| 18:10 || 19:30 || '''Session 7'''
+
| 18:00 || 19:15 || '''Session 7'''. «Large vs. Small»: panel discussion. Panelists: Desmond Elliott (University of Copenhagen), Evangelia Gogoulou (RISE, Sweden), Afra Alishahi (Tilburg University), Jan Hajič (Charles University in Prague), and Aurélie Névéol (LISN, France)
 
|-
 
|-
 
| 19:30 ||  || Dinner
 
| 19:30 ||  || Dinner
 
|-
 
|-
| 21:00 || || '''Evening Session 2'''
+
| 21:00 || || '''Evening Session'''. [http://svn.nlpl.eu/outreach/skeikampen/2024/lumi.pdf LUMI: BERT in an Hour, GPT in a Week] ([https://www.mn.uio.no/ifi/english/people/aca/davisamu/ David Samuel] and [https://www.utu.fi/en/people/risto-luukkonen Risto Luukkonen])
 
|}
 
|}
  
Line 107: Line 104:
 
{| class="wikitable"
 
{| class="wikitable"
 
|-
 
|-
!colspan=3|Wednesday, February 8, 2020
+
!colspan=3|Tuesday, February 6, 2024
 
|-
 
|-
 
|colspan=3| Breakfast is available from 07:30
 
|colspan=3| Breakfast is available from 07:30
 
|-
 
|-
| 08:30 || 10:00 || '''Session 8'''
+
| 08:30 || 10:00 || '''Session 8''': [http://svn.nlpl.eu/outreach/skeikampen/2024/névéol2.pdf Reproducibility in Natural Language Processing] ([https://perso.limsi.fr/neveol/bio.html Aurélie Névéol])
 
|-
 
|-
 
| 10:00 || 10:30 || Coffee Break
 
| 10:00 || 10:30 || Coffee Break
 
|-
 
|-
| 10:30 || 12:00 || '''Session 9'''
+
| 10:30 || 12:00 || '''Session 9''': [http://svn.nlpl.eu/outreach/skeikampen/2024/névéol3.pdf Understanding and measuring the environmental impact of Natural Language Processing] ([https://perso.limsi.fr/neveol/bio.html Aurélie Névéol])
 
|-
 
|-
 
| 12:30 || 13:30 || Lunch
 
| 12:30 || 13:30 || Lunch
 +
|-
 +
| 14:00 || 17:00 || Bus transfer to OSL Airport
 
|}
 
|}
  
 
= Registration =
 
= Registration =
  
In total, we anticipate up to 50 participants in the 2023 Winter School.
+
In total, we anticipate around 55 participants at the 2024 winter school.
Please register your intent of participation through our
+
We have received more requests for participation than we will be able to accommodate,
[https://nettskjema.no/a/300790 on-line registration form].
+
and the registration form has now been closed.
We will process requests for participation on a first-come, first-served
+
We processed requests for participation on a first-come, first-served basis, with an eye toward regional balance.
basis, with an eye toward regional balance.
+
Interested parties who have submitted the registration form were confirmed in three batches, on December 11, on December 15,
Interested parties who have submitted the registration form will be confirmed
+
and on December 22, which was also the closing date for winter school registration.
in three batches, one on December 5, another one on December 12, and finally
 
after the closing date for registration, which is Thursday, December 15, 2022.
 
  
Once confirmed by the organizing team, participant names will be published
+
Once confirmed by the organizing team, participant names are published
on this page, and registration will establish a
+
on this page, and registration establishes a
 
''binding agreement'' with the hotel.
 
''binding agreement'' with the hotel.
 
Therefore, a cancellation fee will be incurred (unless we can find someone else to ‘take over’ last-minute
 
Therefore, a cancellation fee will be incurred (unless we can find someone else to ‘take over’ last-minute
Line 142: Line 139:
 
With a few exceptions, winter school participants travel to and from the conference hotel
 
With a few exceptions, winter school participants travel to and from the conference hotel
 
jointly on a chartered bus (the HPLT shuttle).
 
jointly on a chartered bus (the HPLT shuttle).
The bus will leave OSL airport no later than 9:30 CET on Monday, February 6.
+
The bus will leave OSL airport no later than 9:45 CET on Sunday, February 4.
Thus, please meet up at 9:15 and make your arrival known to your assigned
+
Thus, please meet up by 9:30 and make your arrival known to your assigned
 
‘tour guide’ (who will introduce themselves to you by email beforehand).
 
‘tour guide’ (who will introduce themselves to you by email beforehand).
  
The group will gather near the bus and taxi information booth in the downstairs
+
The group will gather near the DNB currency exchange booth in the downstairs
 
arrivals area, just outside the international arrivals luggage claims and slightly
 
arrivals area, just outside the international arrivals luggage claims and slightly
to the right, as one exits the customs area:
+
to the left as one exits the customs area:
The yellow dot numbered (17) on the
+
the yellow dot numbered (18) on the
 
[https://avinor.no/globalassets/_oslo-lufthavn/ankomst-arrivals.pdf OSL arrivals map].
 
[https://avinor.no/globalassets/_oslo-lufthavn/ankomst-arrivals.pdf OSL arrivals map].
The group will then walk over to the bus terminal, to leave the airport by 9:30.
+
The group will then walk over to the bus terminal, to leave the airport not long after 9:40.
 
The drive to the Skeikampen conference hotel will take us about three hours, and the bus
 
The drive to the Skeikampen conference hotel will take us about three hours, and the bus
 
will make one stop along the way to stretch our legs and fill up on coffee.
 
will make one stop along the way to stretch our legs and fill up on coffee.
  
The winter school will end with lunch on Wednesday, February 8, before the group returns
+
The winter school will end with lunch on Tuesday, February 6, before the group returns
 
to OSL airport on the HPLT shuttle.
 
to OSL airport on the HPLT shuttle.
 
The bus will leave Skeikampen at 14:00 CET, with an expected arrival time at OSL
 
The bus will leave Skeikampen at 14:00 CET, with an expected arrival time at OSL
around 17:00 to 17:30 CET.
+
around 17:00 to 17:30 CET. After stopping at the OSL airport, the bus will continue to central Oslo.
  
 
= Organization =
 
= Organization =
  
The 2023 Winter School is organized by a team of volunteers from the NLPL and HPLT networks,
+
The 2024 Winter School is organized by a team of volunteers at the University
 +
of Oslo, supported by a programme committee from the HPLT and NLPL network and beyond,
 
please see below.
 
please see below.
 
For all inquiries regarding registration, the programme, logistics,
 
For all inquiries regarding registration, the programme, logistics,
 
or such, please contact <code>hplt-training@ifi.uio.no</code>.
 
or such, please contact <code>hplt-training@ifi.uio.no</code>.
  
The programme committee is comprised of (regrettably lacking in diversity)
+
The programme committee is comprised of:
  
* Hans Eide (Uninett Sigma2, Norway)
+
* Isabelle Augenstein (University of Copenhagen, Denmark)
* Filip Ginter (University of Turku, Finland)
+
* Emily M. Bender (University of Washington, USA)
* Barry Haddow (University of Edinburgh, UK)
+
* Kenneth Heafield (Edinburgh University, UK)
* Jan Hajič (Charles University in Prague, Czech Republic)
+
* Jindřich Helcl (Charles University, Czech Republic)
* Daniel Hershcovich (University of Copenhagen, Denmark)
 
 
* Marco Kuhlmann (Linköping University, Sweden)
 
* Marco Kuhlmann (Linköping University, Sweden)
 +
* Per Egil Kummervold (National Library of Norway)
 
* Andrey Kutuzov (University of Oslo, Norway)
 
* Andrey Kutuzov (University of Oslo, Norway)
 
* Joakim Nivre (RISE and Uppsala University, Sweden)
 
* Joakim Nivre (RISE and Uppsala University, Sweden)
Line 180: Line 178:
 
* Sampo Pyysalo (University of Turku, Finland)
 
* Sampo Pyysalo (University of Turku, Finland)
 
* Gema Ramirez (Prompsit Language Engineering, Spain)
 
* Gema Ramirez (Prompsit Language Engineering, Spain)
 +
* Anna Rogers (IT University of Copenhagen, Denmark)
 
* Magnus Sahlgren (AI Sweden)
 
* Magnus Sahlgren (AI Sweden)
 
* David Samuel (University of Oslo, Norway)
 
* David Samuel (University of Oslo, Norway)
 
* Jörg Tiedemann (University of Helsinki, Finland)
 
* Jörg Tiedemann (University of Helsinki, Finland)
 +
* Erik Velldal (University of Oslo, Norway)
  
 
= Participants =
 
= Participants =
  
# Mehdi Ali (Fraunhofer IAIS)
+
# Afra Alishahi, Tilburg University (The Netherlands)
# Chantal Amrhein (University of Zurich)
+
# Ali Allaith, University of Copenhagen (Denmark)
# Mikko Aulamo (University of Helsinki)
+
# Nikolay Arefev, University of Oslo (Norway)
# Elisa Bassignana (IT University of Copenhagen)
+
# Joseph Attieh, University of Helsinki (Finland)
# Emily M. Bender (University of Washington)
+
# Christopher Brückner, Charles University in Prague (Czech Republic)
# Vladimír Benko (Slovak Academy of Sciences)
+
# Lucas Charpentier, University of Oslo (Norway)
# Nikolay Bogoychev (Edinburgh University)
+
# Konstantin Dobler, Hasso Plattner Institute (Germany)
# Lucas Charpentier (University of Oslo)
+
# Aleksei Dorkin, University of Tartu (Estonia)
# Dhairya Dalal (University of Galway)
+
# Luise Dürlich, Uppsala University (Sweden)
# Annerose Eichel (University of Stuttgart)
+
# Simen Eide, Schibsted (Norway)
# Kenneth Enevoldsen (Aarhus University)
+
# Desmond Elliott, University of Copenhagen (Denmark)
# Mehrdad Farahani (Chalmers University of Technology)
+
# Kenneth Enevoldsen, Aarhus University (Denmark)
# Ona de Gibert (University of Helsinki)
+
# Mariia Fedorova, University of Oslo (Norway)
# Janis Goldzycher (University of Zurich)
+
# Emilie Francis, Gothenburg University (Sweden)
# Jan Hajič (Charles University in Prague)
+
# Evangelia Gogoulou, RISE (Sweden)
# Jindřich Helcl (Charles University in Prague)
+
# Jan Hajič, Charles University in Prague (Czech Republic)
# Oskar Holmström (Linköping University)
+
# Lasse Hansen, Aarhus University Hospital (Denmark)
# Sami Itkonen (University of Helsinki)
+
# Jindřich Helcl, Charles University in Prague (Czech Republic)
# Antonia Karamolegkou (University of Copenhagen)
+
# Yiping Jin, Pompeu Fabra University (Spain)
# Marco Kuhlmann (Linköping University)
+
# Lars Johnsen, National Library (Norway)
# Nina Khairova (Umeå universitet)
+
# Amanda Kann, Stockholm University (Sweden)
# Philipp Koehn (Johns Hopkins University)
+
# Jan Kostkan, Aarhus University (Denmark)
# Andrey Kutuzov (University of Oslo)
+
# Andrey Kutuzov, University of Oslo (Norway)
# Jelmer van der Linde (Edinburgh University)
+
# Tsz Kin Lam, University of Edinburgh (UK)
# Pierre Lison (Norsk regnesentral)
+
# Wenyan Li, University of Copenhagen (Denmark)
# Nikola Ljubešić (Jožef Stefan Institute & University of Ljubljana)
+
# Pierre Lison, Norsk Regnesentral
# Yan Meng (University of Amsterdam)
+
# Jouni Luoma, University of Turku (Finland)
# Max Müller-Eberstein (IT University of Copenhagen)
+
# Risto Luukkonen, University of Turku (Finland)
# Sebastian Nagel (Common Crawl)
+
# Arianna Masciolini, Gothenburg University (Sweden)
# Graeme Nail (Edinburgh University)
+
# Petter Mæhlum, University of Oslo (Norway)
# Anna Nikiforovskaja (Université de Lorraine)
+
# Vladislav Mikhailov, University of Oslo (Norway)
# Irina Nikishina (Universität Hamburg)
+
# Yousuf Ali Mohammed, Gothenburg University (Sweden)
# Joakim Nivre (RISE and Uppsala University)
+
# Aurélie Névéol, LISN & CNRS (France)
# Stephan Oepen (University of Oslo)
+
# Tobias Norlund, AI Sweden (Sweden)
# Anders Jess Pedersen (Alexandra Institute)
+
# Stephan Oepen, University of Oslo (Norway)
# Laura Cabello Piqueras (University of Copenhagen)
+
# Lilja Øvrelid, University of Oslo (Norway)
# Myrthe Reuver (Vrije Universiteit Amsterdam)
+
# Alberto Parola, University of Copenhagen (Denmark)
# Anna Rogers (University of Copenhagen)
+
# Siddhesh Pawar, University of Copenhagen (Denmark)
# Frankie Robertson (University of Jyväskylä)
+
# Erofili Psaltaki, University of Helsinki (Finland)
# Phillip Rust (University of Copenhagen)
+
# Akseli Reunamo, University of Turku (Finland)
# Egil Rønnestad (University of Oslo)
+
# David Samuel, University of Oslo (Norway)
# David Samuel (University of Oslo)
+
# Ricardo Muñoz Sánchez, Gothenburg University (Sweden)
# Diana Santos (University of Oslo)
+
# Gautam Kishore Shahi, University of Duisburg-Essen (Germany)
# Teven Le Scao (Hugging Face)
+
# Janine Siewert, University of Helsinki (Finland)
# Yves Scherrer (University of Helsinki)
+
# Étienne Simon, University of Oslo (Norway)
# Edoardo Signoroni (Masaryk University)
+
# Inguna Skadiņa, University of Latvia
# Michal Štefánik (Masaryk University)
+
# Ondrej Sotolar, Masaryk University (Czech Republic)
# Pedro Ortiz Suarez (University of Mannheim and DFKI)
+
# Pavel Stranak, Charles University in Prague (Czech Republic)
# Zeerak Talat (Simon Fraser University)
+
# Maria Irena Szawerna, Gothenburg University (Sweden)
# Jörg Tiedemann (University of Helsinki)
+
# Jörg Tiedemann, University of Helsinki (Finland)
# Samia Touileb (University of Bergen)
+
# Ekaterina Uetova, Technological University Dublin (Ireland)
# Teemu Vahtola (University of Helsinki)
+
# Erik Velldal, University of Oslo (Norway)
# Thomas Vakili (Stockholm University)
+
# Tea Vojtěchová, Charles University in Prague (Czech Republic)
# Tea Vojtěchová (Charles University in Prague)
+
# Jonas Waldendorf, University of Edinburgh (UK)
# Ivan Vulić (University of Cambridge)
+
# Jaume Zaragoza-Bernabeu, Prompsit Language Engineering (Spain)
# Nicholas Walker (Norsk regnesentral)
+
# Giulio Zhou, University of Edinburgh (UK)
# Sondre Wold (University of Oslo)
 
# Jaume Zaragoza-Bernabeu (Prompsit)
 
# Natalia Zawadzka-Paluektau (University of Warsaw)
 

Latest revision as of 17:46, 7 February 2024

HPLT & NLPL Winter School on Large Language Models: Creation, Customization, Evaluation, and Use


Background

Since 2023, the NLPL network and Horizon Europe project High-Performance Language Technologies (HPLT) have joined forces to organize the successful winter school series on Web-scale NLP. The winter school seeks to stimulate community formation, i.e. strengthening interaction and collaboration among European research teams in NLP and advancing a shared level of knowledge and experience in using high-performance e-infrastructures for large-scale NLP research. The 2024 edition of the winter school puts special emphasis on NLP researchers from countries that participate in the EuroHPC LUMI consortium. For additional background, please see the archival pages from the 2018, 2019, 2020, and 2023 NLPL Winter Schools.

For early 2024, HPLT will hold its winter school from Sunday, February 4, to Tuesday, February 6, 2024, at a mountain-side hotel (with skiing and walking opportunities) about two hours north of Oslo. The project will organize group bus transfer from and to the Oslo airport Gardermoen, leaving the airport at 9:45 on Sunday morning and returning there around 17:30 on Tuesday afternoon.

The winter school is subsidized by the HPLT project: there is no fee for participants and no charge for the bus transfer to and from the conference hotel. All participants will have to cover their own travel and accommodation at Skeikampen, however. Two nights at the hotel, including all meals, will come to NOK 3745 (NOK 3345 per person in a shared double room), to be paid to the hotel directly.

Programme

The 2024 winter school will have a thematic focus on Large Language Models: Creation, Customization, Evaluation, and Use. The programme will comprise in-depth technical presentations (possibly including some hands-on elements) by seasoned experts, with special emphasis on open science and European languages, and will also include critical reflections on current development trends in LLM-focussed NLP. The programme will be complemented with a panel discussion and a ‘walk-through’ of available infrastructure on the shared EuroHPC LUMI supercomputer.

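Confirmed presenters include:

  • Afra Alishahi, Tilburg University, The Netherlands
  • Desmond Elliott, University of Copenhagen, Denmark
  • Niklas Muennighoff, Contextual AI
  • Aurélie Névéol, Interdisciplinary Laboratory of Numerical Sciences, France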

Sunday, February 4, 2024
13:00 14:00 Lunch
14:00 15:30 Session 1: Analyzing and Interpreting Deep Neural Models of Language (Afra Alishahi)
15:30 15:50 Coffee Break
16:00 17:30 Session 2: Analyzing and Interpreting Deep Neural Models of Language (Afra Alishahi)
17:30 17:50 Coffee Break
17:50 19:20 Session 3: Scaling Data-constrained Language Models (Niklas Muennighoff)

Slides

19:30 Dinner
Monday, February 5, 2024
Breakfast is available from 07:30
09:00 10:30 Session 4: Bias in Natural Language Processing: focus on large language models (Aurélie Névéol)
Free time (Lunch is available between 13:00 and 14:30)
15:00 16:30 Session 5: Multilingual and multimodal language models (Desmond Elliott)
16:30 16:50 Coffee Break
16:50 17:40 Session 6: Multilingual and multimodal language models (Desmond Elliott)
17:40 18:00 Coffee Break
18:00 19:15 Session 7. «Large vs. Small»: panel discussion. Panelists: Desmond Elliott (University of Copenhagen), Evangelia Gogoulou (RISE, Sweden), Afra Alishahi (Tilburg University), Jan Hajič (Charles University in Prague), and Aurélie Névéol (LISN, France)
19:30 Dinner
21:00 Evening Session. LUMI: BERT in an Hour, GPT in a Week (David Samuel and Risto Luukkonen)


Tuesday, February 6, 2024
Breakfast is available from 07:30
08:30 10:00 Session 8: Reproducibility in Natural Language Processing (Aurélie Névéol)
10:00 10:30 Coffee Break
10:30 12:00 Session 9: Understanding and measuring the environmental impact of Natural Language Processing (Aurélie Névéol)
12:30 13:30 Lunch
14:00 17:00 Bus transfer to OSL Airport

Registration

In total, we anticipate around 55 participants at the 2024 winter school. We have received more requests for participation than we will be able to accommodate, and the registration form has now been closed. We processed requests for participation on a first-come, first-served basis, with an eye toward regional balance. Interested parties who have submitted the registration form were confirmed in three batches, on December 11, on December 15, and on December 22, which was also the closing date for winter school registration.

Once confirmed by the organizing team, participant names are published on this page, and registration establishes a binding agreement with the hotel. Therefore, a cancellation fee will be incurred (unless we can find someone else to ‘take over’ last-minute spaces), and no-shows will be charged the full price for at least one night by the hotel.

Logistics

With a few exceptions, winter school participants travel to and from the conference hotel jointly on a chartered bus (the HPLT shuttle). The bus will leave OSL airport no later than 9:45 CET on Sunday, February 4. Thus, please meet up by 9:30 and make your arrival known to your assigned ‘tour guide’ (who will introduce themselves to you by email beforehand).

The group will gather near the DNB currency exchange booth in the downstairs arrivals area, just outside the international arrivals luggage claims and slightly to the left as one exits the customs area: the yellow dot numbered (18) on the OSL arrivals map. The group will then walk over to the bus terminal, to leave the airport not long after 9:40. The drive to the Skeikampen conference hotel will take us about three hours, and the bus will make one stop along the way to stretch our legs and fill up on coffee.

The winter school will end with lunch on Tuesday, February 6, before the group returns to OSL airport on the HPLT shuttle. The bus will leave Skeikampen at 14:00 CET, with an expected arrival time at OSL around 17:00 to 17:30 CET. After stopping at the OSL airport, the bus will continue to central Oslo.

Organization

The 2024 Winter School is organized by a team of volunteers at the University of Oslo, supported by a programme committee from the HPLT and NLPL networks and beyond (please see below). For all inquiries regarding registration, the programme, logistics, or such, please contact hplt-training@ifi.uio.no.

The programme committee is comprised of:

  • Isabelle Augenstein (University of Copenhagen, Denmark)
  • Emily M. Bender (University of Washington, USA)
  • Kenneth Heafield (Edinburgh University, UK)
  • Jindřich Helcl (Charles University, Czech Republic)
  • Marco Kuhlmann (Linköping University, Sweden)
  • Per Egil Kummervold (National Library of Norway)
  • Andrey Kutuzov (University of Oslo, Norway)
  • Joakim Nivre (RISE and Uppsala University, Sweden)
  • Stephan Oepen (University of Oslo, Norway)
  • Sampo Pyysalo (University of Turku, Finland)
  • Gema Ramirez (Prompsit Language Engineering, Spain)
  • Anna Rogers (IT University of Copenhagen, Denmark)
  • Magnus Sahlgren (AI Sweden)
  • David Samuel (University of Oslo, Norway)
  • Jörg Tiedemann (University of Helsinki, Finland)
  • Erik Velldal (University of Oslo, Norway)

Participants

  1. Afra Alishahi, Tilburg University (The Netherlands)
  2. Ali Allaith, University of Copenhagen (Denmark)
  3. Nikolay Arefev, University of Oslo (Norway)
  4. Joseph Attieh, University of Helsinki (Finland)
  5. Christopher Brückner, Charles University in Prague (Czech Republic)
  6. Lucas Charpentier, University of Oslo (Norway)
  7. Konstantin Dobler, Hasso Plattner Institute (Germany)
  8. Aleksei Dorkin, University of Tartu (Estonia)
  9. Luise Dürlich, Uppsala University (Sweden)
  10. Simen Eide, Schibsted (Norway)
  11. Desmond Elliott, University of Copenhagen (Denmark)
  12. Kenneth Enevoldsen, Aarhus University (Denmark)
  13. Mariia Fedorova, University of Oslo (Norway)
  14. Emilie Francis, Gothenburg University (Sweden)
  15. Evangelia Gogoulou, RISE (Sweden)
  16. Jan Hajič, Charles University in Prague (Czech Republic)
  17. Lasse Hansen, Aarhus University Hospital (Denmark)
  18. Jindřich Helcl, Charles University in Prague (Czech Republic)
  19. Yiping Jin, Pompeu Fabra University (Spain)
  20. Lars Johnsen, National Library (Norway)
  21. Amanda Kann, Stockholm University (Sweden)
  22. Jan Kostkan, Aarhus University (Denmark)
  23. Andrey Kutuzov, University of Oslo (Norway)
  24. Tsz Kin Lam, University of Edinburgh (UK)
  25. Wenyan Li, University of Copenhagen (Denmark)
  26. Pierre Lison, Norsk Regnesentral
  27. Jouni Luoma, University of Turku (Finland)
  28. Risto Luukkonen, University of Turku (Finland)
  29. Arianna Masciolini, Gothenburg University (Sweden)
  30. Petter Mæhlum, University of Oslo (Norway)
  31. Vladislav Mikhailov, University of Oslo (Norway)
  32. Yousuf Ali Mohammed, Gothenburg University (Sweden)
  33. Aurélie Névéol, LISN & CNRS (France)
  34. Tobias Norlund, AI Sweden (Sweden)
  35. Stephan Oepen, University of Oslo (Norway)
  36. Lilja Øvrelid, University of Oslo (Norway)
  37. Alberto Parola, University of Copenhagen (Denmark)
  38. Siddhesh Pawar, University of Copenhagen (Denmark)
  39. Erofili Psaltaki, University of Helsinki (Finland)
  40. Akseli Reunamo, University of Turku (Finland)
  41. David Samuel, University of Oslo (Norway)
  42. Ricardo Muñoz Sánchez, Gothenburg University (Sweden)
  43. Gautam Kishore Shahi, University of Duisburg-Essen (Germany)
  44. Janine Siewert, University of Helsinki (Finland)
  45. Étienne Simon, University of Oslo (Norway)
  46. Inguna Skadiņa, University of Latvia
  47. Ondrej Sotolar, Masaryk University (Czech Republic)
  48. Pavel Stranak, Charles University in Prague (Czech Republic)
  49. Maria Irena Szawerna, Gothenburg University (Sweden)
  50. Jörg Tiedemann, University of Helsinki (Finland)
  51. Ekaterina Uetova, Technological University Dublin (Ireland)
  52. Erik Velldal, University of Oslo (Norway)
  53. Tea Vojtěchová, Charles University in Prague (Czech Republic)
  54. Jonas Waldendorf, University of Edinburgh (UK)
  55. Jaume Zaragoza-Bernabeu, Prompsit Language Engineering (Spain)
  56. Giulio Zhou, University of Edinburgh (UK)