From Nordic Language Processing Laboratory

Revision as of 15:02, 23 December 2024

HPLT & NLPL 2025 Winter School on Pretraining Data Quality and Multilingual LLM Evaluation

Background

Since 2023, the NLPL network and the Horizon Europe project High-Performance Language Technologies (HPLT) have joined forces to organize the successful winter school series on Web-scale NLP. The winter school seeks to stimulate community formation, i.e. to strengthen interaction and collaboration among European research teams in NLP and to advance a shared level of knowledge and experience in using high-performance e-infrastructures for large-scale NLP research. This 2025 edition of the winter school puts special emphasis on NLP researchers from countries that participate in the EuroHPC LUMI consortium. For additional background, please see the archival pages from the 2018, 2019, 2020, 2023, and 2024 NLPL Winter Schools.

For early 2025, HPLT will hold its winter school from Monday, February 3, to Wednesday, February 5, 2025, at a mountain-side hotel (with skiing and walking opportunities) about two hours north of Oslo. The project will organize group bus transfer from and to the Oslo airport Gardermoen, leaving the airport at 9:45 on Monday morning and returning there around 17:30 on Wednesday afternoon.

The winter school is subsidized by the HPLT project: there is no fee for participants and no charge for the bus transfer to and from the conference hotel. However, all participants will have to cover their own travel and their accommodation at Skeikampen. Two nights at the hotel, including all meals, will come to NOK 3855 (NOK 3455 per person in a shared double room), to be paid to the hotel directly.

Programme

The 2025 winter school will have a thematic focus on Pretraining Data Quality and Multilingual LLM Evaluation. The programme will consist of in-depth technical presentations (possibly including some hands-on elements) by seasoned experts, with special emphasis on open science and European languages, and will also include critical reflections on current development trends in LLM-focussed NLP. The programme will be complemented with a ‘walk-through’ of example experience reports on the shared EuroHPC LUMI supercomputer.

Confirmed presenters and talks include:

"EuroLLM – A language model for Europe"

"Open Foundation Models: Scaling Laws and Generalization"

TBA

TBA

"Large Language Models and Factuality"

"Data Quality, Language Coverage and Ethical Considerations in Web Crawling"

TBA

Schedule

Monday, February 3, 2025
13:00–14:00 Lunch
14:00–15:30 Session 1
15:30–15:50 Coffee Break
16:00–17:30 Session 2
17:30–17:50 Coffee Break
17:50–19:20 Session 3
19:30 Dinner

Tuesday, February 4, 2025
Breakfast is available from 07:30
09:00–10:30 Session 4
Free time (Lunch is available between 13:00 and 14:30)
15:00–16:30 Session 5
16:30–16:50 Coffee Break
16:50–17:40 Session 6
17:40–18:00 Coffee Break
18:00–19:15 Session 7
19:30 Dinner
21:00 Evening Session

Wednesday, February 5, 2025
Breakfast is available from 07:30
08:30–10:00 Session 8
10:00–10:30 Coffee Break
10:30–12:00 Session 9
12:30–13:30 Lunch
14:00–17:00 Bus transfer to OSL Airport

Registration

In total, we anticipate around 60 participants at the 2025 winter school. Please register your intent of participation through our on-line registration form. We process requests for participation on a first-come, first-served basis, with an eye toward regional balance. Interested parties who have submitted the registration form are confirmed in three batches, on December 6, December 13, and December 20, the last of which was also the closing date for winter school registration.

Once confirmed by the organizing team, participant names are published on this page, and registration establishes a binding agreement with the hotel. Therefore, a cancellation fee will be incurred (unless we can find someone else to ‘take over’ last-minute spaces), and no-shows will be charged the full price for at least one night by the hotel.

Logistics

With a few exceptions, winter school participants travel to and from the conference hotel jointly on a chartered bus (the HPLT shuttle). The bus will leave OSL airport no later than 9:45 CET on Monday, February 3. Thus, please meet up by 9:30 and make your arrival known to your assigned ‘tour guide’ (who will introduce themselves to you by email beforehand).

The group will gather near the DNB currency exchange booth in the downstairs arrivals area, just outside the international arrivals luggage claims and slightly to the left as one exits the customs area: the yellow dot numbered (18) on the OSL arrivals map. The group will then walk over to the bus terminal, to leave the airport not long after 9:40. The drive to the Skeikampen conference hotel will take us about three hours, and the bus will make one stop along the way to stretch our legs and fill up on coffee.

The winter school will end with lunch on Wednesday, February 5, before the group returns to OSL airport on the HPLT shuttle. The bus will leave Skeikampen at 14:00 CET, with an expected arrival time at OSL around 17:00 to 17:30 CET. After stopping at the OSL airport, the bus will continue to central Oslo.

Organization

The 2025 Winter School is organized by a team of volunteers at the University of Oslo, supported by a programme committee from the HPLT and NLPL network and beyond (see below). For all inquiries regarding registration, the programme, logistics, or similar matters, please contact hplt-training@ifi.uio.no.

The programme committee consists of:

  • Barry Haddow (University of Edinburgh, UK)
  • Andrey Kutuzov (University of Oslo, Norway)
  • Stephan Oepen (University of Oslo, Norway)
  • Sampo Pyysalo (University of Turku, Finland)
  • Jörg Tiedemann (University of Helsinki, Finland)

Participants

  1. Nikolay Arefev, University of Oslo (Norway)
  2. Maria Barrett, Silo AI (Finland)
  3. Alexandra Birch, University of Edinburgh (UK)
  4. Laurie Burchell, University of Edinburgh (UK)
  5. Lucas Charpentier, University of Oslo (Norway)
  6. Pinzhen (Patrick) Chen, University of Edinburgh (UK)
  7. Hannah Clausen, University of Oslo (Norway)
  8. Lucia Domenichelli, University of Pisa (Italy)
  9. Aleksei Dorkin, University of Tartu (Estonia)
  10. Kenneth Enevoldsen, Aarhus University (Denmark)
  11. Tita Enstad, National Library (Norway)
  12. Mariia Fedorova, University of Oslo (Norway)
  13. Yanzhu Guo, INRIA Paris (France)
  14. Arzu Burcu Güven, IT University of Copenhagen (Denmark)
  15. Barry Haddow, University of Edinburgh (UK)
  16. Jan Hajič, Charles University (Czech Republic)
  17. Kathy Hämmerl, CIS, LMU // TU Munich (Germany)
  18. Jindřich Helcl, Charles University (Czech Republic)
  19. Bertram Højer, IT University of Copenhagen (Denmark)
  20. Sekh Mainul Islam, University of Copenhagen (Denmark)
  21. Jenia Jitsev, Jülich Supercomputing Centre / LAION (Germany)
  22. Márton Kardos, Aarhus University (Denmark)
  23. Anastasiia Klimashevskaia, University of Bergen (Norway)
  24. Mateusz Klimaszewski, The University of Edinburgh (UK)
  25. Ville Komulainen, University of Turku (Finland)
  26. Markus Koskela, CSC – IT Center for Science (Finland)
  27. Vimal Kumar Kumar, University of Limerick (Ireland)
  28. Andrey Kutuzov, University of Oslo (Norway)
  29. Robin Lakay, University of Siena (Italy)
  30. Hengyu Luo, University of Helsinki (Finland)
  31. Farrokh Mehryary, University of Turku (Finland)
  32. Vladislav Mikhailov, University of Oslo (Norway)
  33. Andreas Motzfeldt, IT University of Copenhagen (Denmark)
  34. Zain Muhammad Mujahid, University of Copenhagen (Denmark)
  35. Sebastian Nagel, Common Crawl Foundation (Germany)
  36. Marianna Nezhurina, Jülich Supercomputing Centre / LAION (Germany)
  37. Stephan Oepen, University of Oslo (Norway)
  38. Emrah Özcan, Yildiz Technical University (Turkey)
  39. Guilherme Penedo, Hugging Face (France)
  40. Irina Proskurina, University of Lyon (France)
  41. Taido Purason, University of Tartu (Estonia)
  42. Ismaël Rousseau, Orange (France)
  43. Anna Rogers, IT University of Copenhagen (Denmark)
  44. David Samuel, University of Oslo (Norway)
  45. Gema Ramírez Sánchez, Prompsit Language Engineering (Spain)
  46. Marta Sartor, University of Pisa (Italy)
  47. Ipek Baris Schlicht, Universitat Politècnica de València (Spain)
  48. Hanna Shcharbakova, University of Lorraine (France)
  49. Pavel Stepachev, The University of Edinburgh (UK)
  50. Pavel Stranak, Charles University (Czech Republic)
  51. Pedro Ortiz Suarez, Common Crawl Foundation (France)
  52. Otto Tarkka, University of Turku (Finland)
  53. Kushal Tatariya, KU Leuven (Belgium)
  54. Jörg Tiedemann, University of Helsinki (Finland)
  55. Samia Touileb, University of Bergen (Norway)
  56. Elke Vandermeerschen, KU Leuven (Belgium)
  57. Raul Vazquez, University of Helsinki (Finland)
  58. Fedor Vitiugin, Aalto University (Finland)
  59. Tea Vojtěchová, Charles University (Czech Republic)
  60. Artūrs Znotiņš, University of Latvia (Latvia)
  61. Elaine Zosa, Silo AI (Finland)