BEGIN:VCALENDAR
VERSION:2.0
PRODID:-// - ECPv6.0.12//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-ORIGINAL-URL:https://zurich-nlp.ch
X-WR-CALDESC:Events for 
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:Europe/Paris
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:20240331T010000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:20241027T010000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=Europe/Paris:20240425T180000
DTEND;TZID=Europe/Paris:20240425T200000
DTSTAMP:20260430T021148
CREATED:20240401T061424Z
LAST-MODIFIED:20240426T082726Z
UID:1769-1714068000-1714075200@zurich-nlp.ch
SUMMARY:ZurichNLP Meetup #9
DESCRIPTION:We’re happy to announce ZurichNLP #9 on April 25th with the following speakers: \n\nLewis Tunstall (MLE @ HuggingFace) on How to Align Your LLM\nVilém Zouhar (PhD @ ETH Zurich) on Pride and BPE: How We Solved Tokenization but Got It Wrong\n“Tokenization is present in almost all NLP pipelines\, but rarely examined mathematically. We formalize and show boundaries to the most popular tokenization algorithm\, Byte-Pair Encoding. Then\, with information theory\, we show what makes some tokenization better than others and how to use this as a metric before training your expensive models. Lastly\, we admit how we got this hypothesis wrong.”\n\n  \nSlides from Lewis’s presentation can be found here! \nRSVP soon as spots are limited.
URL:https://zurich-nlp.ch/event/zurichnlp-meetup-9/
LOCATION:OAT ETH Zürich (14th Floor)\, Andreasstrasse 5 (14th floor)\, Zurich\, 8050
END:VEVENT
END:VCALENDAR