Seminars

Links' Seminars and Public Events Add to google calendar
2025
Fri 28th Mar
11:00 am
12:15 pm
Add event to google
Seminar by Joël Mba Kouhoue
Speaker: Joël Mba Kouhoue

Title: Modélisation et Intégration des Données Multi-vues : Approches basées sur l’Apprentissage et la Sémantique
"Lille-Salle A12"
Fri 14th Mar
11:00 am
12:30 pm
Add event to google
Seminar by Tito
Speaker: Lê Thành Dũng Nguyễn, aka “Tito” — nguyentito.eu/

Title: The structure of polynomial growth for tree automata/transducers and MSO set queries

Abstract:

Given an ℕ-weighted tree automaton, we give a decision procedure for
exponential vs polynomial growth (with respect to the input size) in quadratic
time, and an algorithm that computes the exact polynomial degree of growth in
cubic time. As a special case, they apply to the growth of the ambiguity of a
nondeterministic tree automaton, i.e. the number of distinct accepting runs
over a given input. Our time complexities match the fine-grained lower
bounds known for these problems restricted to ambiguity of word automata
(Drabik, Dürr, Frei, Mazowiecki & Węgrzycki 2025).

We deduce analogous decidability results (ignoring complexity) for the
growth of the number of results of set queries in Monadic Second-Order logic
(MSO) over ranked trees. In the case of polynomial growth of degree k, we
also prove a reparameterization theorem for such queries: their results can be
mapped to k-tuples of input nodes in a finite-to-one and MSO-definable fashion.

This property of MSO set queries leads directly to a generalization of
Bojańczyk's dimension minimization theorem for string-to-string polyregular
functions. Our generalization applies to MSO set interpretations from trees,
which subsume (as we show) tree-walking tree transducers and invisible pebble
tree-to-string transducers. Finally, with a bit more work we obtain the following:

* a new, short and conceptual proof that macro tree transducers of
linear growth compute only tree-to-tree MSO transductions (a seminal
theorem of Engelfriet and Maneth);

* a procedure to decide polynomial size-to-height increase for macro
tree transducers and compute the polynomial degree, which extends the
recent decidability result of Gallot, Maneth, Nakano and Peyrat (2024)
concerning linear size-to-height increase.
"Lille-Salle B21"
Fri 31st Jan
11:00 am
1:15 pm
Add event to google
Séminar by Thaïs Baudon
Speaker: Thaïs Baudon (perso.ens-lyon.fr/thais.baudon/)

Room: B21, speaker will be online

Title: Algebraic Data Types for High-Performance Computing

Abstract:

Initially present only in functional languages such as OCaml and
Haskell, *Algebraic Data Types* (ADTs) have now become pervasive in
mainstream languages, providing nice data abstractions and an elegant
way to express functions through pattern matching. Unfortunately, ADTs
remain seldom used in low-level programming. One reason is that their
increased convenience comes at the cost of abstracting away the exact
memory layout of values. Even Rust, which tries to optimize data
layout, severely limits control over memory representation.
Our goal is to let programmers specify highly optimized
memory layouts for inductive data structures in a flexible and
expressive way, while still enjoying high-level programming constructs
such as ADTs and pattern matching to manipulate this data.
To this end, we propose a language dubbed *Ribbit* which combines a
high-level language, consisting of ADTs, pattern matching and basic
manipulation of immutable values, with *memory types* specifying the
precise memory layout of each high-level type, providing full control
over the memory representation of values. We provide formal semantics
of both (high-level and memory) languages, criteria defining an agreement
relation between ADTs and their layouts, and compilation algorithms emitting
low-level code to manipulate data following custom memory layouts.
"Lille-Salle B21"
Fri 24th Jan
11:00 am
12:15 pm
Add event to google
Seminar by Théo Matricon
Speaker: Théo Matricon (theomat.github.io/)

Room: B21

Titre : Cost-guided enumeration of regular tree languages in constant delay

Abstract :

Program Synthesis from a few input output examples can be solved in an enumerative fashion, this enables domain agnostic solving assuming the specification can be checked.
The enumeration problem is given a reguar tree language represented by a top down deterministic tree automaton, to enumerate all such trees.
The complexity of the problem is measured in the delay: the time complexity between the enumeration of one tree and the next.
Enumeration by increasing size or depth does not scale for program synthesis due to the combinatorial explosion of the number of trees.
Therefore weights are added on the transition of the input automaton. Summing these weights define a unique cost for each tree inducing a total order over trees.
The goal is to enumerate in a cost-guided fashion i.e. that is in non-decreasing (>=) order of cost.
Algorithms in the literature since the first such search algorithm have all been in logarithmic delay.
We describe the first cost-guided enumeration algorithm with constant delay.


"Lille-Salle B21"
Fri 10th Jan
11:00 am
12:15 pm
Add event to google
Séminar by Thaïs Baudon
**This seminar was cancelled and is postponed to Jan 31**


Speaker: Thaïs Baudon (perso.ens-lyon.fr/thais.baudon/)

Room: B21

Title: Algebraic Data Types for High-Performance Computing

Abstract:

Initially present only in functional languages such as OCaml and
Haskell, *Algebraic Data Types* (ADTs) have now become pervasive in
mainstream languages, providing nice data abstractions and an elegant
way to express functions through pattern matching. Unfortunately, ADTs
remain seldom used in low-level programming. One reason is that their
increased convenience comes at the cost of abstracting away the exact
memory layout of values. Even Rust, which tries to optimize data
layout, severely limits control over memory representation.
Our goal is to let programmers specify highly optimized
memory layouts for inductive data structures in a flexible and
expressive way, while still enjoying high-level programming constructs
such as ADTs and pattern matching to manipulate this data.
To this end, we propose a language dubbed *Ribbit* which combines a
high-level language, consisting of ADTs, pattern matching and basic
manipulation of immutable values, with *memory types* specifying the
precise memory layout of each high-level type, providing full control
over the memory representation of values. We provide formal semantics
of both (high-level and memory) languages, criteria defining an agreement
relation between ADTs and their layouts, and compilation algorithms emitting
low-level code to manipulate data following custom memory layouts.
"Lille-Salle B21"
2024
Fri 20th Dec
11:00 am
12:15 pm
Add event to google
Seminar by Nina Pardal
Speaker: Nina Pardal (glyc.dc.uba.ar/nina/)

Room: B21, via zoom

Title: The Distributional Uncertainty of the SHAP score in Explainable Machine Learning

Abstract:

Attribution scores reflect how important the feature values in an input entity are for the output of a machine learning model. One of the most popular attribution scores is the SHAP score, which is an instantiation of the general Shapley value used in coalition game theory. The definition of this score relies on a probability distribution on the entity population. Since the exact distribution is generally unknown, it needs to be assigned subjectively or be estimated from data, which may lead to misleading feature scores. In this paper, we propose a principled framework for reasoning on SHAP scores under unknown entity population distributions. In our framework, we consider an uncertainty region that contains the potential distributions, and the SHAP score of a feature becomes a function defined over this region. We study the basic problems of finding maxima and minima of this function, which allows us to determine tight ranges for the SHAP scores of all features, and performed experiments on a real-world dataset, showing that our framework may contribute to a more robust feature scoring.
"Lille-Salle B21"
Wed 20th Nov
11:00 am
12:00 pm
Add event to google
Seminar by Bastien Degardins
Speaker: Bastien Degardins

Room: Amphi Atrium (RdC Bâtiment ESPRIT)

Title: Visualization and queries for biology in de Bruijn Graphs using relational databases.

Abstract:

Since the emergence of modern sequencers, sequence bioinformatics has become
crucial for processing sequencing data, essential in medicine, biology,
archaeology, and beyond. However, the growing data volumes require software
adaptations to meet these new challenges, despite significant hardware
advancements. Vizitig is a genomic and transcriptomic data visualization
software based on De Bruijn graphs, offering an intuitive query system without
needing to load indexes into RAM, allowing direct work on disk. By leveraging
relational databases, which are highly optimized and backed by decades of
research, Vizitig opens new avenues for research.It relies on NetworkDisk, a
Python package that manages graphs in a relational database, simplifying
software engineering. With a domain-specific language (DSL), Vizitig enables
intuitive queries and easy manipulation of graph metadata, even for
non-technical users. For experts, its compatibility with NetworkX provides
additional possibilities in terms of graph manipulation.

Fri 15th Nov
11:00 am
12:15 pm
Add event to google
Seminar by Gabriel Bathie
Speaker: Gabriel Bathie (perso.ens-lyon.fr/gabriel.bathie/)

Room: B21

Title: The complexity of Testing Regular Languages - Gabriel Bathie, Corto Mascle and Nathanaël Fijalkow (LaBRI, Université de Bordeaux)

Abstract:

Property testing is concerned with the design of algorithms making sublinear number of queries to distinguish whether the input satisfies a given property or is far from having this property.
A seminal paper of Alon, Krivelevich, Newman, and Szegedy in 2001 introduced property testing of formal languages: the goal is to determine whether an input word belongs to a given language, or is far from any word in that language (in terms of Hamming distance).
They constructed the first property testing algorithm for the class of all regular languages. Somewhat surprisingly, their algorithm uses a number of queries that does not depend on the length of the input word.
This opened up a line of work with improved complexity results and applications to streaming algorithms.
In this work, we show a trichotomy result: the class of regular languages can be divided into three classes, each of which is associated with an optimal testing complexity.
Our analysis yields effective characterizations for all three classes using so-called minimal blocking sequences, reasoning directly and combinatorially on automata.
This talk will give an overview of the methods used since the work of Alon et al. and highlight the main tools used for our combinatorial characterization.
Based on joint work with Corto Mascle and Nathanaël Fijalkow.
"Lille-Salle B21"
Tue 12th Nov
2:00 pm
3:30 pm
Add event to google
Seminar from Aliaume Lopez
Speaker: Aliaume Lopez (www.lsv.fr/~lopez/)

Title: Which polynomials are computed by N-weighted automata?

Room: B21

Abstract:
Given a semiring K, the notion of K-weighted automata generalizes regular languages to functions from Σ* to K. This model allows us to compute (multivariate) polynomial functions with coefficients in K. We provide a decidable characterization of polynomials with coefficients in Q that can be computed by K-weighted automata for K = (N,+,×) and for K = (Z+,×). As a consequence, we can decide which functions computed by Z-weighted automata are computed by N-weighted automata, under the assumption of commutativity (the order of the letters in the input does not matter) and polynomial growth (the output of the function is bounded by a polynomial in the size of the input). This surprisingly allows us to decide whether such functions are star-free, a notion borrowed from the theory of regular languages.
"Lille-Salle B21"
Fri 11th Oct
10:30 am
12:00 pm
Add event to google
Seminar from Alexis de Colnet
Speaker: Alexis de Colnet (www.ac.tuwien.ac.at/people/decolnet/)

Title: An FPRAS for #NFA and #nFBDD

Abstract:

#NFA is the problem of counting the words of a given length accepted by
a non-deterministic finite automaton (NFA). The problem is #P-hard but
the approximate variant admits polynomial-time randomized algorithms
(FPRAS, or fully-polynomial time randomized approximation schemes).
Arenas, Croquevielle, Jayaram and Riveros were the first to show that
#NFA admits an FPRAS and that this result extends to several other
counting problems, in fact all problems in the class SpanL. In this talk
we present another FPRAS for #NFA which applies to problems not covered
by Arenas et al.'s result. In particular, the FPRAS described in this
talk can be used for the problem of counting the satisfying assignments
of non-deterministic read-once branching programs (nFBDD).
Show in Google map
Atrium bâtiment Esprit

Lien Permanent pour cet article : https://team.inria.fr/links/fr/seminars/