Seminars

«Juin Août»

2025
Fri 28th Mar 11:00 am 12:15 pm	Seminar by Joël Mba Kouhoue Speaker: Joël Mba Kouhoue Title: Modélisation et Intégration des Données Multi-vues : Approches basées sur l’Apprentissage et la Sémantique " target="_BLANK" title="Show location in Google Maps" > "Lille-Salle A12"
Fri 14th Mar 11:00 am 12:30 pm	Seminar by Tito Speaker: Lê Thành Dũng Nguyễn, aka “Tito” — nguyentito.eu/ Title: The structure of polynomial growth for tree automata/transducers and MSO set queries Abstract: Given an ℕ-weighted tree automaton, we give a decision procedure for exponential vs polynomial growth (with respect to the input size) in quadratic time, and an algorithm that computes the exact polynomial degree of growth in cubic time. As a special case, they apply to the growth of the ambiguity of a nondeterministic tree automaton, i.e. the number of distinct accepting runs over a given input. Our time complexities match the fine-grained lower bounds known for these problems restricted to ambiguity of word automata (Drabik, Dürr, Frei, Mazowiecki & Węgrzycki 2025). We deduce analogous decidability results (ignoring complexity) for the growth of the number of results of set queries in Monadic Second-Order logic (MSO) over ranked trees. In the case of polynomial growth of degree k, we also prove a reparameterization theorem for such queries: their results can be mapped to k-tuples of input nodes in a finite-to-one and MSO-definable fashion. This property of MSO set queries leads directly to a generalization of Bojańczyk's dimension minimization theorem for string-to-string polyregular functions. Our generalization applies to MSO set interpretations from trees, which subsume (as we show) tree-walking tree transducers and invisible pebble tree-to-string transducers. Finally, with a bit more work we obtain the following: * a new, short and conceptual proof that macro tree transducers of linear growth compute only tree-to-tree MSO transductions (a seminal theorem of Engelfriet and Maneth); * a procedure to decide polynomial size-to-height increase for macro tree transducers and compute the polynomial degree, which extends the recent decidability result of Gallot, Maneth, Nakano and Peyrat (2024) concerning linear size-to-height increase. " target="_BLANK" title="Show location in Google Maps" > "Lille-Salle B21"
Fri 31st Jan 11:00 am 1:15 pm	Séminar by Thaïs Baudon Speaker: Thaïs Baudon (perso.ens-lyon.fr/thais.baudon/) Room: B21, speaker will be online Title: Algebraic Data Types for High-Performance Computing Abstract: Initially present only in functional languages such as OCaml and Haskell, Algebraic Data Types (ADTs) have now become pervasive in mainstream languages, providing nice data abstractions and an elegant way to express functions through pattern matching. Unfortunately, ADTs remain seldom used in low-level programming. One reason is that their increased convenience comes at the cost of abstracting away the exact memory layout of values. Even Rust, which tries to optimize data layout, severely limits control over memory representation. Our goal is to let programmers specify highly optimized memory layouts for inductive data structures in a flexible and expressive way, while still enjoying high-level programming constructs such as ADTs and pattern matching to manipulate this data. To this end, we propose a language dubbed Ribbit which combines a high-level language, consisting of ADTs, pattern matching and basic manipulation of immutable values, with memory types specifying the precise memory layout of each high-level type, providing full control over the memory representation of values. We provide formal semantics of both (high-level and memory) languages, criteria defining an agreement relation between ADTs and their layouts, and compilation algorithms emitting low-level code to manipulate data following custom memory layouts. " target="_BLANK" title="Show location in Google Maps" > "Lille-Salle B21"
Fri 24th Jan 11:00 am 12:15 pm	Seminar by Théo Matricon Speaker: Théo Matricon (theomat.github.io/) Room: B21 Titre : Cost-guided enumeration of regular tree languages in constant delay Abstract : Program Synthesis from a few input output examples can be solved in an enumerative fashion, this enables domain agnostic solving assuming the specification can be checked. The enumeration problem is given a reguar tree language represented by a top down deterministic tree automaton, to enumerate all such trees. The complexity of the problem is measured in the delay: the time complexity between the enumeration of one tree and the next. Enumeration by increasing size or depth does not scale for program synthesis due to the combinatorial explosion of the number of trees. Therefore weights are added on the transition of the input automaton. Summing these weights define a unique cost for each tree inducing a total order over trees. The goal is to enumerate in a cost-guided fashion i.e. that is in non-decreasing (>=) order of cost. Algorithms in the literature since the first such search algorithm have all been in logarithmic delay. We describe the first cost-guided enumeration algorithm with constant delay. " target="_BLANK" title="Show location in Google Maps" > "Lille-Salle B21"
Fri 10th Jan 11:00 am 12:15 pm	Séminar by Thaïs Baudon This seminar was cancelled and is postponed to Jan 31 Speaker: Thaïs Baudon (perso.ens-lyon.fr/thais.baudon/) Room: B21 Title: Algebraic Data Types for High-Performance Computing Abstract: Initially present only in functional languages such as OCaml and Haskell, Algebraic Data Types (ADTs) have now become pervasive in mainstream languages, providing nice data abstractions and an elegant way to express functions through pattern matching. Unfortunately, ADTs remain seldom used in low-level programming. One reason is that their increased convenience comes at the cost of abstracting away the exact memory layout of values. Even Rust, which tries to optimize data layout, severely limits control over memory representation. Our goal is to let programmers specify highly optimized memory layouts for inductive data structures in a flexible and expressive way, while still enjoying high-level programming constructs such as ADTs and pattern matching to manipulate this data. To this end, we propose a language dubbed Ribbit which combines a high-level language, consisting of ADTs, pattern matching and basic manipulation of immutable values, with memory types specifying the precise memory layout of each high-level type, providing full control over the memory representation of values. We provide formal semantics of both (high-level and memory) languages, criteria defining an agreement relation between ADTs and their layouts, and compilation algorithms emitting low-level code to manipulate data following custom memory layouts. " target="_BLANK" title="Show location in Google Maps" > "Lille-Salle B21"
2024
Fri 20th Dec 11:00 am 12:15 pm	Seminar by Nina Pardal Speaker: Nina Pardal (glyc.dc.uba.ar/nina/) Room: B21, via zoom Title: The Distributional Uncertainty of the SHAP score in Explainable Machine Learning Abstract: Attribution scores reflect how important the feature values in an input entity are for the output of a machine learning model. One of the most popular attribution scores is the SHAP score, which is an instantiation of the general Shapley value used in coalition game theory. The definition of this score relies on a probability distribution on the entity population. Since the exact distribution is generally unknown, it needs to be assigned subjectively or be estimated from data, which may lead to misleading feature scores. In this paper, we propose a principled framework for reasoning on SHAP scores under unknown entity population distributions. In our framework, we consider an uncertainty region that contains the potential distributions, and the SHAP score of a feature becomes a function defined over this region. We study the basic problems of finding maxima and minima of this function, which allows us to determine tight ranges for the SHAP scores of all features, and performed experiments on a real-world dataset, showing that our framework may contribute to a more robust feature scoring. " target="_BLANK" title="Show location in Google Maps" > "Lille-Salle B21"
Wed 20th Nov 11:00 am 12:00 pm	Seminar by Bastien Degardins Speaker: Bastien Degardins Room: Amphi Atrium (RdC Bâtiment ESPRIT) Title: Visualization and queries for biology in de Bruijn Graphs using relational databases. Abstract: Since the emergence of modern sequencers, sequence bioinformatics has become crucial for processing sequencing data, essential in medicine, biology, archaeology, and beyond. However, the growing data volumes require software adaptations to meet these new challenges, despite significant hardware advancements. Vizitig is a genomic and transcriptomic data visualization software based on De Bruijn graphs, offering an intuitive query system without needing to load indexes into RAM, allowing direct work on disk. By leveraging relational databases, which are highly optimized and backed by decades of research, Vizitig opens new avenues for research.It relies on NetworkDisk, a Python package that manages graphs in a relational database, simplifying software engineering. With a domain-specific language (DSL), Vizitig enables intuitive queries and easy manipulation of graph metadata, even for non-technical users. For experts, its compatibility with NetworkX provides additional possibilities in terms of graph manipulation.
Fri 15th Nov 11:00 am 12:15 pm	Seminar by Gabriel Bathie Speaker: Gabriel Bathie (perso.ens-lyon.fr/gabriel.bathie/) Room: B21 Title: The complexity of Testing Regular Languages - Gabriel Bathie, Corto Mascle and Nathanaël Fijalkow (LaBRI, Université de Bordeaux) Abstract: Property testing is concerned with the design of algorithms making sublinear number of queries to distinguish whether the input satisfies a given property or is far from having this property. A seminal paper of Alon, Krivelevich, Newman, and Szegedy in 2001 introduced property testing of formal languages: the goal is to determine whether an input word belongs to a given language, or is far from any word in that language (in terms of Hamming distance). They constructed the first property testing algorithm for the class of all regular languages. Somewhat surprisingly, their algorithm uses a number of queries that does not depend on the length of the input word. This opened up a line of work with improved complexity results and applications to streaming algorithms. In this work, we show a trichotomy result: the class of regular languages can be divided into three classes, each of which is associated with an optimal testing complexity. Our analysis yields effective characterizations for all three classes using so-called minimal blocking sequences, reasoning directly and combinatorially on automata. This talk will give an overview of the methods used since the work of Alon et al. and highlight the main tools used for our combinatorial characterization. Based on joint work with Corto Mascle and Nathanaël Fijalkow. " target="_BLANK" title="Show location in Google Maps" > "Lille-Salle B21"
Tue 12th Nov 2:00 pm 3:30 pm	Seminar from Aliaume Lopez Speaker: Aliaume Lopez (www.lsv.fr/~lopez/) Title: Which polynomials are computed by N-weighted automata? Room: B21 Abstract: Given a semiring K, the notion of K-weighted automata generalizes regular languages to functions from Σ* to K. This model allows us to compute (multivariate) polynomial functions with coefficients in K. We provide a decidable characterization of polynomials with coefficients in Q that can be computed by K-weighted automata for K = (N,+,×) and for K = (Z+,×). As a consequence, we can decide which functions computed by Z-weighted automata are computed by N-weighted automata, under the assumption of commutativity (the order of the letters in the input does not matter) and polynomial growth (the output of the function is bounded by a polynomial in the size of the input). This surprisingly allows us to decide whether such functions are star-free, a notion borrowed from the theory of regular languages. " target="_BLANK" title="Show location in Google Maps" > "Lille-Salle B21"
Fri 11th Oct 10:30 am 12:00 pm	Seminar from Alexis de Colnet Speaker: Alexis de Colnet (www.ac.tuwien.ac.at/people/decolnet/) Title: An FPRAS for #NFA and #nFBDD Abstract: #NFA is the problem of counting the words of a given length accepted by a non-deterministic finite automaton (NFA). The problem is #P-hard but the approximate variant admits polynomial-time randomized algorithms (FPRAS, or fully-polynomial time randomized approximation schemes). Arenas, Croquevielle, Jayaram and Riveros were the first to show that #NFA admits an FPRAS and that this result extends to several other counting problems, in fact all problems in the class SpanL. In this talk we present another FPRAS for #NFA which applies to problems not covered by Arenas et al.'s result. In particular, the FPRAS described in this talk can be used for the problem of counting the satisfying assignments of non-deterministic read-once branching programs (nFBDD). Atrium bâtiment Esprit

← −− − 10 50 100 + ++ →