Links' Seminars and Public Events |
2024 | |
---|---|
Wed 20th Nov 11:00 am 12:00 pm | Seminar by Bastien Degardins Speaker: Bastien Degardins Room: Amphi Atrium (RdC Bâtiment ESPRIT) Title: Visualization and queries for biology in de Bruijn Graphs using relational databases. Abstract: Since the emergence of modern sequencers, sequence bioinformatics has become crucial for processing sequencing data, essential in medicine, biology, archaeology, and beyond. However, the growing data volumes require software adaptations to meet these new challenges, despite significant hardware advancements. Vizitig is a genomic and transcriptomic data visualization software based on De Bruijn graphs, offering an intuitive query system without needing to load indexes into RAM, allowing direct work on disk. By leveraging relational databases, which are highly optimized and backed by decades of research, Vizitig opens new avenues for research.It relies on NetworkDisk, a Python package that manages graphs in a relational database, simplifying software engineering. With a domain-specific language (DSL), Vizitig enables intuitive queries and easy manipulation of graph metadata, even for non-technical users. For experts, its compatibility with NetworkX provides additional possibilities in terms of graph manipulation. |
Fri 15th Nov 11:00 am 12:15 pm | Seminar by Gabriel Bathie Speaker: Gabriel Bathie (perso.ens-lyon.fr/gabriel.bathie/) Room: B21 Title: The complexity of Testing Regular Languages - Gabriel Bathie, Corto Mascle and Nathanaël Fijalkow (LaBRI, Université de Bordeaux) Abstract: Property testing is concerned with the design of algorithms making sublinear number of queries to distinguish whether the input satisfies a given property or is far from having this property. A seminal paper of Alon, Krivelevich, Newman, and Szegedy in 2001 introduced property testing of formal languages: the goal is to determine whether an input word belongs to a given language, or is far from any word in that language (in terms of Hamming distance). They constructed the first property testing algorithm for the class of all regular languages. Somewhat surprisingly, their algorithm uses a number of queries that does not depend on the length of the input word. This opened up a line of work with improved complexity results and applications to streaming algorithms. In this work, we show a trichotomy result: the class of regular languages can be divided into three classes, each of which is associated with an optimal testing complexity. Our analysis yields effective characterizations for all three classes using so-called minimal blocking sequences, reasoning directly and combinatorially on automata. This talk will give an overview of the methods used since the work of Alon et al. and highlight the main tools used for our combinatorial characterization. Based on joint work with Corto Mascle and Nathanaël Fijalkow. "Lille-Salle B21" |
Tue 12th Nov 2:00 pm 3:30 pm | Seminar from Aliaume Lopez Speaker: Aliaume Lopez (www.lsv.fr/~lopez/) Title: Which polynomials are computed by N-weighted automata? Room: B21 Abstract: Given a semiring K, the notion of K-weighted automata generalizes regular languages to functions from Σ* to K. This model allows us to compute (multivariate) polynomial functions with coefficients in K. We provide a decidable characterization of polynomials with coefficients in Q that can be computed by K-weighted automata for K = (N,+,×) and for K = (Z+,×). As a consequence, we can decide which functions computed by Z-weighted automata are computed by N-weighted automata, under the assumption of commutativity (the order of the letters in the input does not matter) and polynomial growth (the output of the function is bounded by a polynomial in the size of the input). This surprisingly allows us to decide whether such functions are star-free, a notion borrowed from the theory of regular languages. "Lille-Salle B21" |
Fri 11th Oct 10:30 am 12:00 pm | Seminar from Alexis de Colnet Speaker: Alexis de Colnet (www.ac.tuwien.ac.at/people/decolnet/) Title: An FPRAS for #NFA and #nFBDD Abstract: #NFA is the problem of counting the words of a given length accepted by a non-deterministic finite automaton (NFA). The problem is #P-hard but the approximate variant admits polynomial-time randomized algorithms (FPRAS, or fully-polynomial time randomized approximation schemes). Arenas, Croquevielle, Jayaram and Riveros were the first to show that #NFA admits an FPRAS and that this result extends to several other counting problems, in fact all problems in the class SpanL. In this talk we present another FPRAS for #NFA which applies to problems not covered by Arenas et al.'s result. In particular, the FPRAS described in this talk can be used for the problem of counting the satisfying assignments of non-deterministic read-once branching programs (nFBDD). Atrium bâtiment Esprit |
Fri 7th Jun 10:00 am 11:00 am | Séminaire Sam Van Gool dualité de Stone |
Thu 30th May to Fri 31st May all day | Pysemigroup Hackaton |
Fri 24th May 11:00 am 11:30 am | Séminaire Sophie Tison Speaker: Sophie Tison Title: Containment of Regular Path Queries Under Constraints |
Thu 16th May 2:00 pm 4:00 pm | Seminar Arkaprava Title: Efficient Optimization of Network Metrics in Large Uncertain Graphs Abstract: Graphs constitute an omnipresent data structure that can model objects and their relationships in a wide variety of real-world scenarios. The optimization of network metrics finds use in a plethora of real-world applications. Most of the exact techniques for such tasks turn out to be prohibitively time-consuming and memory-intensive for the huge graphs that are usually encountered. Thus, there is a need for efficient approximation algorithms. This talk focuses on the efficient optimization of network metrics in large uncertain graphs, and specifically the following three research problems. The first problem aims to find, between a given pair of nodes in an uncertain graph, the path having the highest probability of being a shortest path. The second problem aims to find, in an uncertain graph, the subgraph having the highest probability of being densest. The third problem is a novel variant of the well-known opinion maximization problem where, given a social network of users with real-valued opinions (about different candidates), the goal is to choose the top-k seed users maximizing a specific voting-based score at a given finite time horizon. Best Regards, Arkaprava "Lieu : Lille, Salle : B11" |
Fri 19th Apr 11:00 am 12:00 pm | Seminar Pierre Lermusiaux Speaker: Pierre Lermusiaux (plermusi.github.io/) Title: Detection of Uncaught Exceptions in Functional Programs by Abstract Interpretation Abstract: Exception handling is a key feature in modern programming languages. Exceptions can be used to deal with errors, or as a means to control the flow of execution of a program. Since they might unexpectedly terminate a program, unhandled exceptions are a serious safety concern. We propose a static analysis to detect uncaught exceptions in functional programs, that is defined as an abstract interpreter. It computes a description of the values potentially returned by a program using a novel abstract domain, that can express inductively defined sets of values. Simultaneously, the analysis infers the possibly raised exceptions, by computing in the abstract exception monad. This abstract interpreter has been implemented as an effective static analyser for a large subset of OCaml programs, that supports mutable data types, the OCaml module system, and dynamically extensible data types such as the exception type. The analyser has been evaluated on several hundreds of OCaml programs. |
Fri 5th Apr 10:30 am 11:30 am | Séminaire Guillaume Lagarde Titre: Scaling Neural Program Synthesis with Distribution-based Search Abstract: In this talk, we will discuss the problem of automatically constructing computer programs from input-output examples, especially when the target language is domain-specific and defined using a context-free grammar. I will introduce a theoretical framework called distribution-based search, discuss its challenges, and present several search strategies based on learning the weights of a probabilistic context-free grammar (PCFG) and then using this PCFG to enumerate the most promising candidate programs efficiently. The presentation will be based on the following paper published at AAAI'2022: arxiv.org/abs/2110.12485 Joint work with Nathanaël Fijalkow, Théo Matricon, Kevin Ellis, Pierre Ohlmann, Akarsh Potta |
Fri 2nd Feb 10:30 am 11:30 am | Mikaël Monet: Probabiliste Shapley value |
Fri 26th Jan 10:00 am 11:00 am | Séminaire: Klara Nosan Sujet: TBA |
2023 | |
Thu 14th Dec 2:00 pm 5:00 pm | Claire Soyez-Martin PhD defense Amphi IRCICA |
Fri 1st Dec 10:00 am 11:00 am | Séminaire Oliver Titre: Direct Access for Conjunctive Queries with Negation Abstract: Direct Access is the operation of returning, given an index j, the jth answer of a conjunctive query on a given database for a given order. While this problem is #P-hard in general (wrt combined complexity), many conjunctive queries are structured enough so that for some lexicographical ordering of their answers, one can have a direct access to the answer set of a query Q that takes polylogarithmic time in the size of the database after a polynomial time precomputation. Previous work has precisely characterised the tractable classes and given fined-grained lower bounds on the time needed for precomputation depending on the structure of the query. We give a generalisation of these tractability results to the case of signed conjunctive queries, that is, conjunctive queries that may contain negative atoms. Our technique is based on solving the direct access task for a class of circuits that can represent relational data. Our result then follows from the fact that the tractable (signed) conjunctive queries can be transformed into polynomial size circuits. We recover the known tractable classes from the literature in the case of positive conjunctive queries using this technique and also discover new islands of tractability for signed conjunctive queries. In particular, our result generalises to the Direct Access Problem the known tractabilities of counting the number of answers to beta-acyclic negative queries and of deciding whether a negative query with bounded nested-width has an answer. This is joint work with Florent Capelli. |
Fri 24th Nov 10:00 am 11:00 am | Séminaire Pierre Vandenhove |
Fri 17th Nov 10:00 am 11:00 am | Séminaire Charles (RsonPath) TBA |
Fri 10th Nov 10:00 am 11:00 am | Séminaire Nils Vortmeier title: TBA |
Fri 20th Oct 10:30 am 12:30 pm | Aurelien part II |
Fri 22nd Sep 11:00 am 12:00 pm | Séminaire Théo Losekoot Title: Automata-based verification of relational properties of functions over algebraic data structures |
Fri 15th Sep 11:00 am 12:30 pm | Charles: Présentation de rsonpath |
Fri 23rd Jun 11:00 am 12:00 pm | Seminar by Florent Capelli Speaker: Florent Capelli — florent.capelli.me/ Title: A simpler FPRAS for nOBDD Abstract: A simpler FPRAS for nOBDD Abstract: In this talk, we revisit the algorithm by Arenas, Croquevielle, Jayaram and Riveros that allows to approximate the number of words of length n of a non deterministic finite automaton. We explain the algorithm and techniques in a modular and general way, without relating to the particular case of counting words in automaton. We illustrate the soundness of the approach by applying it to the problem of approximatively counting the number of satisfying assignments of a non-deterministic OBDD. B21 |
Fri 2nd Jun 11:00 am 12:30 pm | Séminaire Martin Berger Title: Search-Based Regular Expression Inference on a GPU Abstract: Regular expression inference (REI) is a supervised machine learning and program synthesis problem that takes a cost metric for regular expressions, and positive and negative examples of strings as input. It outputs a regular expression that is precise (i.e., accepts all positive and rejects all negative examples), and minimal w.r.t. to the cost metric. We present a novel algorithm for REI over arbitrary alphabets that is enumerative and trades off time for space. Our main algorithmic idea is to implement the search space of regular expressions succinctly as a contiguous matrix of bitvectors. Collectively, the bitvectors represent, as characteristic sequences, all sub-languages of the infix-closure of the union of positive and negative examples. Mathematically, this is a semiring of (a variant of) formal power series. Infix-closure enables bottom-up compositional construction of larger from smaller regular expressions using the operations of our semiring. This minimises data movement and data-dependent branching, hence maximises data-parallelism. In addition, the infix-closure remains unchanged during the search, hence search can be staged: first pre-compute various expensive operations, and then run the compute intensive search process. We provide two C++ implementations, one for general purpose CPUs and one for Nvidia GPUs (using CUDA). We benchmark both on Google Colab Pro: the GPU implementation is on average over 1000x faster than the CPU implementation on the hardest benchmarks. Joint work with Mojtaba Valizadeh Download: martinfriedrichberger.net/pldi2023.html |
Thu 13th Apr 11:00 am 12:00 pm | Séminaire Yann Strozecki Esprit salle Agora 2 (rez-de-chaussée) |
Tue 11th Apr 2:00 pm 3:00 pm | Séminaire Mamadou Esprit Agora 1 (rez-de-chaussée) |
Fri 24th Mar 10:00 am 11:00 am | Séminaire Mamadou KANTE |
Fri 20th Jan 11:00 am 12:00 pm | Seminar by Tito Speaker: Lê Thành Dũng Nguyễn, aka “Tito” — nguyentito.eu/ Title: Polyregular functions: some recent developments Abstract: The class of polyregular functions is composed of the string-to-string functions computed by pebble transducers. While this machine model (which extends two-way finite transducers) is two decades old, several alternative characterizations of polyregular functions have been discovered recently [Bojańczyk 2018; Bojańczyk, Kiefer & Lhote 2019], demonstrating their canonicity. The name comes from the polynomial bound on the growth rate of these functions: |f(w)| = |w|^O(1) where |w| is the length of the string w. In this talk, after recalling this context, I will present some subsequent developments in which I have been involved: * the subclass of comparison-free polyregular (or “polyblind”) functions, definable through a natural restriction of pebble transducers, which Pierre Pradic and I actually discovered while studying a linear λ-calculus; * some results that either relate the growth rate of a polyregular function (comparison-free or not) to the “resources” needed to compute it, or show that there is no such relationship. |
Fri 13th Jan 11:00 am 12:00 pm | Seminar by Sarah Winter Speaker: Sarah Winter — sarahwinter.net/ Title: A Regular and Complete Notion of Delay for Streaming String Transducers Abstract: The notion of delay between finite transducers is a core element of numerous fundamental results of transducer theory. The goal of this work is to provide a similar notion for more complex abstract machines: we introduce a new notion of delay tailored to measure the similarity between streaming string transducers (SST). We show that our notion is regular: we design a finite automaton that can check whether the delay between any two SSTs executions is smaller than some given bound. As a consequence, our notion enjoys good decidability properties: in particular, while equivalence between non-deterministic SSTs is undecidable, we show that equivalence up to fixed delay is decidable. Moreover, we show that our notion has good completeness properties: we prove that two SSTs are equivalent if and only if they are equivalent up to some (computable) bounded delay. Together with the regularity of our delay notion, it provides an alternative proof that SSTs equivalence is decidable. Finally, the definition of our delay notion is machine-independent, as it only depends on the origin semantics of SSTs. As a corollary, the completeness result also holds for equivalent machine models such as deterministic two-way transducers, or MSO transducers. This is joint work with Emmanuel Filiot, Ismaël Jecker, and Christof Löding. |