CRISTIAN RIVEROS
Website
"In their capacity as a tool, computers will be but a ripple on the surface of our culture. In their capacity as intellectual challenge, they are without precedent in the cultural history of mankind." E. W. Dijkstra, Turing Award Lecture, 1972
Department of Computer Science
Vicuña Mackenna 4860
Edificio San Agustín, 4th floor
Macul, Santiago, 7820436
+56 2 23547407
cristian.riveros@uc.cl
Conference papers
2021
When is Approximate Counting for Conjunctive Queries Tractable?
[ pdf
]
With Marcelo Arenas, Rajesh Jayaram, and Luis Alberto Croquevielle. To appear in STOC.
Conference: 53rd Annual ACM Symposium on Theory of Computing
(STOC) - Rome, Italy.
Abstract:
Conjunctive queries are one of the most common class of queries used in database systems,
and the best studied in the literature. A seminal result of Grohe, Schwentick, and Segoufin (STOC 2001)
demonstrates that for every class G of graphs, the evaluation of all conjunctive queries whose underlying
graph is in G is tractable if, and only if, G has bounded treewidth. In this work, we extend this characterization
to the counting problem for conjunctive queries. Specifically, for every class C of conjunctive queries with
bounded treewidth, we introduce the first fully polynomial-time randomized approximation scheme (FPRAS)
for counting answers to a query in C, and the first polynomial-time algorithm for sampling answers uniformly from a query in C.
As a corollary, it follows that for every class G of graphs, the counting problem for conjunctive queries
whose underlying graph is in G admits an FPRAS if, and only if, G has bounded treewidth (unless BPP != P).
In fact, our FPRAS is more general, and also applies to conjunctive queries with bounded hypertree width,
as well as unions of such queries.
The key ingredient in our proof is the resolution of a fundamental counting problem from automata theory.
Specifically, we demonstrate the first FPRAS and polynomial time sampler for the set of trees of size n
accepted by a tree automaton, which improves the prior quasi-polynomial time randomized approximation scheme (QPRAS)
and sampling algorithm of Gore, Jerrum, Kannan, Sweedyk, and Mahaney '97. We demonstrate how this algorithm can
be used to obtain an FPRAS for many hitherto open problems, such as counting solutions to constraint satisfaction problems (CSP)
with bounded hypertree-width, counting the number of error threads in programs with nested call subroutines, and
counting valid assignments to structured DNNF circuits.
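As a toy illustration of why bounded-width structure makes counting tractable (a minimal Python sketch, not the paper's FPRAS; the relation E and the path-shaped query are invented for the example), exact counting for a path query reduces to a local dynamic program:

from collections import defaultdict

# Count answers to the path query Q(x, y, z) :- E(x, y), E(y, z)
# over a toy directed graph: each answer picks an in-edge and an
# out-edge at the middle variable y, so it suffices to aggregate
# indeg(y) * outdeg(y) locally, one vertex at a time.
edges = [(0, 1), (1, 2), (1, 3), (2, 3)]

indeg, outdeg = defaultdict(int), defaultdict(int)
for u, v in edges:
    outdeg[u] += 1
    indeg[v] += 1

count = sum(indeg[y] * outdeg[y] for y in set(indeg) | set(outdeg))
print(count)  # 3 answers: (0,1,2), (0,1,3), (1,2,3)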
Expressive power of linear algebra query languages
[ pdf
]
With Floris Geerts, Thomas Muñoz, and Domagoj Vrgoč. To appear in PODS.
Conference: 40th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems
(PODS) - Xi'an, Shaanxi, China.
Abstract:
Linear algebra algorithms often require some sort of iteration or recursion as is illustrated by standard algorithms for Gaussian elimination,
matrix inversion, and transitive closure. A key characteristic shared by these algorithms is that they allow looping for a number of steps that
is bounded by the matrix dimension. In this paper we extend the matrix query language MATLANG with this type of recursion, and show that this
suffices to express classical linear algebra algorithms. We study the expressive power of this language and show that it naturally corresponds
to arithmetic circuit families, which are often said to capture linear algebra. Furthermore, we analyze several sub-fragments of our language,
and show that their expressive power is closely tied to logical formalisms on semiring-annotated relations.
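As a rough illustration of the dimension-bounded iteration described above (a Python/NumPy sketch, not actual MATLANG syntax), transitive closure needs no more iterations than the matrix dimension:

import numpy as np

# Reachability (via at least one edge) by iterated matrix product,
# looping a number of steps bounded by the matrix dimension.
A = np.array([[0, 1, 0],
              [0, 0, 1],
              [0, 0, 0]])          # toy adjacency matrix

T = A.copy()
for _ in range(len(A)):            # at most n = dim(A) iterations
    T = np.clip(T + T @ A, 0, 1)   # add paths one edge longer
print(T)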
Ranked enumeration of MSO logic on words
[ pdf
]
With Pierre Bourhis, Alejandro Grez, and Louis Jachiet. To appear in ICDT.
Conference: 24th International Conference on Database Theory
(ICDT) - Nicosia, Cyprus.
Abstract:
In recent years, enumeration algorithms with bounded delay have attracted a lot of attention
for several data management tasks. Given a query and the data, the task is to preprocess the
data and then enumerate all the answers to the query one by one and without repetitions. This
enumeration scheme is typically useful when the solutions are treated on the fly or when we want
to stop the enumeration once the pertinent solutions have been found. However, with the current
schemes, there is no restriction on the order in which the solutions are given, and this order usually
depends on the techniques used rather than on their relevance to the user.
In this paper we study the enumeration of monadic second order logic (MSO) over words when
the solutions are ranked. We present a framework based on MSO cost functions that allows one to
express MSO formulae on words with a cost associated with each solution. We then demonstrate the
generality of our framework which subsumes, for instance, document spanners and regular complex
event processing queries and adds ranking to them. The main technical result of the paper is an
algorithm for enumerating all the solutions of formulae in increasing order of cost efficiently, namely,
with a linear preprocessing phase and logarithmic delay between solutions. The novelty of this
algorithm is based on using functional data structures, in particular, by extending functional Brodal
queues to suit the ranked enumeration of MSO on words.
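As a toy analogue of this ranked-enumeration regime, the sketch below enumerates the s-t paths of a weighted DAG in increasing order of cost, with an ordinary binary heap standing in for the paper's functional Brodal queues; the graph and weights are invented:

import heapq

dag = {'s': [('a', 1), ('b', 4)], 'a': [('t', 5), ('b', 1)],
       'b': [('t', 2)], 't': []}

def ranked_paths(source, target):
    heap = [(0, [source])]                 # (cost so far, partial path)
    while heap:
        cost, path = heapq.heappop(heap)
        if path[-1] == target:
            yield cost, path               # complete paths pop cheapest first
            continue
        for nxt, w in dag[path[-1]]:       # weights are nonnegative, so a
            heapq.heappush(heap, (cost + w, path + [nxt]))  # popped path is minimal

for cost, path in ranked_paths('s', 't'):
    print(cost, '->'.join(path))           # 4 s->a->b->t, then the two cost-6 paths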
2020
A Family of Centrality Measures for Graph Data Based on Subgraphs
[ pdf
]
With Jorge Salas. In ICDT.
Conference: 23rd International Conference on Database Theory
(ICDT) - Copenhagen, Denmark.
Abstract:
We present the theoretical foundations of a new approach in centrality measures for graph data.
The main principle of our approach is very simple: the more relevant subgraphs around a vertex,
the more central it is in the network. We formalize the notion of "relevant subgraphs" by
choosing a family of subgraphs that, given a graph G and a vertex v in G, assigns a subset
of connected subgraphs of G that contain v. Any such family defines a measure of centrality
by counting the number of subgraphs assigned to the vertex, i.e., a vertex will be more important
for the network if it belongs to more subgraphs in the family. We show many examples of this approach
and, in particular, we propose the all-subgraphs centrality, a centrality measure that takes every
subgraph into account. We study fundamental properties over families of subgraphs that guarantee
desirable properties over the corresponding centrality measure. Interestingly, all-subgraphs centrality
satisfies all these properties, showing its robustness as a notion for centrality. Finally, we study
the computational complexity of counting certain families of subgraphs and show a polynomial-time
algorithm to compute the all-subgraphs centrality for graphs with bounded treewidth.
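A brute-force sketch of the underlying definition (here instantiated with the family of all connected vertex-induced subgraphs; the toy graph is invented): count, for each vertex, the connected subgraphs that contain it. This is exponential in general, which is what makes the bounded-treewidth algorithm mentioned above interesting:

from itertools import combinations

adj = {1: {2, 3}, 2: {1, 3}, 3: {1, 2, 4}, 4: {3}}  # triangle plus a pendant

def connected(vertices):
    vertices = set(vertices)
    seen, stack = set(), [next(iter(vertices))]
    while stack:
        u = stack.pop()
        if u not in seen:
            seen.add(u)
            stack.extend(adj[u] & vertices)   # stay inside the chosen subgraph
    return seen == vertices

def all_subgraphs_centrality(v):
    return sum(1
               for k in range(1, len(adj) + 1)
               for S in combinations(adj, k)
               if v in S and connected(S))

print({v: all_subgraphs_centrality(v) for v in adj})  # vertex 3 is most central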
On the Expressiveness of Languages for Complex Event Recognition
[ pdf
]
With Alejandro Grez, Martin Ugarte, and Stijn Vansummeren. In ICDT.
Conference: 23rd International Conference on Database Theory
(ICDT) - Copenhagen, Denmark.
Abstract:
Complex Event Recognition (CER for short) has recently gained attention as a mechanism
for detecting patterns in streams of continuously arriving event data.
Numerous CER systems and languages have been proposed in the literature, commonly
based on combining operations from regular expressions (sequencing, iteration, and disjunction)
and relational algebra (e.g., joins and filters). While variables in these languages can only
bind single elements, they also provide capabilities for filtering sets of events that occur
inside iterative patterns; for example, requiring sequences of numbers to be increasing.
Unfortunately, these types of filters usually have ad-hoc syntax and under-defined semantics,
precisely because variables cannot bind sets of events. As a result, CER languages that provide filtering
of sequences commonly lack rigorous semantics and their expressive power is not understood.
In this paper we embark on two tasks: First, to define a denotational semantics for CER that
naturally allows binding and filtering sets of events; and second, to compare the expressive power
of this semantics with that of CER languages that only allow for binding single events.
Concretely, we introduce Set-based Complex Event Logic (S-CEL for short), a variation of the CER language
introduced by Grez et al. in which all variables bind to sets of matched events. We then compare
S-CEL with Event-based CEL (E-CEL), the language proposed by Grez et al. where variables bind single events.
We show that they are equivalent in expressive power when restricted to unary predicates but,
surprisingly, incomparable in general. Nevertheless, we show that if we restrict to sets
of binary predicates, then S-CEL is strictly more expressive than E-CEL. To get a better understanding
of the expressive power, computational capabilities, and limitations of S-CEL, we also investigate
the relationship between S-CEL and Complex Event Automata (CEA), a natural computational model for CER languages.
We define a property on CEA called the *-property and show that, under unary predicates, S-CEL captures
precisely the subclass of CEA that satisfies this property. Finally, we identify the operations that S-CEL
is lacking to characterize CEA and introduce a natural extension of the language that captures
the complete class of CEA under unary predicates.
Towards Streaming Evaluation of Queries with Correlation in Complex Event Processing
[ pdf
]
With Alejandro Grez. In ICDT.
Conference: 23rd International Conference on Database Theory
(ICDT) - Copenhagen, Denmark.
Abstract:
Complex event processing (CEP) has gained a lot of attention for evaluating complex patterns
over high-throughput data streams.
Recently, new algorithms for the evaluation of CEP patterns have emerged with
strong guarantees of efficiency, i.e. constant update-time per tuple and constant-delay enumeration.
Unfortunately, these techniques are restricted to patterns with local filters, limiting the
possibility of using joins for correlating the data of events that are far apart.
In this paper, we embark on the search for efficient evaluation algorithms of CEP
patterns with joins. We start by formalizing the so-called partition-by operator, a
standard operator in data stream management systems to correlate contiguous events on streams.
Although this operator is a restricted version of a join query, we show that partition-by
(without iteration) is as expressive as hierarchical queries, the largest class of full
conjunctive queries that can be evaluated with constant update-time and constant-delay
enumeration over streams. To evaluate queries with partition-by we introduce an automata model,
called chain complex event automata (chain-CEA), an extension of complex event automata that can
compare data values by using equalities and disequalities.
We show that chain-CEA is closed under determinization and is expressive enough to capture queries with partition-by.
More importantly, we provide an algorithm with constant update time and constant delay enumeration
for evaluating any query definable by chain-CEA, showing that all CEP queries with partition-by can
be evaluated with these strong guarantees of efficiency.
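A minimal sketch of the partition-by idea under simplifying assumptions (events as (key, payload) pairs and a fixed-length sequence pattern; this is plain Python bookkeeping, not chain-CEA): per-key state is updated in constant expected time per tuple.

from collections import defaultdict

def partitioned_matches(stream, pattern_len=2):
    state = defaultdict(list)                  # partition key -> recent events
    for key, payload in stream:
        window = state[key]
        window.append(payload)                 # constant-time update per tuple
        if len(window) > pattern_len:
            del window[0]                      # keep a bounded window per key
        if len(window) == pattern_len:
            yield key, tuple(window)

events = [('sensor1', 10), ('sensor2', 7), ('sensor1', 12), ('sensor2', 9)]
for key, match in partitioned_matches(events):
    print(key, match)                          # sensor1 (10, 12), sensor2 (7, 9)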
2019
Efficient Logspace Classes for Enumeration, Counting, and Uniform Generation
[ pdf
| slides
]
( Best paper award )
With Marcelo Arenas, Rajesh Jayaram, and Luis Alberto Croquevielle. In PODS.
Conference: 38th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles
of Database Systems
(PODS) - Amsterdam, The Netherlands.
Abstract:
In this work, we study two simple yet general complexity classes, based on
logspace Turing machines, which provide a unifying framework for efficient query
evaluation in areas like information extraction and graph databases, among others.
We investigate the complexity of three fundamental algorithmic problems for
these classes: enumeration, counting and uniform generation of solutions, and
show that they have several desirable properties in this respect.
Both complexity classes are defined in terms of nondeterministic logspace transducers (NL transducers).
For the first class, we consider the case of unambiguous NL transducers, and we
prove constant delay enumeration, and both counting and uniform generation of
solutions in polynomial time. For the second class, we consider unrestricted NL transducers,
and we obtain polynomial delay enumeration, approximate counting in polynomial time,
and polynomial-time randomized algorithms for uniform generation.
More specifically, we show that each problem in this second class
admits a fully polynomial-time randomized approximation scheme (FPRAS)
and a polynomial-time Las Vegas algorithm for uniform generation.
Interestingly, the key idea to prove these results is to show that the
fundamental problem #NFA admits an FPRAS, where #NFA is the problem of counting
the number of strings of length n accepted by a nondeterministic finite automaton (NFA).
While this problem is known to be #P-complete and, more precisely, SpanL-complete,
it was open whether this problem admits an FPRAS.
In this work, we solve this open problem, and obtain as a welcome corollary
that every function in SpanL admits an FPRAS.
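For intuition on the unambiguous case, where counting is exact and polynomial, the sketch below counts strings of length n accepted by an unambiguous NFA: unambiguity puts accepted strings in bijection with accepting runs, and runs are counted by a simple dynamic program. The automaton is a toy example accepting strings over {a, b} that end in 'b'; state 1 is final with no outgoing transitions, so every accepted string has exactly one run.

delta = {(0, 'a'): [0], (0, 'b'): [0, 1]}    # toy unambiguous NFA
alphabet, start, final, n = 'ab', 0, 1, 10

counts = {start: 1}                          # run prefixes ending in each state
for _ in range(n):
    nxt = {}
    for s in counts:
        for c in alphabet:
            for t in delta.get((s, c), []):
                nxt[t] = nxt.get(t, 0) + counts[s]
    counts = nxt

print(counts.get(final, 0))                  # 2**(n-1) = 512 accepted strings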
A Formal Framework for Complex Event Processing
[ pdf
]
With Alejandro Grez and Martin Ugarte. In ICDT.
Conference: 22nd International Conference on Database Theory
(ICDT) - Lisbon, Portugal.
Abstract:
Complex Event Processing (CEP) has emerged as the unifying field for technologies that require
processing and correlating distributed data sources in real-time. CEP finds applications in diverse
domains, which has resulted in a large number of proposals for expressing and processing complex
events. However, existing CEP languages lack a clear semantics, making them hard to
understand and generalize. Moreover, there are no general techniques for evaluating CEP query
languages with clear performance guarantees.
In this paper we embark on the task of giving a rigorous and efficient framework to CEP.
We propose a formal language for specifying complex events, called CEL, that contains the main
features used in the literature and has a denotational and compositional semantics. We also
formalize the so-called selection strategies, which had only been presented as by-design extensions
to existing frameworks. With a well-defined semantics at hand, we discuss how to efficiently
process complex events by evaluating CEL formulas with unary filters. We start by studying
the syntactical properties of CEL and propose rewriting optimization techniques for simplifying
the evaluation of formulas. Then, we introduce a formal computational model for CEP, called
complex event automata (CEA), and study how to compile CEL formulas with unary filters into
CEA. Furthermore, we provide efficient algorithms for evaluating CEA over event streams using
constant time per event followed by constant-delay enumeration of the results. Finally, we gather
the main results of this work to present an efficient and declarative framework for CEP.
A Worst-Case Optimal Join Algorithm for SPARQL
[ pdf
]
With Aidan Hogan, Carlos Rojas, and Adrian Soto. In ISWC.
Conference: The Semantic Web - ISWC 2019 - 18th International Semantic Web Conference
(ISWC) - Auckland, New Zealand.
Abstract:
Worst-case optimal multiway join algorithms have recently gained a lot of attention in the database literature.
These algorithms not only offer strong theoretical guarantees of efficiency, but have also been empirically
demonstrated to significantly improve query runtimes for relational and graph databases.
Despite these promising theoretical and practical results, however,
the Semantic Web community has yet to adopt such techniques; to the best of our knowledge,
no native RDF database currently supports such join algorithms; in this paper we demonstrate that this should change.
We propose a novel procedure for evaluating SPARQL queries based on an existing worst-case optimal join algorithm called Leapfrog Triejoin.
We propose an adaptation of this algorithm for evaluating SPARQL queries, and implement it in Apache Jena.
We then present experiments over the Berlin and WatDiv SPARQL benchmarks, and a novel benchmark that we propose
based on Wikidata that is designed to provide insights into join performance for a more diverse set of basic graph patterns.
Our results show that with this new join algorithm, Apache Jena often runs orders of magnitude faster than the base
version and two other SPARQL engines: Virtuoso and Blazegraph.
2018
Constant Delay Algorithms for Regular Document Spanners
[ pdf
| slides
]
With Fernando Florenzano, Martin Ugarte, Stijn Vansummeren, and Domagoj Vrgoč. In PODS.
Conference: 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles
of Database Systems
(PODS) - Houston, TX, USA.
Abstract:
Regular expressions and automata models with capture variables are
core tools in rule-based information extraction. These formalisms,
also called regular document spanners, use regular languages in
order to locate the data that a user wants to extract from a text
document, and then store this data into variables.
Since document spanners can easily generate large outputs, it is
important to have good evaluation algorithms that can generate the
extracted data in quick succession, and with relatively little
precomputation time. Towards this goal, we present a practical
evaluation algorithm that allows constant delay enumeration of a
spanner's output after a precomputation phase that is linear in the
document. While the algorithm assumes that the spanner is specified
in a syntactic variant of variable set automata, we also study how
it can be applied when the spanner is specified by general variable
set automata, regex formulas, or spanner algebras. Finally, we study
the related problem of counting the number of outputs of a document
spanner, providing a fine-grained analysis of the classes of
document spanners that support efficient enumeration of their
results.
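For flavour, here is span extraction in miniature, with Python's re module standing in for a variable-set automaton (an eager sketch without the constant-delay guarantees studied above; the document and pattern are invented):

import re

doc = "Ada Lovelace <ada@example.org>, Alan Turing <alan@example.org>"
pattern = re.compile(r'(?P<name>[A-Z][a-z]+ [A-Z][a-z]+) <(?P<email>[^>]+)>')

for m in pattern.finditer(doc):
    # a spanner reports spans (start, end) per capture variable
    print({var: m.span(var) for var in ('name', 'email')})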
Document Spanners for Extracting Incomplete Information: Expressiveness
and Complexity
[ pdf
]
With Francisco Maturana and Domagoj Vrgoč. In PODS.
Conference: 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles
of Database Systems
(PODS) - Houston, TX, USA.
Abstract:
Rule-based information extraction has lately received a fair amount of attention from the database community, with several languages appearing in the last few years. Although information extraction systems are intended to deal with semistructured data, all language proposals introduced so far are designed to output relations, thus making them incapable of handling incomplete information. To remedy the situation, we propose a theoretical framework which supports the use of mappings, thus allowing us to work with documents which have missing or optional parts. Using this approach, we simplify the semantics of regex formulas and extraction rules, two previously defined methods for extracting information, extend them with the ability to handle incomplete data, and study how they compare in terms of expressive power. We also study computational properties of the two languages, focusing on the query enumeration problem, as well as satisfiability and containment.
Pumping Lemmas for Weighted Automata
[ pdf
]
With Filip Mazowiecki. In STACS.
Conference: 35th Symposium on Theoretical Aspects of Computer Science
(STACS) - Caen, France.
Abstract:
We present three pumping lemmas for three classes of functions definable by fragments of weighted automata over the min-plus semiring and the semiring of natural numbers. As a corollary we show that the hierarchy of functions definable by unambiguous, finitely-ambiguous, polynomially-ambiguous weighted automata, and the full class of weighted automata is strict for the min-plus semiring.
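A tiny worked example of a weighted automaton over the min-plus semiring (two invented states; semiring addition is min, multiplication is +): state 0 charges 1 per 'a', state 1 charges 1 per 'b', and the value of a word is the minimum over runs, so the automaton computes f(w) = min(#a, #b), a finitely-ambiguous function:

import math

INF = math.inf
M = {'a': [[1, INF], [INF, 0]],    # cost of staying in state 0 on 'a' is 1
     'b': [[0, INF], [INF, 1]]}    # cost of staying in state 1 on 'b' is 1

def value(word):
    v = [0, 0]                     # both states start for free
    for c in word:
        v = [min(v[p] + M[c][p][q] for p in range(2)) for q in range(2)]
    return min(v)                  # both states are final

print(value('aababb'))             # min(#a, #b) = min(3, 3) = 3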
2017
Descriptive Complexity for counting complexity classes
[ pdf
| slides
]
With Marcelo Arenas and Martin Muñoz. In LICS.
Conference: 32nd Annual ACM/IEEE Symposium on Logic in Computer Science
(LICS) - Reykjavik, Iceland.
Abstract:
Descriptive Complexity has been very successful in characterizing complexity classes
of decision problems in terms of the properties definable in some logics.
However, descriptive complexity for counting complexity classes, such as FP and #P,
has not been systematically studied, and it is not as developed as its decision counterpart.
In this paper, we propose a framework based on Weighted Logics to address this issue.
Specifically, by focusing on the natural numbers we obtain a logic called
Quantitative Second Order Logics (QSO), and show how some of its fragments
can be used to capture fundamental counting complexity classes such as FP, #P and FPSPACE, among others.
We also use QSO to define a hierarchy inside #P, identifying counting complexity
classes with good closure and approximation properties, and which
admit natural complete problems. Finally, we add recursion to QSO, and show
how this extension naturally captures lower counting complexity classes such as #L.
Probabilistic Automata of Bounded Ambiguity
[ pdf
]
With Nathanaël Fijalkow and James Worrell. In CONCUR.
Conference: 28th International Conference on Concurrency Theory
(CONCUR) - Berlin, Germany.
Abstract:
Probabilistic automata are a computational model introduced by
Michael Rabin, extending nondeterministic finite automata with
probabilistic transitions. Despite its simplicity, this model is
very expressive and many of the associated algorithmic questions are
undecidable. In this work we focus on the emptiness problem, which
asks whether a given probabilistic automaton accepts some word with
probability higher than a given threshold. We consider a natural
and well-studied structural restriction on automata, namely the
degree of ambiguity, which is defined as the maximum number of
accepting runs over all words. We observe that undecidability of
the emptiness problem requires infinite ambiguity and so we focus on
the case of finitely ambiguous probabilistic automata.
Our main results are to construct efficient algorithms for analysing
finitely ambiguous probabilistic automata through a reduction to a
multi-objective optimisation problem, called the stochastic path
problem. We obtain a polynomial time algorithm for approximating
the value of finitely ambiguous probabilistic automata and a
quasi-polynomial time algorithm for the emptiness problem for
2-ambiguous probabilistic automata.
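The underlying model is concrete enough to state in a few lines: a probabilistic automaton assigns each word an acceptance probability through products of stochastic matrices, and emptiness asks whether some word exceeds a threshold. A minimal NumPy sketch with an invented two-state automaton:

import numpy as np

M = {'a': np.array([[0.5, 0.5], [0.0, 1.0]]),   # row-stochastic transition
     'b': np.array([[1.0, 0.0], [0.3, 0.7]])}   # matrices, one per letter
initial = np.array([1.0, 0.0])                  # start in state 0
final = np.array([0.0, 1.0])                    # state 1 is accepting

def acceptance_probability(word):
    v = initial
    for c in word:
        v = v @ M[c]                            # push the distribution forward
    return float(v @ final)

print(acceptance_probability('ab'))             # 0.35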
2016
Copyless Cost-Register Automata: Structure, Expressiveness, and Closure
Properties
[ pdf
]
With Filip Mazowiecki. In STACS.
Conference: 33rd Symposium on Theoretical Aspects of Computer Science
(STACS) - Orleans, France.
Abstract:
Cost register automata (CRA) and their subclass, copyless CRA, were recently proposed by Alur et al. as a new model for computing functions over strings.
We study structural properties, expressiveness, and closure properties of copyless CRA.
We show that copyless CRA are strictly less expressive than weighted automata and are not closed under the reverse operation.
To find a better class we impose restrictions on copyless CRA, which leads to a new robust computational model that is closed under reverse and other extensions.
A framework for annotating CSV-like data
[ pdf
]
With Marcelo Arenas, Francisco Maturana, and Domagoj Vrgoč. In VLDB.
Conference: Proceedings of the Very Large Data Base Endowment
(VLDB) - New Delhi, India.
Abstract:
In this paper, we propose a simple and expressive framework for adding metadata to CSV documents and their noisy variants.
The framework is based on annotating parts of the document that can be later used to read, query, or exchange the data.
The core of our framework is a language based on extended regular expressions that are used for selecting data.
These expressions are then combined using a set of rules in order to annotate the data.
We study the computational complexity of implementing our framework and present an efficient evaluation algorithm
that runs in time proportional to its output and linear in its input. As a proof of concept,
we test an implementation of our framework against a large number of real world datasets and show that it can be efficiently used in practice.
Querying Wikidata: Comparing SPARQL, Relational and Graph Databases
[ pdf
]
With Daniel Hernández, Aidan Hogan, Carlos Rojas, and Enzo Zerega. In ISWC.
Conference: The Semantic Web - ISWC 2016 - 15th International Semantic Web Conference
(ISWC) - Kobe, Japan.
Abstract:
In this paper, we experimentally compare the efficiency of various database engines for the purposes of querying the Wikidata knowledge-base, which can be conceptualised as a directed edge-labelled graph where edges can be annotated with meta-information called qualifiers. We take two popular SPARQL databases (Virtuoso, Blazegraph), a popular relational database (PostgreSQL), and a popular graph database (Neo4J) for comparison and discuss various options as to how Wikidata can be represented in the models of each engine. We design a set of experiments to test the relative query performance of these representations in the context of their respective engines. We first execute a large set of atomic lookups to establish a baseline performance for each test setting, and subsequently perform experiments on instances of more complex graph patterns based on real-world examples. We conclude with a summary of the strengths and limitations of the engines observed.
2015
Maximal Partition Logic: Towards a Logical Characterization of Copyless
Cost Register Automata
[ pdf
]
With Filip Mazowiecki. In CSL.
Conference: 24th EACSL Annual Conference on Computer Science Logic
(CSL) - Berlin, Germany.
Abstract:
It is highly desirable for a computational model to have a logic characterization, as in the seminal work of Büchi that connects MSO with finite automata. For example, weighted automata are the quantitative extension of finite automata for computing functions over words and they can be naturally characterized by a subfragment of weighted logic introduced by Droste and Gastin. Recently, cost register automata (CRA) were introduced by Alur et al. as an alternative model for weighted automata. In hope of finding decidable subclasses of weighted automata, they proposed to restrict their model with the so-called copyless restriction. Unfortunately, copyless CRA do not enjoy good closure properties and, therefore, a logical characterization of this class seems to be unlikely.
In this paper, we introduce a new logic called maximal partition logic (MP) for studying the expressiveness of copyless CRA. In contrast to the previous approaches (i.e. weighted logics), MP is based on a new set of "regular" quantifiers that partition a word into maximal subwords, compute the output of a subformula over each subword separately, and then aggregate these outputs with a semiring operation.
We study the expressiveness of MP and compare it with weighted logics. Furthermore, we show that MP is as expressive as a natural subclass of copyless CRA.
This shows the first logical characterization of copyless CRA and it gives a better understanding of the copyless restriction in weighted automata.
2013
Quantitative Monadic Second-Order Logic
[ pdf
| slides
]
With Stephan Kreutzer. In LICS.
Conference: 28th Annual IEEE Symposium on Logic in Computer Science
(LICS) - New Orleans, USA.
Abstract:
While monadic second-order logic is a prominent
logic for specifying languages of finite words, it
lacks the power to compute quantitative properties, e.g.
to count. An automata model capable of computing such
properties are weighted automata, but logics equivalent to
these automata have only recently emerged.
We propose a new framework for adding quantitative
properties to logics specifying Boolean properties of words.
We use this to define Quantitative Monadic Second-Order
Logic (QMSO). In this way we obtain a simple logic which
is as expressive as weighted automata. We analyse its
evaluation complexity, both data and combined complexity,
and show completeness results for combined complexity.
We further refine the analysis of this logic and obtain
fragments that characterise exactly subclasses of weighted
automata defined by the level of ambiguity allowed in the
automata. In this way, we define a quantitative logic which
has good decidability properties while being reasonably
expressive and enjoying a simple syntactical definition.
Which DTDs are streaming bounded repairable?
[ pdf
| slides
]
With Pierre Bourhis and Gabriele Puppis. In ICDT.
Conference: 16th International Conference on Database Theory
(ICDT) - Genova, Italy.
Abstract:
Integrity constraint management concerns both checking
whether data is valid and taking action to restore correctness
when invalid data is discovered.
In XML the notion of valid data can be captured by schema languages
such as Document Type Definitions (DTDs)
and more generally XML schemas.
DTDs have the property that constraint checking can be done
in streaming fashion. In this paper we consider when the corresponding
action to restore validity (repair)
can be done in streaming fashion.
We formalize this as the problem of determining, given a DTD,
whether or not a streaming procedure exists that transforms an input document
so as to satisfy the DTD, using a number of edits independent of the document.
We show that this problem is decidable. In fact, we show the decidability
of a more general problem, allowing a more general class of schemas
than DTDs, and requiring a repair procedure that works
only for documents that are already known to satisfy another class of
constraints.
The decision procedure relies on a new analysis of the structure of DTDs,
reducing to a novel notion of game played on pushdown systems associated
with the schemas.
2012
Bounded repairability for regular tree languages
[ pdf
| slides
]
With Gabriele Puppis and Slawek Staworko. In ICDT.
Conference: 15th International Conference on Database Theory
(ICDT) - Berlin, Germany.
Abstract:
We consider the problem of repairing unranked trees (e.g., XML documents)
satisfying a given restriction specification R (e.g., a DTD) into unranked
trees satisfying a given target specification T. Specifically, we focus on
the question of whether one can get from any tree in a regular language R
to some tree in another regular language T with a finite, uniformly bounded,
number of edit operations (i.e., deletions and insertions of nodes).
We give effective characterizations of the pairs of specifications R and T for
which such a uniform bound exists, and we study the complexity of the problem
under different representations of the regular tree languages (e.g., non-deterministic
stepwise automata, deterministic stepwise automata, DTDs). Finally, we point out some
connections with the analogous problem for regular languages of words,
which was previously studied in [6].
2011
The cost of traveling between languages
[ pdf
| slides
]
With Michael Benedikt and Gabriele Puppis. In ICALP.
Conference: 38th International Colloquium on Automata, Languages and Programming
(ICALP) - Zürich, Switzerland.
Abstract: We show how to calculate the maximum number of edits per character needed
to convert any string in one regular language to a string in another
language. Our algorithm makes use of a local determinization
procedure applicable to a subclass of distance automata.
We then show how to calculate the same property when the editing
needs to be done in streaming fashion, by a finite state transducer, using
a reduction to mean-payoff games. We show that the optimal streaming editor
can be produced in PTIME.
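The per-character measure above is built on the classic string edit (Levenshtein) distance, which the paper lifts from pairs of strings to pairs of regular languages; for reference, the standard dynamic program over two strings:

def levenshtein(s, t):
    prev = list(range(len(t) + 1))               # distances from "" to prefixes of t
    for i, a in enumerate(s, 1):
        cur = [i]
        for j, b in enumerate(t, 1):
            cur.append(min(prev[j] + 1,          # delete a
                           cur[j - 1] + 1,       # insert b
                           prev[j - 1] + (a != b)))  # substitute (free if equal)
        prev = cur
    return prev[-1]

print(levenshtein("kitten", "sitting"))          # 3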
Regular repair of specifications
[ pdf
| slides
]
With Michael Benedikt and Gabriele Puppis. In LICS.
Conference: 26th Annual IEEE Symposium on Logic in Computer Science
(LICS) - Toronto, Canada.
Abstract: What do you do if a computational object (e.g. program trace) fails a specification?
An obvious approach is to perform repair: modify the object minimally to get
something that satisfies the constraints.
In this paper we study repair of temporal
constraints, given as automata or temporal logic formulas. We focus on
determining the number of repairs that must be applied to a word satisfying
a given input constraint in order to ensure that it satisfies a given target
constraint.
This number may well be unbounded; one of our main contributions is to
isolate the complexity of the "bounded repair problem", based on
a characterization of the pairs of regular languages that admit such a repair. We
consider this in the setting where the repair strategy is unconstrained and also
when the strategy is restricted to use finite memory.
Although the streaming setting is quite different from the general setting, we find
that there are surprising connections between streaming and non-streaming, as well
as within variants of the streaming problem.
2010
Foundations of schema mapping management
[ pdf
]
With Marcelo Arenas, Jorge Pérez, and Juan Reutter. In PODS.
Conference: 29th ACM Symposium on Principles of Database Systems
(PODS) - Indianapolis, USA.
Abstract: In the last few years, a lot of attention has been paid to the specification
and subsequent manipulation of schema mappings, a problem
which is of fundamental importance in metadata management.
There have been many achievements in this area, and semantics
have been defined for operators on schema mappings such as composition
and inverse. However, little research has been pursued
towards providing formal tools to compare schema mappings, in
terms of their ability to transfer data and avoid storing redundant
information, which has hampered the development of foundations
for more complex operators as many of them involve these notions.
In this paper, we address the problem of providing foundations
for metadata management by developing an order to compare the
amount of information transferred by schema mappings. From this
order we derive several other criteria to compare mappings, we provide
tools to deal with these criteria, and we show their usefulness
in defining and studying schema mapping operators. More precisely,
we show how the machinery developed can be used to study
the extract and merge operators, that have been identified as fundamental
for the development of a metadata management framework.
We also use our machinery to provide simpler proofs for
some fundamental results regarding the inverse operator, and we
give an effective characterization for the decidability of the well-known
schema evolution problem.
2009
Inverting schema mappings: Bridging the gap between theory and practice
[ pdf
]
With Marcelo Arenas, Jorge Pérez, and Juan Reutter. In VLDB.
Conference: 35th International Conference on Very Large Data Bases
(VLDB) - Lyon, France.
Abstract:
The inversion of schema mappings has been identified as
one of the fundamental operators for the development of a
general framework for metadata management. In fact, during
the last years three alternative notions of inversion for
schema mappings have been proposed (Fagin-inverse,
quasi-inverse and maximum recovery). However, the
procedures that have been developed for computing these
operators have some features that limit their practical
applicability. First, these algorithms work in exponential time
and produce inverse mappings of exponential size. Second,
these algorithms express inverses in some mappings languages
which include features that are difficult to use in
practice. A typical example is the use of disjunction in the
conclusion of the mapping rules, which makes the process of
exchanging data much more complicated.
In this paper, we propose solutions for the two problems
mentioned above. First, we provide a polynomial time algorithm
that computes the three inverse operators mentioned
above given a mapping specified by a set of tuple-generating
dependencies (tgds). This algorithm uses an output mapping
language that can express these three operators in a
compact way and, in fact, can compute inverses for a much
larger class of mappings. Unfortunately, it has already been
proved that this type of mapping languages has to include
some features that are difficult to use in practice and, hence,
this is also the case for our output mapping language. Thus,
as our second contribution, we propose a new and natural
notion of inversion that overcomes this limitation. In particular,
every mapping specified by a set of tgds admits an
inverse under this new notion that can be expressed in a
mapping language that slightly extends tgds, and that has
the same good properties for data exchange as tgds.
Finally, as our last contribution, we provide an algorithm for
computing such inverses.
2008
The Recovery of a schema mapping: Bringing exchanged data back
[ pdf
]
With Marcelo Arenas and Jorge Pérez. In PODS.
Conference: 27th ACM Symposium on Principles of Database Systems
(PODS) - Vancouver, Canada.
Abstract:
A schema mapping is a specification that describes how data from a source schema is to be mapped to a target
schema. Once the data has been transferred from the source to the target, a natural question is whether one can
undo the process and recover the initial data, or at least part of it. In fact, it would be desirable to find a reverse
schema mapping from target to source that specifies how to bring the exchanged data back.
In this paper, we introduce the notion of a recovery of a schema mapping: it is a reverse mapping M' for a
mapping M that recovers sound data with respect to M. We further introduce an order relation on recoveries.
This allows us to choose mappings that recover the maximum amount of sound information. We call such mappings
maximum recoveries. We study maximum recoveries in detail, providing a necessary and sufficient condition
for their existence. In particular, we prove that maximum recoveries exist for the class of mappings specified
by FO-TO-CQ source-to-target dependencies. This class subsumes the class of source-to-target tuple-generating
dependencies used in previous work on data exchange. For the class of mappings specified by FO-TO-CQ dependencies,
we provide an exponential-time algorithm for computing maximum recoveries, and a simplified version
for full dependencies that works in quadratic time. We also characterize the language needed to express maximum
recoveries, and we include a detailed comparison with the notion of inverse (and quasi-inverse) mapping
previously proposed in the data exchange literature. In particular, we show that maximum recoveries strictly
generalize inverses. We finally study the complexity of some decision problems related to the notions of recovery
and maximum recovery.
Journal versions
2020
Efficient Enumeration Algorithms for Regular Document Spanners
[ pdf
]
With Fernando Florenzano, Martin Ugarte, Stijn Vansummeren, and Domagoj Vrgoč. In TODS.
Journal: ACM Transactions on Database Systems
(TODS)
Volume: 45
(1)
Pages: 3:1-3:42
Abstract:
Regular expressions and automata models with capture variables are
core tools in rule-based information extraction. These formalisms, also
called regular document spanners, use regular languages to locate the
data that a user wants to extract from a text document and then store
this data into variables. Since document spanners can easily generate
large outputs, it is important to have efficient evaluation algorithms
that can generate the extracted data in quick succession, and with relatively
little precomputation time. Toward this goal, we present a practical
evaluation algorithm that allows output-linear delay enumeration of a spanner’s
result after a precomputation phase that is linear in the document.
Although the algorithm assumes that the spanner is specified in a
syntactic variant of variable-set automata, we also study how it
can be applied when the spanner is specified by general variable-set
automata, regex formulas, or spanner algebras. Finally, we study the
related problem of counting the number of outputs of a document spanner
and provide a fine-grained analysis of the classes of document spanners
that support efficient enumeration of their results.
Descriptive Complexity for Counting Complexity Classes
[ pdf
]
With Marcelo Arenas and Martin Muñoz. In LMCS.
Journal: Logical Methods in Computer Science
(LMCS)
Volume: 16
(1)
Pages: 9:1-9:42
Abstract:
Descriptive Complexity has been very successful in characterizing
complexity classes of decision problems in terms of the properties
definable in some logics. However, descriptive complexity for
counting complexity classes, such as FP and #P, has not been
systematically studied, and it is not as developed as its
decision counterpart. In this paper, we propose a framework based
on Weighted Logics to address this issue. Specifically, by focusing
on the natural numbers we obtain a logic called Quantitative Second
Order Logics (QSO), and show how some of its fragments can be used
to capture fundamental counting complexity classes such as FP, #P
and FPSPACE, among others. We also use QSO to define a hierarchy
inside #P, identifying counting complexity classes with good closure
and approximation properties, and which admit natural complete problems.
Finally, we add recursion to QSO, and show how this extension naturally
captures lower counting complexity classes such as #L.
Efficient Logspace Classes for Enumeration, Counting, and Uniform Generation
[ pdf
]
With Marcelo Arenas, Rajesh Jayaram, and Luis Alberto Croquevielle. In SIGMOD Record.
Journal: ACM SIGMOD Record
(SIGMOD Record)
Volume: 49
(1)
Pages: 52-59
Abstract:
We study two simple yet general complexity classes,
which provide a unifying framework for efficient query evaluation
in areas like graph databases and information extraction, among others.
We investigate the complexity of three fundamental algorithmic problems
for these classes: enumeration, counting and uniform generation of solutions,
and show that they have several desirable properties in this respect.
Both complexity classes are defined in terms of nondeterministic logarithmic-space transducers (NL transducers).
For the first class, we consider the case of unambiguous NL transducers, and we
prove constant delay enumeration, and both counting and uniform generation of solutions
in polynomial time. For the second class, we consider unrestricted NL transducers, and we
obtain polynomial delay enumeration, approximate counting in polynomial time, and polynomial-time randomized
algorithms for uniform generation. More specifically, we show that each problem in this second class admits a
fully polynomial-time randomized approximation scheme (FPRAS) and a polynomial-time Las Vegas algorithm
(with preprocessing) for uniform generation. Remarkably, the key idea to prove these
results is to show that the fundamental problem #NFA admits an FPRAS, where #NFA is the problem
of counting the number of strings of length n (given in unary) accepted by a
non-deterministic finite automaton (NFA).
While this problem is known to be #P-complete and, more precisely, SpanL-complete, it was
open whether this problem admits an FPRAS. In this work, we solve this open problem,
and obtain as a welcome corollary that every function in SpanL admits an FPRAS.
Probabilistic Automata of Bounded Ambiguity
[ pdf
]
With Nathanaël Fijalkow and James Worrell. In Information and Computation.
Journal: Information and Computation
(Information and Computation)
Info: To appear in November 2020.
Abstract:
Probabilistic automata are an extension of nondeterministic finite automata in which
transitions are annotated with probabilities. Despite its simplicity, this model is very
expressive and many algorithmic questions are undecidable.
In this work we focus on the emptiness problem (and its variant the value problem), which asks
whether a given probabilistic automaton accepts some word with probability greater
than a given threshold. We consider finitely ambiguous probabilistic automata.
Our main contributions are to construct efficient algorithms for analysing finitely
ambiguous probabilistic automata through a reduction to a multi-objective optimisation
problem called the stochastic path problem. We obtain a polynomial time algorithm for
approximating the value of probabilistic automata of fixed ambiguity and a quasi-polynomial
time algorithm for the emptiness problem for 2-ambiguous probabilistic automata.
We complement these positive results by an inapproximability result stating
that the value of finitely ambiguous probabilistic automata cannot be approximated unless P=NP.
2019
Copyless cost-register automata: Structure, expressiveness, and closure properties
[ pdf
]
With Filip Mazowiecki. In JCSS.
Journal: Journal of Computer and System Sciences
(JCSS)
Volume: 100
Pages: 1-29
Abstract:
Cost register automata (CRA) and their subclass, copyless CRA, were recently
proposed by Alur et al. as a new model for computing functions over strings.
We study some structural properties, expressiveness, and closure properties
of copyless CRA. We show that copyless CRA are strictly less expressive than
weighted automata and are not closed under the reverse operation. To find a better
class we impose restrictions on copyless CRA, which leads to
a new robust computational model that is closed under reverse and other extensions.
2016
Bounded Repairability for Regular Tree Languages
[ pdf
]
With Pierre Bourhis, Gabriele Puppis, and Slawek Staworko. In TODS.
Journal: ACM Transactions on Database Systems
(TODS)
Volume: 41
(3)
Pages: 18:1-18:45
Abstract:
We study the problem of bounded repairability of a given restriction tree
language R into a target tree language T. More precisely, we say that R is
bounded repairable with respect to T if there exists a bound on the number
of standard tree editing operations necessary to apply to any tree in R
to obtain a tree in T. We consider a number of possible specifications for
tree languages: bottom-up tree automata (on the curry encoding of unranked trees)
that capture the class of XML schemas and document type definitions (DTDs).
We also consider a special case when the restriction language R is universal
(i.e., contains all trees over a given alphabet).
We give an effective characterization of bounded repairability between
pairs of tree languages represented with automata. This characterization
introduces two tools—synopsis trees and a coverage relation between them—allowing
one to reason about tree languages that undergo a bounded number of editing
operations. We then employ this characterization to provide upper bounds to
the complexity of deciding bounded repairability and show that these bounds
are tight. In particular, when the input tree languages are specified with
arbitrary bottom-up automata, the problem is coNExp-complete. The problem
remains coNExp-complete even if we use deterministic nonrecursive DTDs
to specify the input languages. The complexity of the problem can be
reduced if we assume that the alphabet, the set of node labels, is fixed:
the problem becomes PSpace-complete for nonrecursive DTDs and coNP-complete
for deterministic nonrecursive DTDs. Finally, when the restriction tree
language R is universal, we show that the bounded repairability problem
becomes Exp-complete if the target language is specified by an arbitrary
bottom-up tree automaton and becomes tractable (P-complete, in fact) when
a deterministic bottom-up automaton is used.
A framework for annotating CSV-like data
[ pdf
]
With Marcelo Arenas, Francisco Maturana, and Domagoj Vrgoč. In VLDB.
Journal: Proceedings of the Very Large Data Base Endowment
(VLDB)
Volume: 9
(11)
Pages: 876-887
Abstract:
In this paper, we propose a simple and expressive framework for adding metadata to CSV documents and their noisy variants.
The framework is based on annotating parts of the document that can be later used to read, query, or exchange the data.
The core of our framework is a language based on extended regular expressions that are used for selecting data.
These expressions are then combined using a set of rules in order to annotate the data.
We study the computational complexity of implementing our framework and present an efficient evaluation algorithm
that runs in time proportional to its output and linear in its input. As a proof of concept,
we test an implementation of our framework against a large number of real world datasets and show that it can be efficiently used in practice.
2015
Which XML Schemas are Streaming Bounded Repairable?
[ pdf
]
With Pierre Bourhis and Gabriele Puppis. In ToCS.
Journal: Theory of Computing Systems
(ToCS)
Volume: 57
(4)
Pages: 1250-1321
Abstract:
In this paper we consider the problem of repairing, that is, restoring validity of,
documents with respect to XML schemas. We formalize this as the problem of determining,
given an XML schema, whether or not a streaming procedure exists that transforms an
input document so as to satisfy the XML schema, using a number of edits independent
of the document. We show that this problem is decidable. In fact, we show the
decidability of a more general problem, which allows the repair procedure to work on
documents that are already known to satisfy another XML schema. The decision procedure
relies on the analysis of the structure of an automaton model specifying the restriction
and target XML schemas and reduces the problem to a novel notion of game played on
pushdown systems associated with the schemas.
2014
The per-character cost of repairing word languages
[ pdf
]
With Michael Benedikt and Gabriele Puppis. In TCS.
Journal: Theoretical Computer Science
(TCS)
Volume: 539
Pages: 38-67
Abstract:
We show how to calculate the maximum number of edits per character
needed to convert any string in one regular language to a string in
another language. Our algorithm makes use of a local determinization
procedure applicable to a subclass of distance automata. We then show
how to calculate the same property when the editing needs to be done
in streaming fashion, by a finite state transducer, using a reduction
to mean-payoff games. In this case, we show that the optimal streaming
editor can be produced in P.
2013
Bounded repairability of word languages
[ pdf
]
With Michael Benedikt and Gabriele Puppis. In JCSS.
Journal: Journal of Computer and System Sciences
(JCSS)
Volume: 79
(8)
Pages: 1302–1321
Abstract:
What do you do if a computational object (e.g. program trace) fails a
specification? An obvious approach is to perform a repair: modify the
object minimally to get something that satisfies the constraints.
This approach has been investigated in the database community,
for integrity constraints, and in the AI community for propositional logics.
Here we study how difficult it is to repair a document in the form of a string.
Specifically, we consider the number of edits that must be applied to
an input string in order to satisfy a given target language.
This number may be unbounded; our main contribution is to isolate
the complexity of the bounded repair problem based on a characterization
of the regular languages that admit bounded repair.
We consider the settings where the repair strategy is unconstrained and
when the editing must be produced in a streaming way,
i.e. by a letter-to-letter transducer.
The language of plain SO-tgds: composition, inversion and structural properties
[ pdf
]
With Marcelo Arenas, Jorge Pérez, and Juan Reutter. In JCSS.
Journal: Journal of Computer and System Sciences
(JCSS)
Volume: 79
(6)
Pages: 763–784
Abstract:
The problems of composing and inverting schema mappings specified by source-to-target tuple-generating
dependencies (st-tgds) have attracted a lot of attention, as they are of fundamental importance for the
development of Bernstein’s metadata management framework. In the case of the composition operator,
a natural semantics has been proposed and the language of second-order tuple generating dependencies
(SO-tgds) has been identified as the right language to express it. In the case of the inverse operator,
several semantics have been proposed, most notably the maximum recovery, the only inverse notion that
guarantees that every mapping specified by st-tgds is invertible. Unfortunately, less attention has been paid
to combining both operators, which is the motivation of this paper. More precisely, we start our investigation
by showing that SO-tgds are not good for inversion, as there exist mappings specified by SO-tgds that are
not invertible under any of the notions of inversion proposed in the literature. To overcome this limitation,
we borrow the notion of CQ-composition, which is a relaxation obtained by parameterizing the composition
of mappings by the class of conjunctive queries (CQ), and we propose a restriction over the class of SO-tgds
that gives rise to the language of plain SO-tgds. Then we show that plain SO-tgds are the right language
to express the CQ-composition of mappings given by st-tgds, in the same sense that SO-tgds are the right
language to express the composition of st-tgds, and we prove that every mapping specified by a plain SO-tgd admits a maximum recovery, thus showing that plain SO-tgds have a good behavior w.r.t. inversion.
Moreover, we show that the language of plain SO-tgds shares some fundamental structural properties with
the language of st-tgds, but being much more expressive, and we provide a polynomial-time algorithm to
compute maximum recoveries for mappings specified by plain SO-tgds (which can also be used to compute
maximum recoveries for mappings given by st-tgds). All these results suggest that the language of plain
SO-tgds is a good alternative to be implemented in data exchange and data integration applications.
2012
Query language-based inverses of schema mappings: semantics, computation, and closure properties
[ pdf
]
With Marcelo Arenas, Jorge Pérez, and Juan Reutter. In VLDBJ.
Journal: The VLDB Journal
(VLDBJ)
Volume: 21
(6)
Pages: 823-842
Abstract:
The inversion of schema mappings has been identified as one of the
fundamental operators for the development of a general framework for
metadata management. During the last few years three alternative
notions of inversion for schema mappings have been proposed
(Fagin-inverse, quasi-inverse and maximum
recovery). However, these notions lack some fundamental
properties which limit their practical applicability: most of them
are expressed in languages including features that are difficult to
use in practice, some of these inverses are not guaranteed to exist
for mappings specified with source-to-target tuple-generating
dependencies (st-tgds), and it has been futile to search for a
meaningful mapping language that is closed under any of
these notions of inverse.
In this paper, we develop a framework for the inversion of schema
mappings that fulfills all of the above requirements. It is based on
the notion of C-maximum recovery, for a query language C, a
notion designed to generate inverse mappings that recover back only
the information that can be retrieved with queries in C. By
focusing on the language of conjunctive queries (CQ), we are able
to find a mapping language that contains the class of st-tgds, is
closed under CQ-maximum recovery, and for which the chase procedure
can be used to exchange data efficiently. Furthermore, we show that
our choices of inverse notion and mapping language are optimal, in the
sense that choosing a more expressive inverse operator or mapping
language causes the loss of these properties.
2009
Composition and inversion of schema mappings
[ pdf
]
With Marcelo Arenas, Jorge Pérez, and Juan Reutter. In SIGMOD Record.
Journal: ACM SIGMOD Record
(SIGMOD Record)
Volume: 38
(3)
Pages: 17-28
Abstract: (not available)
The Recovery of a schema mapping: Bringing exchanged data back
[ pdf
]
With Marcelo Arenas and Jorge Pérez. In TODS.
Journal: ACM Transactions on Database Systems
(TODS)
Volume: 34
(4)
Pages: Article No. 22
Abstract:
A schema mapping is a specification that describes how data from a source schema is to be mapped to a target
schema. Once the data has been transferred from the source to the target, a natural question is whether one can
undo the process and recover the initial data, or at least part of it. In fact, it would be desirable to find a reverse
schema mapping from target to source that specifies how to bring the exchanged data back.
In this paper, we introduce the notion of a recovery of a schema mapping: it is a reverse mapping M' for a
mapping M that recovers sound data with respect to M. We further introduce an order relation on recoveries.
This allows us to choose mappings that recover the maximum amount of sound information. We call such mappings
maximum recoveries. We study maximum recoveries in detail, providing a necessary and sufficient condition
for their existence. In particular, we prove that maximum recoveries exist for the class of mappings specified
by FO-TO-CQ source-to-target dependencies. This class subsumes the class of source-to-target tuple-generating
dependencies used in previous work on data exchange. For the class of mappings specified by FO-TO-CQ dependencies,
we provide an exponential-time algorithm for computing maximum recoveries, and a simplified version
for full dependencies that works in quadratic time. We also characterize the language needed to express maximum
recoveries, and we include a detailed comparison with the notion of inverse (and quasi-inverse) mapping
previously proposed in the data exchange literature. In particular, we show that maximum recoveries strictly
generalize inverses. We finally study the complexity of some decision problems related to the notions of recovery
and maximum recovery.
I am an Assistant Professor at the Department of Computer Science
at the Pontificia Universidad Católica de Chile.
I received a D.Phil. from the University of Oxford in 2013 and an
M.Sc. from Pontificia Universidad Católica de Chile in 2008.
Before that, I studied at Pontificia Universidad Católica de Chile as an undergraduate, where I received
a B.A. in Mathematics in 2006 and my Professional Degree in Computer Engineering in 2008.
My research interests are mostly in data management systems, specifically, in data streams, information extraction, and graph data. Also, I do research on theoretical computer science, mostly in automata theory, logics, and computational complexity.