Oliver Irwin
01/12/2023 - LINKS Seminar
Join Query :
where
Example:
id | name | city |
---|---|---|
1 | Alice | Paris |
2 | Bob | Lens |
3 | Chiara | Rome |
4 | Djibril | Berlin |
5 | Émile | Dortmund |
6 | Francesca | Rome |
city | country |
---|---|
Berlin | Germany |
Paris | France |
Rome | Italy |
city | country | name | id |
---|---|---|---|
Paris | France | Alice | 1 |
Rome | Italy | Chiara | 3 |
Berlin | Germany | Djibril | 4 |
Rome | Italy | Francesca | 6 |
We want to access the
Make
We want to access the
Make
city | country | name | id |
---|---|---|---|
Berlin | Germany | Djibril | 4 |
Paris | France | Alice | 1 |
Rome | Italy | Chiara | 3 |
Rome | Italy | Francesca | 6 |
We want to access the
Make
city | country | name | id |
---|---|---|---|
|
|
|
|
Berlin | Germany | Djibril | 4 |
|
|
|
|
Paris | France | Alice | 1 |
|
|
|
|
Rome | Italy | Chiara | 3 |
Rome | Italy | Francesca | 6 |
|
|
|
|
Precomputation : very costly
Access : nearly free
We need another way to represent the data
Central class of queries because of their tractability
An
1 is an
2 is an
3 is an
4 is an
A query
[Brault-Baron, 2014], also known as “without disruptive trio” [Carmeli, Tziavelis, Gatterbauer, Kimelfeld, Riedewald, 2020]
In the previous example, 1,
2, 3, 4 is an
Given a join query
[Carmeli, Tziavelis, Gatterbauer, Kimelfeld, Riedewald, 2020]
The order used here is
Load data inside the bags
Annotate by computing the number of extensions
We want to access the
Make a table, sort it, and done?
city | country | name | id |
---|---|---|---|
Berlin | Germany | Djibril | 4 |
Paris | France | Alice | 1 |
Rome | Italy | Chiara | 3 |
Rome | Italy | Francesca | 6 |
We need another solution 😢
Join Tree Approach
use a join tree to answer tasks in an
efficient way
Works for
Signed Query:
Big difference:
positively encoding
is acyclic
is not acyclic
Let
For any database
But
Good candidate for another measure of tractability: every
This is known as
Intuition: a
A
Characterisation: A query
We want to access the
Make a table, sort it, and done?
city | country | name | id |
---|---|---|---|
Berlin | Germany | Djibril | 4 |
Paris | France | Alice | 1 |
Rome | Italy | Chiara | 3 |
Rome | Italy | Francesca | 6 |
We need another solution 😢
Join Tree Approach
Works for
No notion of join tree for
We propose a new approach!
Recovers former results 🥳
Handles signed queries 😍
|
|
|
---|---|---|
0 | 0 | 0 |
0 | 0 | 1 |
0 | 1 | 0 |
0 | 1 | 1 |
1 | 0 | 1 |
1 | 0 | 2 |
1 | 1 | 1 |
1 | 1 | 2 |
1 | 2 | 0 |
1 | 2 | 1 |
2 | 0 | 1 |
2 | 0 | 2 |
2 | 2 | 1 |
2 | 2 | 2 |
factorised representation of relations
circuit with 3 kinds of gates :
paths from decision gates are labelled by the domain values
factorised representation of relations
circuit with 3 kinds of gates :
paths from decision gates are labelled by the domain values
+ order
For
Idea : for each gate
compute the size of the relation where
Compute the 7th solution
Compute the 13th solution
then
then
recursive implementation + cache
Let
(Generalisation of [Capelli, 2017])
Corollary:
Direct Access for
Side Note:
Join-Tree based approaches fail for
For a query
Algorithm schema:
For a query
Algorithm schema:
This technique generalises to:
Going further with circuits
study the tractability of the circuit approach for DA on CQs with aggregation
How should we integrate the aggregation in the lexicographical order?
How does the aggregation fit in to the compiled circuits?
generalise the circuit approach to queries over annotated databases (FAQ and AJAR queries)
recent works (Zhao, Fan, Ouyang, Koutris, 2023)