Query planner gaining the ability to replanning after start of query execution.

public inbox for pgsql-performance@postgresql.org  
help / color / mirror / Atom feed

Query planner gaining the ability to replanning after start of query execution.
8+ messages / 5 participants
[nested] [flat]

* Query planner gaining the ability to replanning after start of query execution.
@ 2017-11-13 16:44 Oliver Mattos <omattos@gmail.com>
  2017-11-13 20:06 ` Re: Query planner gaining the ability to replanning after start of query execution. Arne Roland <A.Roland@index.de>
  0 siblings, 1 reply; 8+ messages in thread

From: Oliver Mattos @ 2017-11-13 16:44 UTC (permalink / raw)
  To: pgsql-performance

I am interested in giving the query planner the ability to replan (or
re-rank plans) after query execution has begun, based on the
progression of the query so far.

Example use case:

*  A LIMIT 1 query is planned using an expensive scan which the
planner expects to return a large number of results, and to terminate
early.   The reality is the query actually produces no results, and
the scan must run to completion, potentially taking thousands of times
longer than expected.

*  If this plans costs were adjusted mid-execution to reflect the fact
that the scan is producing far fewer rows than expected, then another
query plan might come out ahead, which would complete far faster.

Has this been done before?   Are there any pitfalls to beware of?

-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

^ permalink  raw  reply  [nested|flat] 8+ messages in thread

* Re: Query planner gaining the ability to replanning after start of query execution.
  2017-11-13 16:44 Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
@ 2017-11-13 20:06 ` Arne Roland <A.Roland@index.de>
  2017-11-13 21:51   ` Re: Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  2017-11-15 19:57   ` Re: Query planner gaining the ability to replanning after start of query execution. Gunter <raj@gusw.net>
  0 siblings, 2 replies; 8+ messages in thread

From: Arne Roland @ 2017-11-13 20:06 UTC (permalink / raw)
  To: Oliver Mattos <omattos@gmail.com>; pgsql-performance

Hello,

I'd love to have some sort of dynamic query feedback, yet it's very complicated to do it right. I am not convinced that changing the plan during a single execution is the right way to do it, not only because it sounds intrusive to do crazy things in the executor, but also because don't understand why the new plan should be any better than the old one. Can you be more elaborate how you'd want to go about it?

In your example (which presumably just has a single relation), we have no notion of whether the scan returns no rows because we were unlucky, because just the first few pages were empty of matching rows (which in my experience happens more often), or because the cardinality estimation is wrong. Even if the cardinality estimation is wrong, we have no notion of which predicate or predicate combination actually caused the misestimation. If the first few pages where empty, the same might happen with every order (so also with every available indexscan). Imagine a very simple seqscan plan of 
select * from mytab where a = 3 and b = 40 limit 1
Even if we know the cardinality is overestimated, we have no idea whether the cardinality of a = 3 or b = 40 is wrong or they just correlate, so there is no notion of which is actually the cheapest plan. Usual workaround for most of these queries is to add an order by (which has the nice addition of having a deterministic result) with an appropriate complex index, usually resulting in indexscans.

While we actually know more after the first execution of a nodes like materialize, sort or hash nodes, I rarely encounter materialize nodes in the wild. Consequently that is the place where the work is usually already done, which is especially true with the hash node. Even though it still might be more optimal to switch from a mergejoin to a hashjoin in some cases, I doubt that's worth any work (and even less the maintenance).

Best regards
Arne Roland

-----Original Message-----
From: pgsql-performance-owner@postgresql.org [mailto:pgsql-performance-owner@postgresql.org] On Behalf Of Oliver Mattos
Sent: Monday, November 13, 2017 5:45 PM
To: pgsql-performance@postgresql.org
Subject: [PERFORM] Query planner gaining the ability to replanning after start of query execution.

I am interested in giving the query planner the ability to replan (or re-rank plans) after query execution has begun, based on the progression of the query so far.

Example use case:

*  A LIMIT 1 query is planned using an expensive scan which the planner expects to return a large number of results, and to terminate
early.   The reality is the query actually produces no results, and
the scan must run to completion, potentially taking thousands of times longer than expected.

*  If this plans costs were adjusted mid-execution to reflect the fact that the scan is producing far fewer rows than expected, then another query plan might come out ahead, which would complete far faster.

Has this been done before?   Are there any pitfalls to beware of?

--
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

-----Original Message-----
From: pgsql-performance-owner@postgresql.org [mailto:pgsql-performance-owner@postgresql.org] On Behalf Of Oliver Mattos
Sent: Monday, November 13, 2017 5:45 PM
To: pgsql-performance@postgresql.org
Subject: [PERFORM] Query planner gaining the ability to replanning after start of query execution.

I am interested in giving the query planner the ability to replan (or re-rank plans) after query execution has begun, based on the progression of the query so far.

Example use case:

*  A LIMIT 1 query is planned using an expensive scan which the planner expects to return a large number of results, and to terminate
early.   The reality is the query actually produces no results, and
the scan must run to completion, potentially taking thousands of times longer than expected.

*  If this plans costs were adjusted mid-execution to reflect the fact that the scan is producing far fewer rows than expected, then another query plan might come out ahead, which would complete far faster.

Has this been done before?   Are there any pitfalls to beware of?

--
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

^ permalink  raw  reply  [nested|flat] 8+ messages in thread

* Re: Query planner gaining the ability to replanning after start of query execution.
  2017-11-13 16:44 Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  2017-11-13 20:06 ` Re: Query planner gaining the ability to replanning after start of query execution. Arne Roland <A.Roland@index.de>
@ 2017-11-13 21:51   ` Oliver Mattos <omattos@gmail.com>
  2017-11-13 22:48     ` Re: Query planner gaining the ability to replanning after start of query execution. Arne Roland <A.Roland@index.de>
  2017-11-13 22:49     ` Re: Query planner gaining the ability to replanning after start of query execution. Tom Lane <tgl@sss.pgh.pa.us>
  1 sibling, 2 replies; 8+ messages in thread

From: Oliver Mattos @ 2017-11-13 21:51 UTC (permalink / raw)
  To: Arne Roland <A.Roland@index.de>; +Cc: pgsql-performance

> Can you be more elaborate how you'd want to go about it?

My initial approach would be to try to identify places in the plan
where selectivity is seriously over or underestimated.   I would reuse
the instrumentation infrastructure's counts of filtered and returned
tuples for each execnode, and periodically call back into the planner
(for example at every power of 2 tuples processed).

The planner would have a wrapper to clauselist_selectivity which
somehow combines the existing estimate with the filtered and returned
tuples so far.   Exactly how to merge them isn't clear, but I could
imagine using a poisson distribution to calculate the probability that
the selectivity estimate is representative of the filtered and
returned numbers, and then blending the two linearly based on that
estimate.

When the planner has re-estimated the cost of the current plan, a
discount would be applied for the percentage of each execnode
completed (rows processed / estimated rows), and all candidate plans
compared.

If another candidate plan is now lower cost, the current plan would be
terminated[1] by setting a flag instructing each execnode to return as
if it had reached the end of the input, although still caching the
node selectivity values, and the new plan started from scratch.

The old plan is kept within the query planner candidate list, together
with it's cached selectivity values.  If at some point it again is
cheaper, it is started from scratch too.

> Even if we know the cardinality is overestimated, we have no idea whether the cardinality of a = 3 or b = 40 is wrong or they just correlate

The goal would be not to know which is wrong, but to try each,
discarding it if it turns out worse than we estimated.  Processing a
few hundred rows of each of 5 plans is tiny compared to a scan of 1M
rows...

[1]:   An improvement here (with much more code complexity) is to keep
multiple partially executed plans around, so that whichever one is
most promising can be worked on, but can be halted and resumed later
as selectivity (and hence cost) estimates change.

On Mon, Nov 13, 2017 at 8:06 PM, Arne Roland <A.Roland@index.de> wrote:
> Hello,
>
> I'd love to have some sort of dynamic query feedback, yet it's very complicated to do it right. I am not convinced that changing the plan during a single execution is the right way to do it, not only because it sounds intrusive to do crazy things in the executor, but also because don't understand why the new plan should be any better than the old one. Can you be more elaborate how you'd want to go about it?
>
> In your example (which presumably just has a single relation), we have no notion of whether the scan returns no rows because we were unlucky, because just the first few pages were empty of matching rows (which in my experience happens more often), or because the cardinality estimation is wrong. Even if the cardinality estimation is wrong, we have no notion of which predicate or predicate combination actually caused the misestimation. If the first few pages where empty, the same might happen with every order (so also with every available indexscan). Imagine a very simple seqscan plan of
> select * from mytab where a = 3 and b = 40 limit 1
> Even if we know the cardinality is overestimated, we have no idea whether the cardinality of a = 3 or b = 40 is wrong or they just correlate, so there is no notion of which is actually the cheapest plan. Usual workaround for most of these queries is to add an order by (which has the nice addition of having a deterministic result) with an appropriate complex index, usually resulting in indexscans.
>
> While we actually know more after the first execution of a nodes like materialize, sort or hash nodes, I rarely encounter materialize nodes in the wild. Consequently that is the place where the work is usually already done, which is especially true with the hash node. Even though it still might be more optimal to switch from a mergejoin to a hashjoin in some cases, I doubt that's worth any work (and even less the maintenance).
>
> Best regards
> Arne Roland
>
> -----Original Message-----
> From: pgsql-performance-owner@postgresql.org [mailto:pgsql-performance-owner@postgresql.org] On Behalf Of Oliver Mattos
> Sent: Monday, November 13, 2017 5:45 PM
> To: pgsql-performance@postgresql.org
> Subject: [PERFORM] Query planner gaining the ability to replanning after start of query execution.
>
> I am interested in giving the query planner the ability to replan (or re-rank plans) after query execution has begun, based on the progression of the query so far.
>
> Example use case:
>
> *  A LIMIT 1 query is planned using an expensive scan which the planner expects to return a large number of results, and to terminate
> early.   The reality is the query actually produces no results, and
> the scan must run to completion, potentially taking thousands of times longer than expected.
>
> *  If this plans costs were adjusted mid-execution to reflect the fact that the scan is producing far fewer rows than expected, then another query plan might come out ahead, which would complete far faster.
>
>
> Has this been done before?   Are there any pitfalls to beware of?
>
>
> --
> Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-performance
>
>
>
>
> -----Original Message-----
> From: pgsql-performance-owner@postgresql.org [mailto:pgsql-performance-owner@postgresql.org] On Behalf Of Oliver Mattos
> Sent: Monday, November 13, 2017 5:45 PM
> To: pgsql-performance@postgresql.org
> Subject: [PERFORM] Query planner gaining the ability to replanning after start of query execution.
>
> I am interested in giving the query planner the ability to replan (or re-rank plans) after query execution has begun, based on the progression of the query so far.
>
> Example use case:
>
> *  A LIMIT 1 query is planned using an expensive scan which the planner expects to return a large number of results, and to terminate
> early.   The reality is the query actually produces no results, and
> the scan must run to completion, potentially taking thousands of times longer than expected.
>
> *  If this plans costs were adjusted mid-execution to reflect the fact that the scan is producing far fewer rows than expected, then another query plan might come out ahead, which would complete far faster.
>
>
> Has this been done before?   Are there any pitfalls to beware of?
>
>
> --
> Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-performance
>
>
>

-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

^ permalink  raw  reply  [nested|flat] 8+ messages in thread

* Re: Query planner gaining the ability to replanning after start of query execution.
  2017-11-13 16:44 Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  2017-11-13 20:06 ` Re: Query planner gaining the ability to replanning after start of query execution. Arne Roland <A.Roland@index.de>
  2017-11-13 21:51   ` Re: Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
@ 2017-11-13 22:48     ` Arne Roland <A.Roland@index.de>
  1 sibling, 0 replies; 8+ messages in thread

From: Arne Roland @ 2017-11-13 22:48 UTC (permalink / raw)
  To: Oliver Mattos <omattos@gmail.com>; +Cc: pgsql-performance

Hello,

that method is bound to introduce errors if the physical location of a row correlates strongly with a column, imagine "and my_timestamp> now() - INTERVAL '1 year'" as part of a where clause of a query without limit on an insert only table which is one and a half years old with a seqscan. There might be similar effects if an index on the timestamp is used to go about a query, if other rows of the filter correlate.

This method furthermore only judges filter predicates.
So it's not that easy to just go about the expected rowcount of a single node, since the underling node might return a totally inaccurate number of rows. While that isn't that common with underlying seqscans, it is very frequent if an indexscan is used without being able to rely on the MCV. While it's still possible to notice a misestimation there is no sense of how wrong it is, until the rows are already processed.

Furthermore: Did you think about parallel plans and most importantly cursors?

Best regards
Arne Roland

-----Original Message-----
From: Oliver Mattos [mailto:omattos@gmail.com] 
Sent: Monday, November 13, 2017 10:52 PM
To: Arne Roland <A.Roland@index.de>
Cc: pgsql-performance@postgresql.org
Subject: Re: [PERFORM] Query planner gaining the ability to replanning after start of query execution.

> Can you be more elaborate how you'd want to go about it?

My initial approach would be to try to identify places in the plan
where selectivity is seriously over or underestimated.   I would reuse
the instrumentation infrastructure's counts of filtered and returned tuples for each execnode, and periodically call back into the planner (for example at every power of 2 tuples processed).

The planner would have a wrapper to clauselist_selectivity which somehow combines the existing estimate with the filtered and returned
tuples so far.   Exactly how to merge them isn't clear, but I could
imagine using a poisson distribution to calculate the probability that the selectivity estimate is representative of the filtered and returned numbers, and then blending the two linearly based on that estimate.

When the planner has re-estimated the cost of the current plan, a discount would be applied for the percentage of each execnode completed (rows processed / estimated rows), and all candidate plans compared.

If another candidate plan is now lower cost, the current plan would be terminated[1] by setting a flag instructing each execnode to return as if it had reached the end of the input, although still caching the node selectivity values, and the new plan started from scratch.

The old plan is kept within the query planner candidate list, together with it's cached selectivity values.  If at some point it again is cheaper, it is started from scratch too.

> Even if we know the cardinality is overestimated, we have no idea 
> whether the cardinality of a = 3 or b = 40 is wrong or they just 
> correlate

The goal would be not to know which is wrong, but to try each, discarding it if it turns out worse than we estimated.  Processing a few hundred rows of each of 5 plans is tiny compared to a scan of 1M rows...

[1]:   An improvement here (with much more code complexity) is to keep
multiple partially executed plans around, so that whichever one is most promising can be worked on, but can be halted and resumed later as selectivity (and hence cost) estimates change.

On Mon, Nov 13, 2017 at 8:06 PM, Arne Roland <A.Roland@index.de> wrote:
> Hello,
>
> I'd love to have some sort of dynamic query feedback, yet it's very complicated to do it right. I am not convinced that changing the plan during a single execution is the right way to do it, not only because it sounds intrusive to do crazy things in the executor, but also because don't understand why the new plan should be any better than the old one. Can you be more elaborate how you'd want to go about it?
>
> In your example (which presumably just has a single relation), we have 
> no notion of whether the scan returns no rows because we were unlucky, 
> because just the first few pages were empty of matching rows (which in my experience happens more often), or because the cardinality estimation is wrong. Even if the cardinality estimation is wrong, we have no notion of which predicate or predicate combination actually caused the misestimation. If the first few pages where empty, the same might happen with every order (so also with every available indexscan). Imagine a very simple seqscan plan of select * from mytab where a = 3 and b = 40 limit 1 Even if we know the cardinality is overestimated, we have no idea whether the cardinality of a = 3 or b = 40 is wrong or they just correlate, so there is no notion of which is actually the cheapest plan. Usual workaround for most of these queries is to add an order by (which has the nice addition of having a deterministic result) with an appropriate complex index, usually resulting in indexscans.
>
> While we actually know more after the first execution of a nodes like materialize, sort or hash nodes, I rarely encounter materialize nodes in the wild. Consequently that is the place where the work is usually already done, which is especially true with the hash node. Even though it still might be more optimal to switch from a mergejoin to a hashjoin in some cases, I doubt that's worth any work (and even less the maintenance).
>
> Best regards
> Arne Roland
>
> -----Original Message-----
> From: pgsql-performance-owner@postgresql.org 
> [mailto:pgsql-performance-owner@postgresql.org] On Behalf Of Oliver 
> Mattos
> Sent: Monday, November 13, 2017 5:45 PM
> To: pgsql-performance@postgresql.org
> Subject: [PERFORM] Query planner gaining the ability to replanning after start of query execution.
>
> I am interested in giving the query planner the ability to replan (or re-rank plans) after query execution has begun, based on the progression of the query so far.
>
> Example use case:
>
> *  A LIMIT 1 query is planned using an expensive scan which the planner expects to return a large number of results, and to terminate
> early.   The reality is the query actually produces no results, and
> the scan must run to completion, potentially taking thousands of times longer than expected.
>
> *  If this plans costs were adjusted mid-execution to reflect the fact that the scan is producing far fewer rows than expected, then another query plan might come out ahead, which would complete far faster.
>
>
> Has this been done before?   Are there any pitfalls to beware of?
>
>
> --
> Sent via pgsql-performance mailing list 
> (pgsql-performance@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-performance
>
>
>
>
> -----Original Message-----
> From: pgsql-performance-owner@postgresql.org 
> [mailto:pgsql-performance-owner@postgresql.org] On Behalf Of Oliver 
> Mattos
> Sent: Monday, November 13, 2017 5:45 PM
> To: pgsql-performance@postgresql.org
> Subject: [PERFORM] Query planner gaining the ability to replanning after start of query execution.
>
> I am interested in giving the query planner the ability to replan (or re-rank plans) after query execution has begun, based on the progression of the query so far.
>
> Example use case:
>
> *  A LIMIT 1 query is planned using an expensive scan which the planner expects to return a large number of results, and to terminate
> early.   The reality is the query actually produces no results, and
> the scan must run to completion, potentially taking thousands of times longer than expected.
>
> *  If this plans costs were adjusted mid-execution to reflect the fact that the scan is producing far fewer rows than expected, then another query plan might come out ahead, which would complete far faster.
>
>
> Has this been done before?   Are there any pitfalls to beware of?
>
>
> --
> Sent via pgsql-performance mailing list 
> (pgsql-performance@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-performance
>
>
>

-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

^ permalink  raw  reply  [nested|flat] 8+ messages in thread

* Re: Query planner gaining the ability to replanning after start of query execution.
  2017-11-13 16:44 Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  2017-11-13 20:06 ` Re: Query planner gaining the ability to replanning after start of query execution. Arne Roland <A.Roland@index.de>
  2017-11-13 21:51   ` Re: Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
@ 2017-11-13 22:49     ` Tom Lane <tgl@sss.pgh.pa.us>
  2017-11-13 23:20       ` Re: Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  1 sibling, 1 reply; 8+ messages in thread

From: Tom Lane @ 2017-11-13 22:49 UTC (permalink / raw)
  To: Oliver Mattos <omattos@gmail.com>; +Cc: Arne Roland <A.Roland@index.de>; pgsql-performance

Oliver Mattos <omattos@gmail.com> writes:
>> Can you be more elaborate how you'd want to go about it?

> ... If another candidate plan is now lower cost, the current plan would be
> terminated[1] by setting a flag instructing each execnode to return as
> if it had reached the end of the input, although still caching the
> node selectivity values, and the new plan started from scratch.

Quite aside from the implementation difficulties you'll have, that
approach is a show-stopper right there.  You can't just restart from
scratch, because we may already have shipped rows to the client, or
for DML cases already inserted/updated/deleted rows (and even if you
could roll those back, we've possibly fired triggers with unpredictable
side effects).  Queries containing volatile functions are another no-fly
zone for this approach.

I can't see any way around that without unacceptable performance costs
(i.e. buffering all the rows until we're done) or wire-protocol breakage.

I think that a more practical way to address the class of problems
you're talking about is to teach the planner to have some notion of
worst-case as well as expected-case costs, and then incorporate some
perhaps-configurable amount of risk aversion in its choices.

			regards, tom lane

PS: please do not top-post, and do not quote the entire darn thread
in each message.

-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

^ permalink  raw  reply  [nested|flat] 8+ messages in thread

* Re: Query planner gaining the ability to replanning after start of query execution.
  2017-11-13 16:44 Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  2017-11-13 20:06 ` Re: Query planner gaining the ability to replanning after start of query execution. Arne Roland <A.Roland@index.de>
  2017-11-13 21:51   ` Re: Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  2017-11-13 22:49     ` Re: Query planner gaining the ability to replanning after start of query execution. Tom Lane <tgl@sss.pgh.pa.us>
@ 2017-11-13 23:20       ` Oliver Mattos <omattos@gmail.com>
  2017-11-14 21:26         ` Re: Query planner gaining the ability to replanning after start of query execution. legrand legrand <legrand_legrand@hotmail.com>
  0 siblings, 1 reply; 8+ messages in thread

From: Oliver Mattos @ 2017-11-13 23:20 UTC (permalink / raw)
  To: Tom Lane <tgl@sss.pgh.pa.us>; +Cc: Arne Roland <A.Roland@index.de>; pgsql-performance

>  You can't just restart from scratch, because we may already have shipped rows to the client

For v1, replanning wouldn't be an option if rows have already been
shipped, or for DML statements.

> parallel plans and most importantly cursors?
Parallel plans look do-able with the same approach, but cursor use I'd
probably stop replanning as soon as the first row is delivered to the
client, as above.   One could imagine more complex approaches like a
limited size buffer of 'delivered' rows, allowing a new plan to be
selected and the delivered rows excluded from the new plans resultset
via a special extra prepending+dupe filtering execnode.   The memory
and computation costs of that execnode would be factored into the
replanning decision like any other node.


>errors if the physical location of a row correlates strongly with a column
This is my largest concern.  These cases already lead to large errors
currently (SELECT * FROM foo WHERE created_date = today LIMIT 1) might
scan all data, only to find all of today's records in the last
physical block.

It's hard to say if replacing one bad estimate with another will lead
to overall better/worse results...   My hope is that in most cases a
bunch of plans will be tried, all end up with cost estimates revised
up a lot, and then one settled on as rows start getting passed to
upper layers.

>underling node might return a totally inaccurate number of rows for index scans
One might imagine using the last returned row as an extra histogram
point when estimating how many rows are left in an index scan.   That
should at least make the estimate more accurate than it is without
feedback.


-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance



^ permalink  raw  reply  [nested|flat] 8+ messages in thread

* Re: Query planner gaining the ability to replanning after start of query execution.
  2017-11-13 16:44 Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  2017-11-13 20:06 ` Re: Query planner gaining the ability to replanning after start of query execution. Arne Roland <A.Roland@index.de>
  2017-11-13 21:51   ` Re: Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  2017-11-13 22:49     ` Re: Query planner gaining the ability to replanning after start of query execution. Tom Lane <tgl@sss.pgh.pa.us>
  2017-11-13 23:20       ` Re: Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
@ 2017-11-14 21:26         ` legrand legrand <legrand_legrand@hotmail.com>
  0 siblings, 0 replies; 8+ messages in thread

From: legrand legrand @ 2017-11-14 21:26 UTC (permalink / raw)
  To: pgsql-performance

As a first step, before relaunching the query with a new plan I would be
happy to be able to get information about sql queries having wrong
estimates.

Maybe thoses SQL queries could be collected using something similar to
"auto_explain" module (tracing finished or cancelled queries).

If the "corrected plan" taking into account real cardinalities was proposed
(with some advices on how to get it) it would be a great tuning adviser ;o)





--
Sent from: http://www.postgresql-archive.org/PostgreSQL-performance-f2050081.html


-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance



^ permalink  raw  reply  [nested|flat] 8+ messages in thread

* Re: Query planner gaining the ability to replanning after start of query execution.
  2017-11-13 16:44 Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
  2017-11-13 20:06 ` Re: Query planner gaining the ability to replanning after start of query execution. Arne Roland <A.Roland@index.de>
@ 2017-11-15 19:57   ` Gunter <raj@gusw.net>
  1 sibling, 0 replies; 8+ messages in thread

From: Gunter @ 2017-11-15 19:57 UTC (permalink / raw)
  To: pgsql-performance

May I suggest the recent discussion over the dreaded NL issue ... I had some
similar ideas.

http://www.postgresql-archive.org/OLAP-reporting-queries-fall-into-nested-loops-over-seq-scans-or-ot...

This could be massively useful and a huge leap in the industry.

--
Sent from: http://www.postgresql-archive.org/PostgreSQL-performance-f2050081.html

-- 
Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance

^ permalink  raw  reply  [nested|flat] 8+ messages in thread

end of thread, other threads:[~2017-11-15 19:57 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed)
-- links below jump to the message on this page --
2017-11-13 16:44 Query planner gaining the ability to replanning after start of query execution. Oliver Mattos <omattos@gmail.com>
2017-11-13 20:06 ` Arne Roland <A.Roland@index.de>
2017-11-13 21:51   ` Oliver Mattos <omattos@gmail.com>
2017-11-13 22:48     ` Arne Roland <A.Roland@index.de>
2017-11-13 22:49     ` Tom Lane <tgl@sss.pgh.pa.us>
2017-11-13 23:20       ` Oliver Mattos <omattos@gmail.com>
2017-11-14 21:26         ` legrand legrand <legrand_legrand@hotmail.com>
2017-11-15 19:57   ` Gunter <raj@gusw.net>

This inbox is served by agora; see mirroring instructions
for how to clone and mirror all data and code used for this inbox