How Does PostgreSQL 9.6 Achieve a 180x Speedup in Aggregation? A Brief Look at Aggregate Code Optimization and OP Reuse

digoal

2016-10-08

PostgreSQL, 9.6, kernel optimization, aggregate code optimization, OP reuse

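An aggregate operation collapses a group of rows into a single output value.

Aggregates are most common in statistical applications, for example computing a group's maximum, minimum, row count, average, variance, intercept, or correlation.

They can also be used for text or image analysis, for example best similarity, row-to-column transformation, aggregation into an array or JSON, or image stacking.

An aggregate therefore usually involves three stages: an initial value, per-row processing, and conversion of the result into its output format.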
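The three stages can be sketched in a few lines of Python. This is only an illustrative model, not PostgreSQL code: the `Aggregate` class and `run_aggregate` helper are hypothetical, and the field names `initcond`, `sfunc`, and `finalfunc` are borrowed from the options of `CREATE AGGREGATE` purely as labels.

```python
from dataclasses import dataclass
from typing import Any, Callable, Iterable


@dataclass
class Aggregate:
    """Hypothetical model of an aggregate; not a PostgreSQL API."""
    initcond: Any                      # stage 1: the initial state
    sfunc: Callable[[Any, Any], Any]   # stage 2: (state, row) -> new state
    finalfunc: Callable[[Any], Any]    # stage 3: final state -> result


def run_aggregate(agg: Aggregate, rows: Iterable[Any]) -> Any:
    """Fold all rows of one group into a single result."""
    state = agg.initcond
    for row in rows:
        state = agg.sfunc(state, row)
    return agg.finalfunc(state)


# An average modeled with a (count, total) state.
avg = Aggregate(
    initcond=(0, 0.0),
    sfunc=lambda s, x: (s[0] + 1, s[1] + x),
    finalfunc=lambda s: s[1] / s[0] if s[0] else None,
)

result = run_aggregate(avg, [1.0, 2.0, 3.0, 6.0])
print(result)  # 3.0
```

The state carries more than the final answer needs to show (a count and a total), which is why a separate final conversion step exists.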

PostgreSQL aggregates follow the same three stages. The syntax for creating an aggregate function is as follows:

Example

References

<a href="https://www.postgresql.org/docs/9.6/static/xaggr.html">https://www.postgresql.org/docs/9.6/static/xaggr.html</a>

<a href="https://www.postgresql.org/docs/9.6/static/sql-createaggregate.html">https://www.postgresql.org/docs/9.6/static/sql-createaggregate.html</a>

The PostgreSQL aggregate processing flow is shown in the figure.


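When several aggregates share the same initcond and sfunc, then within a single aggregation group sfunc only needs to process the rows once, instead of once per aggregate.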

<a href="https://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=804163bc25e979fcd91b02e58fa2d1c6b587cc65">https://git.postgresql.org/gitweb/?p=postgresql.git;a=commit;h=804163bc25e979fcd91b02e58fa2d1c6b587cc65</a>
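To make the idea concrete, here is a small Python simulation of sharing one transition state among several aggregates. It is a sketch under stated assumptions, not the executor code from the commit above: `run_shared`, `accum`, and the three final functions are hypothetical names, and aggregates are treated as shareable when their `(sfunc, initcond)` pair is identical.

```python
import math
from typing import Any, Callable, Dict, Iterable, List, Tuple

# Each aggregate is (name, sfunc, initcond, finalfunc); all names are illustrative.
AggSpec = Tuple[str, Callable[[Any, float], Any], Any, Callable[[Any], Any]]


def run_shared(specs: List[AggSpec], rows: Iterable[float]) -> Tuple[Dict[str, Any], int]:
    """Evaluate aggregates over one group, keeping one state per (sfunc, initcond)."""
    states: Dict[Tuple[Callable, str], Any] = {}
    for _, sfunc, initcond, _ in specs:
        # Aggregates with the same sfunc and initcond map to the same key.
        states.setdefault((sfunc, repr(initcond)), initcond)

    sfunc_calls = 0
    for row in rows:
        for key in states:
            states[key] = key[0](states[key], row)
            sfunc_calls += 1

    results = {
        name: finalfunc(states[(sfunc, repr(initcond))])
        for name, sfunc, initcond, finalfunc in specs
    }
    return results, sfunc_calls


def accum(state, x):
    """Shared transition function: tracks count, sum, and sum of squares."""
    n, total, total_sq = state
    return (n + 1, total + x, total_sq + x * x)


def final_avg(s):
    return s[1] / s[0]


def final_var_pop(s):
    return s[2] / s[0] - (s[1] / s[0]) ** 2


def final_stddev_pop(s):
    return math.sqrt(final_var_pop(s))


init = (0, 0.0, 0.0)
specs = [
    ("avg", accum, init, final_avg),
    ("var_pop", accum, init, final_var_pop),
    ("stddev_pop", accum, init, final_stddev_pop),
]
rows = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

results, calls = run_shared(specs, rows)
print(results)  # {'avg': 5.0, 'var_pop': 4.0, 'stddev_pop': 2.0}
print(calls)    # 8 transition calls; without sharing it would be 3 * 8 = 24
```

Only the final functions differ between the three aggregates, so the per-row work is done once and the cost of adding another aggregate on the same state is a single extra final call per group.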

The following SQL lists the aggregate functions that can share an operation; aggregates with the same rank can share it.

rank | aggfnoid | aggtransfn | agginitval

pg_catalog.sum

float4pl

none

pg_catalog.avg

float4_accum

{0,0,0}

pg_catalog.variance

pg_catalog.stddev

pg_catalog.var_samp

pg_catalog.stddev_samp

pg_catalog.var_pop

pg_catalog.stddev_pop

pg_catalog.max

float4larger

pg_catalog.min

float4smaller

float8pl

float8_accum

float8larger

float8smaller

text_larger

text_smaller

array_larger

array_smaller

int4larger

int4smaller

int2larger

int2smaller

cash_pl

cashlarger

cashsmaller

bpchar_larger

bpchar_smaller

date_larger

date_smaller

interval_pl

timestamptz_smaller

timestamptz_larger

interval_smaller

interval_larger

pg_catalog.count

int8inc

int8larger

int8smaller

time_larger

time_smaller

timetz_larger

timetz_smaller

pg_catalog.bit_and

bitand

pg_catalog.bit_or

bitor

numeric_smaller

numeric_larger

numeric_accum

int2_accum

int4_accum

int8_accum

int2_sum

int4_sum

interval_accum

{0 second,0 second}

int2and

int2or

int4and

int4or

int8and

int8or

int2_avg_accum

{0,0}

int4_avg_accum

oidlarger

oidsmaller

timestamp_smaller

timestamp_larger

pg_catalog.array_agg

array_agg_transfn

bool_and

booland_statefunc

every

bool_or

boolor_statefunc

int8_avg_accum

tidlarger

100

tidsmaller

101

int8inc_any

102

regr_count

int8inc_float8_float8

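103 | regr_sxx | float8_regr_accum | {0,0,0,0,0,0}
103 | regr_syy | float8_regr_accum | {0,0,0,0,0,0}
103 | regr_sxy | float8_regr_accum | {0,0,0,0,0,0}
103 | regr_avgx | float8_regr_accum | {0,0,0,0,0,0}
103 | regr_avgy | float8_regr_accum | {0,0,0,0,0,0}
103 | regr_r2 | float8_regr_accum | {0,0,0,0,0,0}
103 | regr_slope | float8_regr_accum | {0,0,0,0,0,0}
103 | regr_intercept | float8_regr_accum | {0,0,0,0,0,0}
103 | covar_pop | float8_regr_accum | {0,0,0,0,0,0}
103 | covar_samp | float8_regr_accum | {0,0,0,0,0,0}
103 | corr | float8_regr_accum | {0,0,0,0,0,0}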

114

numeric_avg_accum

116

xmlagg

xmlconcat2

117

json_agg

json_agg_transfn

118

json_object_agg

json_object_agg_transfn

119

jsonb_agg

jsonb_agg_transfn

120

jsonb_object_agg

jsonb_object_agg_transfn

121

enum_smaller

122

enum_larger

123

pg_catalog.string_agg

string_agg_transfn

124

bytea_string_agg_transfn

125

network_larger

126

network_smaller

127

pg_catalog.percentile_disc

ordered_set_transition

pg_catalog.percentile_cont

mode

134

pg_catalog.rank

ordered_set_transition_multi

pg_catalog.percent_rank

pg_catalog.cume_dist

pg_catalog.dense_rank

138

array_agg_array_transfn

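Next, I pick several statistics-related aggregates to verify the effect of the 9.6 optimization.

These aggregate functions are used as follows: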

<a href="https://www.postgresql.org/docs/9.6/static/functions-aggregate.html">https://www.postgresql.org/docs/9.6/static/functions-aggregate.html</a>

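| Function | Argument Type | Return Type | Partial Mode | Description |
|---|---|---|---|---|
| corr(y, x) | double precision | double precision | yes | correlation coefficient |
| covar_pop(y, x) | double precision | double precision | yes | population covariance |
| covar_samp(y, x) | double precision | double precision | yes | sample covariance |
| regr_avgx(y, x) | double precision | double precision | yes | average of the independent variable (sum(x)/n) |
| regr_avgy(y, x) | double precision | double precision | yes | average of the dependent variable (sum(y)/n) |
| regr_intercept(y, x) | double precision | double precision | yes | y-intercept of the least-squares-fit linear equation determined by the (x, y) pairs |
| regr_r2(y, x) | double precision | double precision | yes | square of the correlation coefficient |
| regr_slope(y, x) | double precision | double precision | yes | slope of the least-squares-fit linear equation determined by the (x, y) pairs |
| regr_sxx(y, x) | double precision | double precision | yes | sum(x^2) - sum(x)^2/n ("sum of squares" of the independent variable) |
| regr_sxy(y, x) | double precision | double precision | yes | sum(x*y) - sum(x) * sum(y)/n ("sum of products" of independent times dependent variable) |
| regr_syy(y, x) | double precision | double precision | yes | sum(y^2) - sum(y)^2/n ("sum of squares" of the dependent variable) |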
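The formulas in the descriptions above all draw on the same six running totals, which is why these eleven functions can share one transition state. The Python sketch below illustrates this. The helper names `regr_accum` and `regr_finals` are hypothetical, the state layout (n, sum(x), sum(x^2), sum(y), sum(y^2), sum(x*y)) is an assumption chosen for illustration, and the formulas for covariance, slope, intercept, and correlation are the standard textbook ones rather than a copy of the PostgreSQL source.

```python
import math
from typing import Dict, Iterable, List, Tuple


def regr_accum(state: List[float], y: float, x: float) -> List[float]:
    """One transition step over an assumed 6-element state."""
    n, sum_x, sum_x2, sum_y, sum_y2, sum_xy = state
    return [n + 1, sum_x + x, sum_x2 + x * x, sum_y + y, sum_y2 + y * y, sum_xy + x * y]


def regr_finals(state: List[float]) -> Dict[str, float]:
    """Derive all eleven results from the single shared state.

    Assumes n >= 2 and non-constant x and y, so no division by zero occurs.
    """
    n, sum_x, sum_x2, sum_y, sum_y2, sum_xy = state
    sxx = sum_x2 - sum_x * sum_x / n
    syy = sum_y2 - sum_y * sum_y / n
    sxy = sum_xy - sum_x * sum_y / n
    slope = sxy / sxx
    return {
        "regr_sxx": sxx,
        "regr_syy": syy,
        "regr_sxy": sxy,
        "regr_avgx": sum_x / n,
        "regr_avgy": sum_y / n,
        "regr_slope": slope,
        "regr_intercept": (sum_y - slope * sum_x) / n,
        "regr_r2": (sxy * sxy) / (sxx * syy),
        "corr": sxy / math.sqrt(sxx * syy),
        "covar_pop": sxy / n,
        "covar_samp": sxy / (n - 1),
    }


# Points on the line y = 2x + 1, given as (y, x) to match the argument order.
pairs: Iterable[Tuple[float, float]] = [(3.0, 1.0), (5.0, 2.0), (7.0, 3.0), (9.0, 4.0)]

state = [0.0] * 6          # plays the role of the {0,0,0,0,0,0} initial value
for y, x in pairs:
    state = regr_accum(state, y, x)   # one pass over the rows

stats = regr_finals(state)
print(stats["regr_slope"], stats["regr_intercept"], stats["corr"])  # 2.0 1.0 1.0
```

The loop over the rows runs once, and every statistic afterwards is a constant-time calculation on the accumulated totals.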

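Test with 50 million rows

1. 9.6, non-parallel

The aggregate computation took 7.1 seconds.

2. 9.5

The aggregate computation took 36.1 seconds.

3. 9.6, parallel

The aggregate computation took about 0.2 seconds.

The 9.6 optimization is clearly effective: even without parallelism, the aggregate operation is already about 5 times faster than in 9.5.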

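Result comparison

| Version | 9.6 | 9.5 | 9.6 parallel (32) |
|---|---|---|---|
| Time for 50 million rows, 11 aggregate functions (seconds) | 7.1 | 36.1 | 0.2 |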
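The speedup figures quoted in this post follow directly from these timings. The short Python snippet below just does the arithmetic, using 9.5 as the baseline. The dictionary keys are labels chosen here for illustration.

```python
# Timings in seconds, as measured above.
timings = {"9.5": 36.1, "9.6": 7.1, "9.6 parallel (32)": 0.2}

baseline = timings["9.5"]
# Speedup of each configuration relative to 9.5.
speedups = {version: baseline / seconds for version, seconds in timings.items()}

for version, ratio in speedups.items():
    print(f"{version}: {ratio:.1f}x")
```

The ratio for 9.6 without parallelism is a little over 5, and the ratio for 9.6 with parallelism is about 180, which is where the number in the title comes from.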

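Summary

In statistics, the intermediate results of most algorithms can be shared, for example between sum and avg, or among variance, correlation, count, and sum.

PostgreSQL 9.6 takes good advantage of this property: for aggregate functions with the same initial condition and the same intermediate algorithm, the data in a group only needs to be processed once, which greatly reduces CPU overhead and improves the efficiency of statistical queries.

The idea bears some resemblance to LLVM, although LLVM applies to a wider range of scenarios.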
