背景
CTE 递归语法是PG 8.4引入的功能, 至今已经10多年, 非常文档.
CTE 递归可以解决很多问题: 时序场景取所有传感器最新的value, 图式数据的搜索(一度人脉,N度人脉,最近的路径关系), 树状数据的累加分析, 知识图谱, 去稀疏数据的唯一值等.
使用CTE递归比通用的方法通常有数百倍的性能提升.
https://github.com/digoal/blog/blob/master/202105/20210529_01.md#%E7%94%A8%E4%BE%8B 用例
假设传感器有1万个, 每个传感器每秒上传一条记录.
取出今天处于活跃状态(有数据)的传感器的最后一个值.
1、创建测试表
create unlogged table tbl_sensor_log (
id serial8 ,
sid int, -- 传感器ID (例如 网约车、警车、巡逻车、共享单车、物联网传感器等设备)
val jsonb, -- 传感器上传的数据
crt_time timestamp -- 上传时间
)
partition by range (crt_time)
;
2、创建分区
do language plpgsql $$
declare
begin
for i in 0..365 loop
execute format($_$
create unlogged table tbl_sensor_log_%s PARTITION of tbl_sensor_log
for values from (%L) to (%L)
$_$, to_char(current_date+i, 'yyyymmdd'), current_date+i, current_date+i+1);
end loop;
end;
$$;
3、创建索引
insert into tbl_sensor_log (sid, val, crt_time)
select random()*10000 , row_to_json(row(random(),random(),clock_timestamp())), current_date+(round(random()::numeric*72::numeric,2)||' hour')::interval
from generate_series(1,50000000);
4、写入5000万条记录, 均匀分布在最近3天的分区内
insert into tbl_sensor_log (sid, val, crt_time)
select random()*10000 , row_to_json(row(random(),random(),clock_timestamp())), current_date+(round(random()::numeric*72::numeric,2)||' hour')::interval
from generate_series(1,50000000);
方法1: 使用传统的窗口查询
select id,sid,val,crt_time from
(
select *, row_number() over w as RN
from tbl_sensor_log
where crt_time >= current_date and crt_time < current_date+1
window w as (partition by sid order by crt_time desc)
) t
where rn=1;
QUERY PLAN
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Subquery Scan on t (cost=5057407.30..5598899.49 rows=83306 width=119) (actual time=40763.507..52716.622 rows=10001 loops=1)
Filter: (t.rn = 1)
Rows Removed by Filter: 16653784
-> WindowAgg (cost=5057407.30..5390633.26 rows=16661298 width=127) (actual time=40763.503..51533.043 rows=16663785 loops=1)
-> Sort (cost=5057407.30..5099060.55 rows=16661298 width=119) (actual time=40763.483..44945.556 rows=16663785 loops=1)
Sort Key: tbl_sensor_log.sid, tbl_sensor_log.crt_time DESC
Sort Method: external merge Disk: 2177080kB
-> Append (cost=0.00..1257703.57 rows=16661298 width=119) (actual time=0.065..5597.541 rows=16663785 loops=1)
Subplans Removed: 365
-> Seq Scan on tbl_sensor_log_20210529 tbl_sensor_log_1 (cost=0.00..683635.38 rows=16660933 width=119) (actual time=0.064..4066.655 rows=16663785 loops=1)
Filter: ((crt_time >= CURRENT_DATE) AND (crt_time < (CURRENT_DATE + 1)))
Planning Time: 219.559 ms
Execution Time: 53133.463 ms
(13 rows)
QUERY PLAN
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Subquery Scan on t (cost=65.16..10962058.77 rows=83306 width=119) (actual time=0.041..104082.386 rows=10001 loops=1)
Filter: (t.rn = 1)
Rows Removed by Filter: 16653784
-> WindowAgg (cost=65.16..10753792.55 rows=16661298 width=127) (actual time=0.040..102532.935 rows=16663785 loops=1)
-> Merge Append (cost=65.16..10462219.83 rows=16661298 width=119) (actual time=0.029..92266.387 rows=16663785 loops=1)
Sort Key: tbl_sensor_log.sid, tbl_sensor_log.crt_time DESC
Subplans Removed: 365
-> Index Scan using tbl_sensor_log_20210529_sid_crt_time_idx on tbl_sensor_log_20210529 tbl_sensor_log_1 (cost=0.44..9177871.39 rows=16660933 width=119) (actual time=0.029..89908.207 rows=16663785 loops=1)
Index Cond: ((crt_time >= CURRENT_DATE) AND (crt_time < (CURRENT_DATE + 1)))
Planning Time: 39.511 ms
Execution Time: 104088.128 ms
(11 rows)
方法2: 使用索引链表跳跳糖
递归, 每次扫描定位到1个目标SID, 然后跳到第二个SID, 而不是通过索引链表顺序扫描.
链表顺序扫描的缺点: 整张索引的时间范围内的所有叶子结点的每个page都要扫描到, 性能烂到家.
with recursive tmp as (
(select t from tbl_sensor_log as t where crt_time >= current_date and crt_time < current_date+1 order by sid, crt_time desc limit 1)
union all
select (select tbl_sensor_log from tbl_sensor_log where sid>(tmp.t).sid
and crt_time >= current_date and crt_time < current_date+1 order by sid, crt_time desc limit 1) as t
from tmp where tmp.* is not null
)
select (tmp.t).* from tmp
where tmp.* is not null;
QUERY PLAN
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------
CTE Scan on tmp (cost=6650.50..6652.52 rows=100 width=52)
Filter: (tmp.* IS NOT NULL)
CTE tmp
-> Recursive Union (cost=65.16..6650.50 rows=101 width=32)
-> Subquery Scan on "*SELECT* 1" (cost=65.16..65.80 rows=1 width=32)
-> Limit (cost=65.16..65.79 rows=1 width=44)
-> Merge Append (cost=65.16..10462219.83 rows=16661298 width=44)
Sort Key: t.sid, t.crt_time DESC
Subplans Removed: 365
-> Index Scan using tbl_sensor_log_20210529_sid_crt_time_idx on tbl_sensor_log_20210529 t_1 (cost=0.44..9177871.39 rows=16660933 width=44)
Index Cond: ((crt_time >= CURRENT_DATE) AND (crt_time < (CURRENT_DATE + 1)))
-> WorkTable Scan on tmp tmp_1 (cost=0.00..658.27 rows=10 width=32)
Filter: (tmp_1.* IS NOT NULL)
(13 rows)
QUERY PLAN
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
CTE Scan on tmp (cost=6650.50..6652.52 rows=100 width=52) (actual time=0.036..124.680 rows=10001 loops=1)
Filter: (tmp.* IS NOT NULL)
Rows Removed by Filter: 1
CTE tmp
-> Recursive Union (cost=65.16..6650.50 rows=101 width=32) (actual time=0.032..119.486 rows=10002 loops=1)
-> Subquery Scan on "*SELECT* 1" (cost=65.16..65.80 rows=1 width=32) (actual time=0.031..0.033 rows=1 loops=1)
-> Limit (cost=65.16..65.79 rows=1 width=44) (actual time=0.031..0.032 rows=1 loops=1)
-> Merge Append (cost=65.16..10462219.83 rows=16661298 width=44) (actual time=0.030..0.030 rows=1 loops=1)
Sort Key: t.sid, t.crt_time DESC
Subplans Removed: 365
-> Index Scan using tbl_sensor_log_20210529_sid_crt_time_idx on tbl_sensor_log_20210529 t_1 (cost=0.44..9177871.39 rows=16660933 width=44) (actual time=0.029..0.030 rows=1 loops=1)
Index Cond: ((crt_time >= CURRENT_DATE) AND (crt_time < (CURRENT_DATE + 1)))
-> WorkTable Scan on tmp tmp_1 (cost=0.00..658.27 rows=10 width=32) (actual time=0.011..0.011 rows=1 loops=10002)
Filter: (tmp_1.* IS NOT NULL)
Rows Removed by Filter: 0
SubPlan 1
-> Limit (cost=65.16..65.81 rows=1 width=44) (actual time=0.011..0.011 rows=1 loops=10001)
-> Merge Append (cost=65.16..3572009.02 rows=5554009 width=44) (actual time=0.011..0.011 rows=1 loops=10001)
Sort Key: tbl_sensor_log.sid, tbl_sensor_log.crt_time DESC
Subplans Removed: 365
-> Index Scan using tbl_sensor_log_20210529_sid_crt_time_idx on tbl_sensor_log_20210529 tbl_sensor_log_1 (cost=0.44..3115515.85 rows=5553644 width=44) (actual time=0.010..0.010 rows=1 loops=10001)
Index Cond: ((sid > (tmp_1.t).sid) AND (crt_time >= CURRENT_DATE) AND (crt_time < (CURRENT_DATE + 1)))
Planning Time: 69.016 ms
Execution Time: 125.866 ms
(24 rows)
方法3: 使用subquery
但是, 需要维护一张SID表, 实际业务逻辑可能比这复杂, SID表可能没有这么好维护.
而且还有1个问题: 今天没有活跃的SID也会被查出来. 如果选择过滤今天不活跃的记录, 需要多次评估, 性能就会下降.
create table tbl_sensor (sid int primary key);
insert into tbl_sensor select generate_series(0,10010);
select (select tbl_sensor_log from tbl_sensor_log where sid=t.sid and crt_time >= current_date and crt_time < current_date+1 order by crt_time desc limit 1) as val
from tbl_sensor as t;
QUERY PLAN
--------------------------------------------------------------------------------------------------------------------------------------------------------------------
Index Only Scan using tbl_sensor_pkey on tbl_sensor t (cost=0.29..509812.03 rows=10011 width=32)
SubPlan 1
-> Limit (cost=49.58..50.91 rows=1 width=40)
-> Append (cost=49.58..2751.07 rows=2036 width=40)
Subplans Removed: 365
-> Index Scan using tbl_sensor_log_20210529_sid_crt_time_idx on tbl_sensor_log_20210529 tbl_sensor_log_1 (cost=0.44..1880.54 rows=1671 width=40)
Index Cond: ((sid = t.sid) AND (crt_time >= CURRENT_DATE) AND (crt_time < (CURRENT_DATE + 1)))
(7 rows)
QUERY PLAN
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Index Only Scan using tbl_sensor_pkey on tbl_sensor t (cost=0.29..509812.03 rows=10011 width=32) (actual time=0.039..116.315 rows=10011 loops=1)
Heap Fetches: 0
SubPlan 1
-> Limit (cost=49.58..50.91 rows=1 width=40) (actual time=0.011..0.011 rows=1 loops=10011)
-> Append (cost=49.58..2751.07 rows=2036 width=40) (actual time=0.011..0.011 rows=1 loops=10011)
Subplans Removed: 365
-> Index Scan using tbl_sensor_log_20210529_sid_crt_time_idx on tbl_sensor_log_20210529 tbl_sensor_log_1 (cost=0.44..1880.54 rows=1671 width=40) (actual time=0.010..0.010 rows=1 loops=10011)
Index Cond: ((sid = t.sid) AND (crt_time >= CURRENT_DATE) AND (crt_time < (CURRENT_DATE + 1)))
Planning Time: 40.798 ms
Execution Time: 117.059 ms
(10 rows)
过滤不活跃的SID将增加计算量, 性能有下降.
select * from
(
select (select tbl_sensor_log from tbl_sensor_log where sid=t.sid and crt_time >= current_date and crt_time < current_date+1 order by crt_time desc limit 1) as val
from tbl_sensor as t
) t1
where t1.val is not null;
QUERY PLAN
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Index Only Scan using tbl_sensor_pkey on tbl_sensor t (cost=0.29..1016895.27 rows=9961 width=32) (actual time=0.044..173.412 rows=10001 loops=1)
Filter: ((SubPlan 2) IS NOT NULL)
Rows Removed by Filter: 10
Heap Fetches: 0
SubPlan 1
-> Limit (cost=49.58..50.91 rows=1 width=40) (actual time=0.007..0.007 rows=1 loops=10001)
-> Append (cost=49.58..2751.07 rows=2036 width=40) (actual time=0.007..0.007 rows=1 loops=10001)
Subplans Removed: 365
-> Index Scan using tbl_sensor_log_20210529_sid_crt_time_idx on tbl_sensor_log_20210529 tbl_sensor_log_1 (cost=0.44..1880.54 rows=1671 width=40) (actual time=0.006..0.006 rows=1 loops=10001)
Index Cond: ((sid = t.sid) AND (crt_time >= CURRENT_DATE) AND (crt_time < (CURRENT_DATE + 1)))
SubPlan 2
-> Limit (cost=49.58..50.91 rows=1 width=40) (actual time=0.010..0.010 rows=1 loops=10011)
-> Append (cost=49.58..2751.07 rows=2036 width=40) (actual time=0.010..0.010 rows=1 loops=10011)
Subplans Removed: 365
-> Index Scan using tbl_sensor_log_20210529_sid_crt_time_idx on tbl_sensor_log_20210529 tbl_sensor_log_3 (cost=0.44..1880.54 rows=1671 width=40) (actual time=0.009..0.009 rows=1 loops=10011)
Index Cond: ((sid = t.sid) AND (crt_time >= CURRENT_DATE) AND (crt_time < (CURRENT_DATE + 1)))
Planning Time: 68.114 ms
Execution Time: 174.239 ms
(18 rows)