PostgreSQL bloom filter index 擴充 for bigint

2018-10-22 23:50:00

背景

凡是支援HASH函數，以及相等operator的類型，都可以使用bloom filter index .

擴充方法見本文。

原文

https://obartunov.livejournal.com/201027.html

Bloom index by default works for int4 and text, but other types with hash function and equality operator could be supported.

Just use opclass interface, for example, for type bigint

create extension bloom;

postgres=# select * from pg_opclass where opcname='bigint_ops';

(0 rows)

建立bigint的bloom filter index支援。

CREATE OPERATOR CLASS bigint_ops

DEFAULT FOR TYPE bigint USING bloom AS

OPERATOR 1 = (bigint, bigint),

FUNCTION 1 hashint8(bigint);

(1 row)

Now, you can build bloom index for bigint data type.

Data types, which could be supported by bloom index.

查詢可以支援bloom filter的類型，HASH函數

凡是支援HASH函數，以及相等operator操作符的類型，都可以使用bloom filter index .

postgres=# select oid,* from pg_am;

oid	amname	amhandler	amtype
403	btree	bthandler	i
405	hash	hashhandler
783	gist	gisthandler
2742	gin	ginhandler
4000	spgist	spghandler
3580	brin	brinhandler
18204	rum	rumhandler
136050	bloom	blhandler

(8 rows)

SELECT oc.opcintype::regtype, p.amproc FROM pg_opclass oc

JOIN pg_amproc p ON p.amprocfamily = oc.opcfamily

WHERE oc.opcmethod = 405 -- hash am

AND oc.opcdefault -- 預設proc for this am

https://www.postgresql.org/docs/devel/static/xindex.html

Strategies number

AND p.amprocnum = 1

AND p.amproclefttype = oc.opcintype

AND p.amprocrighttype = oc.opcintype;

opcintype          |     amproc

character	hashbpchar
"char"	hashchar
date	hashint4
anyarray	hash_array
real	hashfloat4
double precision	hashfloat8
inet	hashinet
smallint	hashint2
integer
bigint	hashint8
interval	interval_hash
macaddr	hashmacaddr
name	hashname
	hashoid
oidvector	hashoidvector
text	hashtext
time without time zone	time_hash
numeric	hash_numeric
timestamp with time zone	timestamp_hash
time with time zone	timetz_hash
timestamp without time zone
boolean
bytea	hashvarlena
xid
cid
abstime
reltime
aclitem	hash_aclitem
uuid	uuid_hash
pg_lsn	pg_lsn_hash
macaddr8	hashmacaddr8
anyenum	hashenum
anyrange	hash_range
jsonb	jsonb_hash

(34 rows)

建立索引，例子

postgres=# create table test(id int, c1 int8, c2 int8, c3 int8);

CREATE TABLE

postgres=# create index idx_test_1 on test using bloom (c1,c2,c3);

CREATE INDEX

postgres=# set enable_seqscan=off;

SET

postgres=# explain select * from test where c1=1::int8 and c2=1::int8 and c3=1::int8;

QUERY PLAN

Bitmap Heap Scan on test (cost=15.73..15.75 rows=1 width=28)

Recheck Cond: ((c1 = '1'::bigint) AND (c2 = '1'::bigint) AND (c3 = '1'::bigint))

-> Bitmap Index Scan on idx_test_1 (cost=0.00..15.73 rows=1 width=0)

Index Cond: ((c1 = '1'::bigint) AND (c2 = '1'::bigint) AND (c3 = '1'::bigint))

(4 rows)

postgres=# set enable_seqscan =on;

QUERY PLAN

Seq Scan on test (cost=0.00..9.85 rows=1 width=28)

Filter: ((c1 = '1'::bigint) AND (c2 = '1'::bigint) AND (c3 = '1'::bigint))

(2 rows)

參考

轉自阿裡雲德哥

PostgreSQL bloom filter index 擴充 for bigint

繼續閱讀

輸出蛇形矩陣 C語言輸出蛇形矩陣

1.54寸TFT ST7789液晶屏圖檔如何取模

三子棋——年輕人的第一款小遊戲前言正文總結

DOG（4）：解析器的部分實作細節先來說說parser一些可能迷惑的地方結果如何傳回?pcd其實是一回事最後的一點說明

NYOJ 269--VF

二叉樹三種周遊(先序，中序，後序)----超詳細引入先序周遊後序周遊中序周遊總結

UVA 110 Meta-Loopless Sorts

為什麼要選擇UniDAC

CRC32和CRC8校驗代碼，C語言版

241 Different Ways to Add Parentheses（C代碼版）

C語言：初學者必定看懂的注釋！！！猴子吃桃問題。猴子第一天摘下若幹個桃子，每天都吃了前一天剩下的一半零一個，到第10天早上想再吃的時候，就剩下一個桃子. 求第一天共摘多少個桃子。

[轉]九大排序算法——C語言實作及詳解

while 循環、do- while 循環和 for 循環之間的那點事C語言自學之三種循環比較

結構體：typedef與struct的差別

hdu7108哈希