Editlog與FileChannel Log的Group Commit

2017-11-20 23:50:00

先來看看Hadoop是怎麼處理的：

Editlog是可以被多個線程并發寫入的，每個線程維護了自己最新的一個事務ID：

privatestaticfinalThreadLocal<TransactionId> myTransactionId = newThreadLocal<TransactionId>() {

protectedsynchronizedTransactionId initialValue() {

returnnewTransactionId(Long.MAX_VALUE);

}

};

在送出的時候，首先獲得送出時最新的事務ID：

synchronized(this){

TransactionId id = myTransactionId.get();

id.txid= txid;

然後開始同步(代碼被删減)：

//拿到自己的事務ID

longmytxid = myTransactionId.get().txid;

booleansync = false;

try{

EditLogOutputStream logStream = null;

try {

//如果自己的事務未被同步，但是同步正在被其他線程處理，那麼就阻塞

while (mytxid > synctxid && isSyncRunning) {

wait(1000);

} catch (InterruptedException ie) {}

//當被喚醒或者逾時發現自己的事務已經被group commit了，那麼就傳回

if (mytxid <= synctxid) {

return;

//否則開始進行sync

isSyncRunning = true;

sync = true;

//Hadoop的editlog使用了double buffer來達到重新整理和寫不阻塞；這裡置換buffer

editLogStream.setReadyToFlush();

logStream = editLogStream;

if (logStream != null) {

logStream.flush();

} catch(IOException ex) {}

} finally{

if (sync) {

isSyncRunning = false;

//重新整理完成，喚醒阻塞線程

this.notifyAll();

而在Flume File-channel裡的group commit也是類似的方式，不過更為簡潔：

一樣是分兩個階段，每個階段都是同步方法,并且Flume的transactionid和position是分開的，每次隻需同步檔案末尾位置：

Commit();

Sync();

//在送出的時候更新最後送出位置

synchronizedvoidcommit(ByteBuffer buffer) throws IOException {

write(buffer);

lastCommitPosition= position();

//若已經被同步了則什麼也不做，傳回

synchronizedvoidsync() throwsIOException {

if(lastSyncPosition< lastCommitPosition){

getFileChannel().force(false);

lastSyncPosition = position();

syncCount++;

本文轉自MIKE老畢 51CTO部落格，原文連結：http://blog.51cto.com/boylook/1300543，如需轉載請自行聯系原作者

Editlog與FileChannel Log的Group Commit

繼續閱讀

hadoop 用MR實作join操作

Centos7 下 Hadoop 2.6.4 分布式叢集環境搭建摘要叢集準備安裝JDK 安裝 Hadoop 2.6.4 部署 slaver1-slaver4 啟動 hadoop 叢集成功了

寶塔面闆mysql恢複2018.1.8更新

Centos7 MySQL 5.7 安裝MySQL 5.7 安裝

查找入職員工時間排名倒數第三的員工所有資訊

Hibernate使用Hibernate的“3個準備，7個步驟”Hibernate API簡介操作實體對象對象識别

雲計算面試題——mysql/存儲引擎/備份

SQL語言基礎：常用的資料查詢語句

MapReduce的幾個企業級經典面試案例MapReduce的幾個企業級經典面試案例

Ubuntu16.04安裝Apache+MySQL+PHP1. 安裝Apache2. 安裝MySQL3. 安裝PHP4. 安裝phpMyAdmin

ubuntu14.04下安裝hbse1.0.1.1

MySQL的4種隔離級别？出現問題

User Defined Hadoop DataType

neo4j之cypher使用文檔

Ambari介紹和架構原理

mysql使用source指令導入.sql檔案