《Viewstamped Replication Revisited》簡要翻譯

本文主要是對《Viewstamped Replication Revisited》這篇論文做簡要翻譯，用白話總結每個章節的主要內容。
本文發表於2012年，作者為MIT的研究人員，其中 Barbara Liskov 是2008年圖靈獎
得主，主要內容是對《Viewstamped Replication: A New Primary Copy Method to Support Highly-Available Distributed Systems》這篇文章的改進（作者於1988年發表）。1988年的這篇文章實際上比Lamport的《The Part-Time Parliament》（1989）還要早一年，可以說VR協議與Paxos協議是在相近的時間發明的，且據說兩文作者發表之前並無交流，因此可以認為這兩個一致性協議都是開創性的。本文之後，我將繼續嘗試寫其他一致性協議的解讀及不同協議的對比，歡迎留言交流。

Abstract

這篇文章介紹viewstampd replication的改進版本。

1 Introduction

VR 工作在非同步網路中，能夠處理節點crash。VR 提供 state machine replication，適用於實現replicated service，例如lock manager、file system。

與最初版本相比，有幾點不同：

- 更簡單、性能更好，一些改進點來源於PBFT演算法。

- 不需要利用磁碟，利用replicated state來提供persistence；- 展示了reconfiguration 協議，用於成員變更。- 獨立於application，而原始版本與database結合。

VR幾乎與Paxos同時發表，但二者並不相同：

1）VR是replication protocol，而paxos是consensus protocol，VR利用了類似於paxos的consensus protocol來實現state machine replication；
2）VR的consensus protocol不需要寫盤。

2 Background

2.1 assumptions

VR僅處理crash failure：一個node要麼正常工作，要麼crash，不處理拜占庭問題。 VR為非同步網路設計，消息可能延遲、丟失、亂序、重複，但假設重發一條消息最終一定能成功發送。

2.2 replica groups

VR在不超過f個副本出錯時能夠保證可靠性和可用性，通過2f+1個副本實現。

多於2f+1個副本沒有太大意義，因為閾值f固定時（至多容忍f個副本出錯），quorum需要取 K-f，其中K為總的副本數量。

《Viewstamped Replication Revisited》簡要翻譯

Abstract

1 Introduction

2 Background

2.1 assumptions

2.2 replica groups

2.3 架構

3 overview

4 The VR protocol

4.1 normal operation

4.2 view change

4.3 recovery

4.4 non-deterministic operation

4.5 client recovery

5 Pragmatics

5.1 efficient recovery

5.2 state transfer

5.3 view change

6 optimizations

6.1 witness

6.2 batching

6.3 fast reads

6.3.1 reads at the primary

6.3.2 reads at backups

7 reconfiguration

7.1 reconfiguration details

7.1.1 processing in the new group

7.1.2 processing at replicas being replaced

7.2 other protocol changes

7.3 shutting down old replicas

7.4 locating the group

7.5 discussion

8. correctness

8.1 correctness of view changes

8.2 correctness of the recovery protocol

8.3 correctness of reconfiguration

9 conclusion

熱門新聞

週熱門

《Viewstamped Replication Revisited》簡要翻譯

Abstract

1 Introduction

2 Background

2.1 assumptions

2.2 replica groups

2.3 架構

3 overview

4 The VR protocol

4.1 normal operation

4.2 view change

4.3 recovery

4.4 non-deterministic operation

4.5 client recovery

5 Pragmatics

5.1 efficient recovery

5.2 state transfer

5.3 view change

6 optimizations

6.1 witness

6.2 batching

6.3 fast reads

6.3.1 reads at the primary

6.3.2 reads at backups

7 reconfiguration

7.1 reconfiguration details

7.1.1 processing in the new group

7.1.2 processing at replicas being replaced

7.2 other protocol changes

7.3 shutting down old replicas

7.4 locating the group

7.5 discussion

8. correctness

8.1 correctness of view changes

8.2 correctness of the recovery protocol

8.3 correctness of reconfiguration

9 conclusion

超長文本，用什麼資料庫儲存？

現在會後端，想從安卓客戶端開始學然後搞到前端，，在跳到後端，最後做成全棧可行麼?

有沒有像分散式存儲一樣，硬體級別的分散式內存?

分散式存儲做存儲負載平衡時，怎麼處理新來的讀寫請求呢？

ShardingSphere x Seata，一致性更強的分散式資料庫中間件

TiKV 源碼解析系列文章（十）Snapshot 的發送和接收

區塊鏈擴容方案之—分片（sharding）

直面PHP微服務架構挑戰

論文翻譯：Haystack

Linux伺服器設計（五）：大文件存儲簡析

LevelDB 寫操作源碼分析--寫入限制

GFS論文筆記

phxpaxos源碼分析5. init network (下)

JITStack(集特)：「Ceph淺析」系列之一——Ceph概況

一篇文章讓你理解Ceph的三種存儲介面(塊設備、文件系統、對象存儲)

熱門新聞

週熱門