TIME: 12:00 noon to approximately 1:00 pm EDT
PLACE: Virtual; a Zoom link will be emailed closer to the seminar
SPEAKER: Bibek Wagle
Senior Member of Technical Staff, Oracle
Designing for Consensus in Data Plane
Fault tolerance is an important facet of a highly available distributed system, and replicated state machines are a common way to build fault-tolerant systems. Raft is a popular consensus algorithm for state machine replication, and it owes its popularity to being easy to understand and implement. Raft is based on the idea of a "leader" in the cluster: one node is elected leader and is responsible for replying to client requests. A reply can be sent only after the corresponding log entry has been replicated to a majority of the servers in the cluster. Raft replicates state by exchanging messages between servers, typically over a transport such as TCP. A vanilla Raft implementation may not be suitable for high-velocity data planes or for applications that require low tail latencies.
In this talk, we will focus on improvements to the original Raft algorithm.
BIO: Bibek Wagle is a senior member of technical staff at Oracle. His interests include parallel and distributed computing, asynchronous task-based programming, and distributed runtime systems. He has a Ph.D. in computer science from Louisiana State University.
Director, Parallel Data Lab
VOICE: (412) 268-1297
Executive Director, Parallel Data Lab
VOICE: (412) 268-5485
PDL Administrative Manager
VOICE: (412) 268-6716