The rsync algorithm
Andrew Tridgell
Paul Mackerras
Department of Computer Science
Australian National University
Canberra, ACT 0200, Australia
Abstract:
This report presents an algorithm for updating a file on one machine to be
identical to a file on another machine. We assume that the two machines are
connected by a low-bandwidth high-latency bi-directional communications link.
The algorithm identifies parts of the source file which are identical to some
part of the destination file, and only sends those parts which cannot be matched
in this way. Effectively, the algorithm computes a set of differences without
having both files on the same machine. The algorithm works best when the files
are similar, but will also function correctly and reasonably efficiently when
the files are quite different.
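
The idea in the abstract can be illustrated with a minimal sketch. It is not the report's actual procedure: the block size, the function names (`block_signatures`, `compute_delta`, `apply_delta`), and the use of SHA-256 on every window are all assumptions made for illustration, and both "machines" run in one process here. The holder of the destination file summarises it as per-block digests, the holder of the source file uses those digests to decide which parts need to be sent literally, and the destination side rebuilds the source file from its own blocks plus the literal data.

```python
import hashlib

BLOCK_SIZE = 4  # hypothetical value, kept tiny so the example is easy to trace


def block_signatures(old: bytes, block_size: int = BLOCK_SIZE) -> dict:
    """Destination side: map each block's digest to its block index."""
    sigs = {}
    for index, start in enumerate(range(0, len(old), block_size)):
        block = old[start:start + block_size]
        # Keep the first index seen for any repeated block.
        sigs.setdefault(hashlib.sha256(block).digest(), index)
    return sigs


def compute_delta(new: bytes, sigs: dict, block_size: int = BLOCK_SIZE) -> list:
    """Source side: emit ("copy", block_index) or ("data", literal_bytes)."""
    delta = []
    literal = bytearray()
    pos = 0
    while pos < len(new):
        window = new[pos:pos + block_size]
        index = sigs.get(hashlib.sha256(window).digest())
        if index is not None:
            # The window matches a block the destination already holds.
            if literal:
                delta.append(("data", bytes(literal)))
                literal.clear()
            delta.append(("copy", index))
            pos += len(window)
        else:
            # No match at this offset: send one byte literally and slide on.
            literal.append(new[pos])
            pos += 1
    if literal:
        delta.append(("data", bytes(literal)))
    return delta


def apply_delta(old: bytes, delta: list, block_size: int = BLOCK_SIZE) -> bytes:
    """Destination side: rebuild the source file from old blocks and literals."""
    out = bytearray()
    for kind, value in delta:
        if kind == "copy":
            out += old[value * block_size:(value + 1) * block_size]
        else:
            out += value
    return bytes(out)


old = b"the quick brown fox jumps"
new = b"the quick red fox jumps over"
delta = compute_delta(new, block_signatures(old))
rebuilt = apply_delta(old, delta)
```

In this sketch only the `("data", ...)` entries carry file contents across the link, so similar files produce a small delta, while completely different files degrade to sending the whole source file as literal data. Hashing every window from scratch is slow and is used here only for clarity.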
Andrew Tridgell
1998-11-09