changedFiles Detect which Files Have Changed
Description
fileSnapshot takes a snapshot of a selection of files, recording summary information about each. changedFiles compares two snapshots, or compares one snapshot to the current state of the file system. The snapshots need not be the same directory; this could be used to compare two directories.
Usage
fileSnapshot(path = ".", file.info = TRUE, timestamp = NULL,
md5sum = FALSE, digest = NULL, full.names = length(path) > 1,
...)
changedFiles(before, after, path = before$path, timestamp = before$timestamp,
check.file.info = c("size", "isdir", "mode", "mtime"),
md5sum = before$md5sum, digest = before$digest,
full.names = before$full.names, ...)
## S3 method for class 'fileSnapshot'
print(x, verbose = FALSE, ...)
## S3 method for class 'changedFiles'
print(x, verbose = FALSE, ...)
Arguments
path | character vector; the path(s) to record. |
file.info | logical; whether to record |
timestamp | character string or |
md5sum | logical; whether MD5 summaries of each file should be taken as part of the snapshot. |
digest | a function or |
full.names | logical; whether full names (as in |
... | additional parameters to pass to |
before, after | objects produced by |
check.file.info | character vector; which columns from |
x | the object to print. |
verbose | logical; whether to list all data when printing. |
Details
The fileSnapshot function uses list.files to obtain a list of files, and depending on the file.info, md5sum, and digest arguments, records information about each file.
The changedFiles function compares two snapshots.
If the timestamp argument to fileSnapshot is length 1, a file with that name is created. If it is length 1 in changedFiles, the file_test function is used to compare the age of all files common to both before and after to it. This test may be unreliable: it compares the current modification time of the after files to the timestamp; that may not be the same as the modification time when the after snapshot was taken. It may also give incorrect results if the clock on the file system holding the timestamp differs from the one holding the snapshot files.
If the check.file.info argument contains a non-empty character vector, the indicated columns from the result of a call to file.info will be compared.
If md5sum is TRUE, fileSnapshot will call the tools::md5sum function to record the 32 byte MD5 checksum for each file, and changedFiles will compare the values. The digest argument allows users to provide their own digest function.
Value
fileSnapshot returns an object of class "fileSnapshot". This is a list containing the fields
info | a data frame whose rownames are the filenames, and whose columns contain the requested snapshot data |
path | the normalized |
timestamp, file.info, md5sum, digest, full.names | a record of the other arguments from the call |
args | other arguments passed via |
changedFiles produces an object of class "changedFiles". This is a list containing
added, deleted, changed, unchanged | character vectors of filenames from the before and after snapshots, with obvious meanings |
changes | a logical matrix with a row for each common file, and a column for each comparison test. |
print methods are defined for each of these types. The print method for "fileSnapshot" objects displays the arguments used to produce them, while the one for "changedFiles" displays the added, deleted and changed fields if non-empty, and a submatrix of the changes matrix containing all of the TRUE values.
Author(s)
Duncan Murdoch, using suggestions from Karl Millar and others.
See Also
Examples
# Create some files in a temporary directory
dir <- tempfile()
dir.create(dir)
writeBin(1L, file.path(dir, "file1"))
writeBin(2L, file.path(dir, "file2"))
dir.create(file.path(dir, "dir"))
# Take a snapshot
snapshot <- fileSnapshot(dir, timestamp = tempfile("timestamp"), md5sum=TRUE)
# Change one of the files.
writeBin(3L:4L, file.path(dir, "file2"))
# Display the detected changes. We may or may not see mtime change...
changedFiles(snapshot)
changedFiles(snapshot)$changes
Copyright (©) 1999–2012 R Foundation for Statistical Computing.
Licensed under the GNU General Public License.