Replace non-ASCII characters with legal characters that won't cause problems when writing to a REDCap project.
redcap_column_sanitize( d, column_names = colnames(d), encoding_initial = "latin1", substitution_character = "?" )
An array of
character values indicating the names of
the variables to sanitize. Optional.
An array of
character values indicating the names
of the variables to sanitize. Optional.
character value that replaces characters
that were unable to be appropriately matched.
A data frame with same columns, but whose character values have been sanitized.
Letters like an accented 'A' are replaced with a plain 'A'.
This is a thin wrapper around
ASCII//TRANSLIT option does the actual transliteration work. As of
R 3.1.0, the OSes use similar, but different, versions to convert the
characters. Be aware of this in case you notice OS-dependent differences.
# Typical examples are not shown because they require non-ASCII encoding, # which makes the package documentation less portable. dirty <- data.frame( id = 1:3, names = c("Ekstr\xf8m", "J\xf6reskog", "bi\xdfchen Z\xfcrcher") ) REDCapR::redcap_column_sanitize(dirty) #> id names #> 1 1 Ekstrom #> 2 2 J"oreskog #> 3 3 bisschen Z"urcher