data_dictionary dataset added to define the columns in each dataset used or exported by the functions in this package
@jacobvjk is now the maintainer.
match_name() gains a new argument in its signature, sector_classification, which is placed before the ... argument. Where users have used ... to pass additional unnamed arguments by position, they will need to update their code to pass these arguments by name so that they are not confused as an input to sector_classfication. The new argument sector_classification is optional and defaults to r2dii.data::sector_classifications. Explicitly passing a sector_classification to match_name(), enables users to use their own sector classification systems to match loan books to the abcd, assuming the sector_classification is mapped appropriately to PACTA sectors and has the same format as r2dii.data::sector_classifications. Own sector classifications can no longer be passed to match_name() via options.Complete deprecation of ald in favour of abcd (#399).
match_name gains argument join_id allowing an optional perfect join based on a mutual ID column between loanbook and abcd inputs, prior to attempting fuzzy matching (#135).
to_alias can now handle strange encodings without error (#425, @kalashsinghal @Tilmon).r2dii.match has transferred to a new organization
https://github.com/RMI-PACTA/.abcd of match_name() supersedes the argument ald (#399).With options(r2dii.match.sector_classifications = own) users can inject
their own sector_classififications instead of the default
r2dii.data::sector_classifications. With this feature, user may or may not
choose to request their sector_classifications to be added to r2dii.data.
This feature is experimental and may be dropped and/or become a new argument
to match_name() (#356 @georgeharris2deg @daisy-pacheco)`.
match_name() now errors if the column id_loan of the input loanbook has
any duplicated value id_loan (@georgeharris2deg #349).
match_name() gains ... to pass additional arguments to
stringdist::stringsim() (@evgeniadimi #310).prioritize() with 0-row input now returns the input untouched (#284).match_name(): Remove dependency on nest_by() from dplyr 1.0.0 (#303).match_name() with large loanbooks.match_name() now outputs a new column borderline (#258).crucial_lbk() helps select the minimum loanbook columns for
match_name() to run (#236).match_name() now runs faster and uses less memory (@georgeharris2deg #214).match_name() now converts ald$sector to lower case before matching
(@georgeharris2deg #257). It now returns identical output with, for example,
either "POWER" or "power". Notice that the input "POWER" in ald$sector
becomes "power" in the column sector_ald of the output.match_name() now errors with a more informative message if loanbook has
reserved columns -- alias, rowid, or sector (#233).This version includes only internal changes.