Changelog
Source:NEWS.md
slideimp 1.0.0
Breaking changes
group_imp()enforces stricter data validation. The requested feature subset must be a subset the object’s column names which must be a subset of the mappingdata.frame. Setallow_unmapped = TRUEto bypass errors when intersections are incomplete.group_imp()andtune_imp()now error when arguments are supplied that do not apply to the chosen imputation method, rather than silently ignoring them.knn_imp()now uses a logicaltreeargument to toggle between Ball tree (TRUE) and brute force (FALSE). KD tree is no longer supported.knn_imp()andpca_imp()gain more early errors and early exits.pca_imp()gains the samecolmaxandpost_imparguments asknn_imp().prep_groups()(formerlygroup_features()) is the new name for the grouping function. It now accepts a column name vector instead of a full matrix.sample_na_loc()(formerlyinject_na()) is now exported. The original remains accessible viaslideimp:::inject_na()for legacy code.sim_mat()now returns a matrix in sample-by-column format for immediate compatibility with other package functions.perc_NAis renamed toperc_total_na, and dimensions are now specified vian(rows) andp(columns).tune_imp()gains a unifiedmethodargument that applies to bothpca_imp()andknn_imp(), replacingpca_methodandknn_method. Therepargument is renamed ton_reps.tune_imp()results from v0.5.4 are no longer reproducible because internal NA generation now usessample_na_loc().The
khanmiss1dataset has been removed.
New features
compute_metrics()now supports data frames with aresultlist column containing truth and estimate columns, similar to yardstick.group_imp()andprep_groups()automatically look up Illumina manifests using the register-on-load pattern for{slideimp.extra}.knn_imp()gainsmax_cacheto control the internal cache size (defaults to 4GB).sim_mat()gains arhoargument to support compound symmetry correlation structures in simulated matrices.sim_mat()andtune_imp()gain dedicated print methods that provide concise summaries instead of dumping raw data to the console.slide_imp()gainslocation,flank, anddry_runarguments for fixed-window imputation, “flank mode” for features surrounding a subset, and pre-computation inspection of window statistics.tune_imp()gains granular control over NA injection vian_cols,n_rows,num_na, andna_col_subset. Pre-calculated locations can also be passed tona_locto compare methods using identical NA patterns.
Minor improvements and fixes
col_vars()andmean_imp_col()have been overhauled to use the faster RcppArmadillo backend and now support parallel computation with OpenMP.Dependencies are streamlined. tibble and purrr are removed as hard dependencies, cli is added for more informative messaging, and carrier is added as an explicit dependency.
Documentation is thoroughly overhauled with numerous consistency improvements and bug fixes.
RhpcBLASctl is added as a suggested package to allow pinning BLAS cores and avoid thrashing during parallel runs.
group_imp()andtune_imp()prioritize process-level parallelization via mirai.knn_imp()supports OpenMP-controlled parallelization via thecoresargument when mirai daemons are not active.knn_imp()andpca_imp()use optimized internal Rcpp functions for better performance.
slideimp 0.5.4
CRAN release: 2026-01-07
CRAN resubmission.
group_features()is added to help with creating the group tibble needed forgroup_imp().pca_imp()now allowsrow.w = "n_miss"to scale row weights by the number of missing values per row.