Optimize dm_apply_filters_impl() to use reduce() instead of map() #2313
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
The
dm_apply_filters_impl()function was inefficiently processing all tables in a dm object, even those unaffected by filters. This caused unnecessary computation, especially problematic for large dms with sparse filtering or remote databases.Changes Made
Before:
After:
Key Optimizations
reduce()instead ofmap()as requested in the issuePerformance Impact
Testing
This optimization addresses the performance concerns that triggered duckplyr issue #441 while maintaining full backward compatibility.
Fixes #2277.
Warning
Firewall rules blocked me from connecting to one or more addresses
I tried to connect to the following addresses, but was blocked by firewall rules:
cran.r-project.org/usr/lib/R/bin/exec/R -e install.packages(c('devtools',~+~'testthat'),~+~repos='REDACTED')(dns block)esm.ubuntu.com/usr/lib/apt/methods/https(dns block)If you need me to access, download, or install something from one of these locations, you can either:
💬 Share your feedback on Copilot coding agent for the chance to win a $200 gift card! Click here to start the survey.