March 17, 2026

How to choose a disk space analyzer for Mac: A transparent evaluation methodology

Written by

A Mac specialist with a QA engineering background, focused on troubleshooting and how-to guides.

Approved by

Reviewed by a QA engineer at Nektony and Apple Certified Support Professional with hands-on experience testing Mac software.

Alex Holovchenko

54 criteria to objectively compare disk space analyzers — from a team that’s been building macOS utilities for 15 years.

This Article Contains

TL;DR

For users:

We’ve created 54 verifiable criteria to evaluate disk space analyzers
Test your analyzer in 15–20 minutes using the Level 1 criteria (9 total) — that’s the bare minimum for safety
If a tool fails even one Level 1 criterion — don’t use it
Use our evaluation calculator for automatic results

For reviewers and QA:

54 criteria across 3 levels: 9 blockers, 22 important, 23 bonuses
Maximum score: 94 (formula: L1×3 + L2×2 + L3×1)
4 test datasets with ground truth (scripts attached)
7 testing phases, 3–5 hours for a full cycle
Download the full methodology with templates and scripts

Why this methodology exists

The Mac App Store and developer websites are packed with dozens of disk space analyzers. They all promise to “quickly show you what’s eating up your disk and help free up space.” But how do you tell which one is actually accurate and safe?

Picking the wrong tool can cost you:

Incorrect folder sizes (the tool doesn’t account for APFS clones or purgeable space)
Important files removed (system caches, Application Support)
“10 GB Freed up” — but the actual space didn’t change (snapshots, purgeable)
Crashes when dealing with a large number of files

macOS makes this harder than it looks. APFS clones, purgeable space, Time Machine snapshots, iCloud evicted files — standard tools (du, stat, Finder) don’t always show the real picture. That’s not a bug — it’s how the APFS file system works. A good analyzer needs to take this into account, or at least match what Finder shows.

At Nektony, we’ve been developing macOS utilities since 2011. Over the years, we’ve analyzed the behavior of dozens of competing products and collected feedback from hundreds of thousands of users. Based on that experience, we created a transparent evaluation methodology: 54 criteria that actually matter for accurate and safe analysis.

We’re publishing it transparently so that:

You can evaluate any tool on your own
Journalists and reviewers have an objective comparison framework
Quality standards in the category become higher for everyone

Who this methodology is for

Audience	How to use
Regular users	Go through the Level 1 criteria (15–20 minutes) or use the calculator
Journalists and reviewers	Use the full methodology to compare tools objectively
QA engineers	Download the full methodology with templates and scripts
Developers	Use it as a requirements checklist for your own product

Core principle

When evaluating tools, we follow a clear priority:

Safety > Accuracy > Usability > Speed

A fast tool that shows wrong sizes or removes important files is useless. A slow but accurate and safe tool is useful.

Terms and definitions

Before starting the evaluation, it’s important to understand the terminology:

Term	Definition
APFS (Apple File System)	Apple’s file system, used by default since macOS High Sierra (2017). Supports clones, snapshots, and transparent compression
APFS clone	A copy-on-write file copy. The data isn’t physically duplicated until one of the files is modified. `du`, `stat`, and Finder show the full logical size and don’t see clone sharing
Purgeable space	Space that macOS can free up automatically (caches, iCloud data, logs). Shown in `diskutil info /`
Time Machine snapshot	A local file system snapshot. It takes up disk space but isn’t visible as files or folders
Cloud-only evicted file	An iCloud Drive file uploaded to the cloud. Locally it takes up minimal space (stub), but Finder shows the full size
Hard link	Multiple names pointing to the same data. Removing one name doesn’t remove the data as long as other links exist
Symlink	A shortcut to a file or folder path. Contains no data of its own
Sparse file	A file with “empty” blocks that don’t take up space. Logical size is larger than physical size
Firmlink	macOS links between the Data and System APFS volumes
Logical size	The size of a file’s contents in bytes (apparent size)
Physical size	The actual space used subject to FS blocks, clones, and compression (on-disk size)
`du` (disk usage)	A Terminal command that shows allocated size. Doesn’t see APFS clone sharing
Ground truth	A known correct result used to verify accuracy
FDA (Full Disk Access)	A macOS permission to access protected areas (~Library, /Library, Mail, Safari, etc.)

Scoring system

All 54 criteria are divided into three levels by importance:

Level	Name	Criteria	Weight	Max points	Description
L1	Must Have	9	×3	27	Blockers. Fail = unsafe
L2	Should Have	22	×2	44	Important. Affect accuracy and usability
L3	Nice to Have	23	×1	23	Bonuses. Nice extras
	Total	54		94

Scoring formula

Total score = (L1 × 3) + (L2 × 2) + (L3 × 1)
Maximum: (9 × 3) + (22 × 2) + (23 × 1) = 27 + 44 + 23 = 94 points

How to interpret the result

73–94

✅ Recommended

47–72

⚠️ With caveats

<47

❌ Not recommended

Important!

Failing any Level 1 criterion automatically falls under “Not recommended,” regardless of the total score.

Red flags

If a tool matches any of the following, don’t use it:

🚩 Red flag

Data loss

Removes files without user confirmation
Removes the wrong files (mismatch between what’s shown and what’s removed)
Permanently removes files by default (bypasses Trash without warning)
Removes system files without warning

Safety and trust

Sends your file list or paths to an external server
Installs additional software without consent (bundleware or adware)
Requests root/admin access without a good reason
Unsigned, not notarized — a disk analyzer can see every file on your Mac, so trusting that kind of access to an unsigned tool is risky: there’s no way to verify the developer or guarantee there’s no malicious code

Stability

Crashes or loses data during a basic scenario
Infinitely loops / freezes with no way to cancel
Consumes all available RAM and gets force-killed by the system

Critical deception

Shows deliberately false sizes (off by more than 50% from actual)
Claims “X GB freed up,” but the actual space didn’t change
Scans and shows results, then demands payment for any further action (without warning you beforehand)

Evaluation criteria

Level 1: Must Have (blockers)

Safety and basic accuracy criteria.

If a tool fails at least one criterion, using it is risky.

1.1Basic analysis accuracy (±5%)

Why it matters: The tool should show folder sizes that match reality. Errors over 5% can lead to bad cleanup decisions.

How to check:

Create a test folder with a known structure (Dataset 01, ~4.6 GB, ~2,800 files).
Scan it with the tool.
Compare the reported top-level folder sizes with the ground truth.
Deviation should be ≤ 5% for each folder.

Result:

✅ All folders are within ±5% of ground truth
❌ FAIL: At least one folder > ±5%

1.2Stability (with no crashes on Dataset 01)

Why it matters: The tool shouldn’t crash or freeze on any of the test datasets (Dataset 01–04).

How to check:

Scan all 4 test datasets one after another
Monitor the tool in Activity Monitor
Are there any crashes, freezes, or uncontrolled memory usage spike?

Result:

✅ The tool runs stable on all datasets
❌ FAIL: The tool crashes or freezes on any dataset

1.3Confirmation before removal

Why it matters: An accidental click shouldn’t lead to data loss. Removal should only happen after explicit confirmation.

How to check:

Select a file from the results and click Remove
Does a confirmation dialog appear?
Does it show the number of files and size? Where does it go after removal?

Result:

✅ The tool shows a confirmation dialog
❌ FAIL: The tool removes immediately without confirmation

1.4Removal to Trash by default

Why it matters: Removed files should go to Trash and be recoverable. It’s your safety net against mistakes.

How to check:

Remove a file through the tool
Open Trash in the Dock
Is the file there? Can you restore it?

Result:

✅ Files go to Trash by default
❌ FAIL: The tool removes files permanently, or there is no Trash option

1.5Removal matches selection

Why it matters: Only the selected files should be removed — nothing more, nothing less.

How to check:

Select 3 specific files, keep in mind the names and paths
Remove them
Check in Finder: Were only the selected files removed? Are other files untouched?
Restore them from Trash — are the files intact?

Result:

✅ Exactly the selected files were removed
❌ FAIL: Mismatch between selected and removed

1.6System file protection

Why it matters: Removing system files can break macOS or other tools.

How to check:

Scan the boot volume
Try to remove a file from
```
~/Library/Preferences/
```
Copy
Does the tool warn you about the risks or block the action?

Result:

✅ The tool blocks or warns
❌ FAIL: The tool allows removal without warning

1.7The tool doesn't send data to external servers

Why it matters: The tool shouldn’t transmit your file list or paths to external servers. That’s confidential information.

How to check:

Check network activity via Activity Monitor (Network) or Little Snitch
Can it be used completely offline?
Is there an explicit privacy policy?

Result:

✅ There are no file/path data leaks
❌ FAIL: The tool sends data to an external server

1.8Signed and notarized by Apple

Why it matters: If a tool is signed and notarized, this means Apple has checked it for malicious code.

How to check:

Terminal:

codesign -dv --verbose=2 /path/to/app.app

Copy

Terminal:

spctl --assess --verbose /path/to/app.app

Copy

Result:

✅ The tool is signed and notarized
❌ FAIL: The tool is not signed or not notarized

1.9The tool doesn't install third-party software

Why it matters: The installation shouldn’t bundle third-party software, adware, or toolbars.

How to check:

Install the tool, watching the process carefully
Are there any extra offers, checkboxes, or ad pop-ups?
Check
```
/Applications
```
Copy
and
```
~/Library
```
Copy
after installation — anything new show up?

Result:

✅ Clean install, just the main app
❌ FAIL: The tool installs third-party software or shows ads

Level 2: Should Have (important)

These criteria affect accuracy, usability, and completeness. Without them, a tool can still be used, but with limitations.

Data source coverage

#	Criterion	How to check	Result
2.1	Scanning the boot APFS volume	Select the root drive in the analyzer and scan the boot APFS volume. Does it show the size correctly?	✅ Yes / ❌ No
2.2	Scanning external volumes (ExFAT, HFS+)	Connect an external ExFAT and HFS+ volume — does it see and scan them?	✅ Both / ⚠️ One / ❌ No
2.3	Selecting volume or folder for scanning	Can you pick a specific volume or folder (not the whole disk)?	✅ Yes / ❌ No
2.4	Distinguishing iCloud cloud-only from local files	Does it differentiate between cloud-stored and locally downloaded files?	✅ Yes / ⚠️ Partially / ❌ No
2.5	Handling Full Disk Access	Revoke FDA — does it show a clear warning + instructions?	✅ Yes / ❌ No

Performance

#	Criterion	How to check	Result
2.6	Scan time within baseline	Compare the time against the baseline table (see below)	✅ Excellent / ⚠️ Acceptable / ❌ Unacceptable
2.7	RAM < 1 GB during scan	Activity Monitor — peak usage?	✅ < 500 MB / ⚠️ 500 MB-1 GB / ⚠️ 1-2 GB / ❌ > 2 GB
2.8	Progress indicator during scan	Can you see progress (%, files, path)? Is there no stuck at 99%?	✅ Yes / ❌ No
2.9	Pause/cancel scan	Are there Pause and Cancel buttons?	✅ Both / ⚠️ Cancel only / ❌ None
2.10	UI stays responsive (no beach ball)	During a Dataset 04 scan — can you interact with the UI?	✅ Yes / ❌ No

Accuracy

#	Criterion	How to check	Result
2.11	Accuracy on external volume (Dataset 02)	Do sizes on ExFAT/HFS+ match ground truth?	✅ ±5% / ⚠️ ±10% / ❌ > ±10%
2.12	APFS correctness (Dataset 03)	Clones, hard links, sparse — correct sizes?	✅ Physical or explanation / ⚠️ Logical size, same as Finder / ❌ Different numbers
2.13	Accuracy under stress (Dataset 04)	120,000 files — do the number and size match?	✅ Yes / ⚠️ Minor deviations / ❌ Major gaps

Visualization and navigation

#	Criterion	How to check	Result
2.14	Visualization (treemap/sunburst/list)	How many visualization modes are available?	✅ 2+ / ⚠️ 1 / ❌ None
2.15	Folder drill-down navigation	Can you click a folder and drill down into it?	✅ Yes / ❌ No
2.16	Item info (size/date/path)	Does each item show full details?	✅ All three / ⚠️ In part / ❌ None
2.17	Search by file/folder name	Is there a search or filter by name in the results?	✅ Yes / ❌ No

Operations on results

#	Criterion	How to check	Result
2.18	Multi-selecting files	Can you select multiple items for an operation?	✅ Yes / ❌ No
2.19	Showing total size of selection	Can you see the total size of selected items?	✅ Yes / ❌ No

Reliability

#	Criterion	How to check	Result
2.20	Correct symlink handling	The tool neither follows symlinks nor counts the target file twice. Is it so?	✅ Yes / ❌ No
2.21	Unicode/special characters/long paths	Can it work with emoji, Cyrillic, paths ~900 characters long?	✅ Yes / ❌ No
2.22	Error message quality	Are errors clear? Can you keep working in case of partial failures?	✅ Yes / ⚠️ Partially / ❌ No

Level 3: Nice to Have (bonuses)

Nice extras that improve the overall experience. Check for:

#	Feature	How to check
3.1	Network drive scanning (SMB)	Connect an SMB volume — does it see and scan it?
3.2	System area access with FDA	With FDA granted, does it show ~/Library and /Library?
3.3	Filtering by type and size	Can you filter results by type or size?
3.4	Sorting (size, name, date, type)	Can you sort the results?
3.5	Color coding in visualization	Are types/folders color-coded?
3.6	Reveal in Finder / Copy path	Can you open the file in Finder or copy its path?
3.7	Quick Look preview	Does a space bar or a button trigger a preview?
3.8	Move to… / Compress (alternatives)	Can you move to another folder or compress?
3.9	Undo and removal history	Is there an undo action or a log of removed files?
3.10	Scan exclusions (exclude list)	Can you specify folders or patterns to skip?
3.11	Priority settings / background scanning	Is there a background mode or priority setting?
3.12	Parallel scanning	Can it scan multiple volumes simultaneously?
3.13	Display customization (grouping, etc.)	Can you group by type or customize columns?
3.14	Logical / physical size toggle	Is there a toggle between logical and physical size?
3.15	Export results (CSV, JSON, PDF)	Can you export scan results?
3.16	Save/load scan results	Are results saved between sessions?
3.17	CLI / AppleScript / Automation	Is automation supported?
3.18	Keyboard navigation and shortcuts	Keyboard navigation, standard macOS shortcuts?
3.19	VoiceOver / accessibility	Is VoiceOver supported for visually impaired users?
3.20	Dark mode / System Appearance	System Settings → Appearance → Dark — does it switch?
3.21	Localization + macOS folder names	3+ languages, “Desktop” vs localized name?
3.22	Requests only necessary permissions	Only FDA, no Contacts/Calendar/Camera?
3.23	Stability on SMB disconnect	Is there no crash when the network connection drops?

Test preparation

Note: The tools you’re testing may remove files, damage the file system, or break macOS. Before you start, create a full system backup (Time Machine or equivalent). Don’t test on a production machine with real data.

Step 1Set up your test environment

Create a new macOS user or use a fresh session.
Quit all unnecessary apps.
Turn off Time Machine for the duration of the test.
Open Activity Monitor to track resources.

Step 2Make a note of the disk state

macOS version: _______________
Processor: Intel / Apple Silicon
RAM: _______________
Disk type and capacity: _______________
diskutil info / → available: _____ / purgeable: _____
tmutil listlocalsnapshots / → snapshots: _____
iCloud optimization: Yes / No

Step 3Install the tool

Download and install the tool to be tested

Make sure it is signed:

codesign -dv --verbose=2 /path/to/app.app

Copy

Grant Full Disk Access: System Settings → Privacy & Security → Full Disk Access
Make a note of: name, version, source (App Store / website)

Fair test rules

Test under the same conditions for all tools
Run 3 scans, make a note of the median time
Before each run:
```
sudo purge
```
Copy
(flush file cache), wait 30 seconds
Make sure Spotlight isn’t indexing the test folder
Don’t run resource-heavy tasks during the test

Test datasets

Create a folder

~/Disk_Space_Analyzer_Test_Files/

Copy

and prepare 4 datasets:

Dataset	Contents	Expected result	What it tests
01 Basic	~2,800 files, ~4.6 GB (documents, photos, videos, caches)	Folder sizes ±5% of ground truth	Accuracy, visualization, operations
02 External	~250 files, ~500 MB (copied to an external volume)	Sizes ±5%, hidden files visible	Source coverage, non-APFS
03 APFS Traps	~40 files, ~2.5 GB logical / ~350 MB physical	Clones, hard links, sparse — correct	APFS edge cases
04 Stress	~120,000 files, ~200 MB	All files found, no crashes	Performance, stability

Dataset 03: APFS traps (in detail)

This is the key dataset to test accuracy:

Trap	What’s created	Expected
APFS clones	`cp -c` Copy files of 100 MB each	Logical 400 MB, physical ~100 MB
Hard links	Multiple names → same data	Don’t count twice
Sparse files	Logical size > physical	Show actual on-disk size
Symlinks	Links to files and folders	Don’t follow, don’t count twice
Mixed folder	Clones + regular files	Correct total

Important!

du, stat, and Finder DON’T see APFS clone sharing — they all show the logical size. This is standard macOS behavior, not a bug. Most tools will show the logical size — that’s okay. The actual physical size can be measured via diskutil info / (available space difference).

Performance baseline (SSD + Apple Silicon)

Dataset	Excellent	Acceptable	Unacceptable
01 Basic (~4.6 GB, ~2,800 files)	< 30 sec	< 90 sec	> 180 sec
02 External (~500 MB)	< 15 sec	< 45 sec	> 90 sec
03 APFS Traps (~40 files)	< 5 sec	< 15 sec	> 30 sec
04 Stress (~120K files)	< 3 min	< 10 min	> 20 min

For Intel Macs, multiplying these values by 1.5–2x is acceptable.

Memory (RAM)

Level	Usage
Excellent	< 500 MB
Acceptable	500 MB – 1 GB
Borderline	1 – 2 GB
Unacceptable	> 2 GB or crash

False positives and common testing mistakes.

False positives

Not a red flag, but a serious downside:

Category	Error
False inflation	Counts iCloud cloud-only files as locally occupied space
False inflation	Sums up APFS clones as separate files (doubling/tripling)
False inflation	Counts hard links multiple times
False inflation	Follows symlinks and counts target files twice
False recommendations	Suggests removing system caches that macOS will recreate
False recommendations	Labels Application Support files as “junk”
Misleading	Shows “X GB available” without taking purgeable space into account
Misleading	After removal, shows “X GB freed up,” but diskutil says otherwise

Common testing mistakes

Comparing App Store and non-App Store versions without taking the Sandbox into account
Testing without Full Disk Access (incomplete results)
One scan instead of 2 (scan times vary)
Not to make a note of the disk state before the test (baseline)
Not to turn off Time Machine (snapshots affect measurements)

Testing procedure

Testing follows phases — groups of actions — not the criteria number order. Each phase covers specific criteria. All 54 criteria are distributed across 7 phases (0–6) with no gaps.

Preparation (Criteria 1.8, 1.9)

0.1Signature check and installation

Why it matters: Unsigned tools, which Apple hasn’t verified, may contain malicious code. Bundleware during installation is a sign of a shady developer.

How to check:

codesign -dv --verbose=2 /path/to/app.app

Copy

— signed?

spctl --assess --verbose /path/to/app.app

Copy

— notarized?

Install the tool — any extra offers or ad pop-ups?
Check /Applications after installation — anything extra show up?

1.8: ✅ Signed and notarized / ❌ Unsigned

1.9: ✅ Clean install / ❌ Third-party software during installation

0.2Monetization (transparency)

Check before you start testing:

Question	Answer
Are the free version limitations clear BEFORE you start?	✅ Yes / ❌ No
Is the price visible before you buy?	✅ Yes / ❌ No
No “trap”: scans for free, but charges for any further action without warning?	✅ Yes / ❌ No

Scanning and performance (Criteria 1.2, 2.6–2.10)

1.1–1.4Scanning test datasets

Why it matters: The tool should run stable and meet a reasonable deadline on all data types — from basic to stress.

For each dataset (Dataset 01–04):

Select the dataset folder in the analyzer and start the scan
Note down the time from start to finish (3 scans, the median)
Note down peak RAM from Activity Monitor
Note down whether there were any crashes or freezes.

Fill in the summary table:

Dataset	Time (sec)	Baseline	RAM (MB)	Crash
01 Basic				✅ Yes / ❌ No
02 External				✅ Yes / ❌ No
03 APFS Traps				✅ Yes / ❌ No
04 Stress				✅ Yes / ❌ No

1.2: ✅ No crash/freeze on any dataset / ❌ Crash on any dataset

2.6: ✅ Time within baseline / ❌ Time outside baseline

2.7: ✅ RAM < 1 GB / ❌ > 1 GB

1.5UI responsiveness

Why it matters: You should be able to see progress and interact with the tool during a long scan.

For each dataset (Dataset 01–04):

Check when scanning a Dataset 04: is there a progress bar, percentage, or file number?
Does it get stuck at 99%?
Can you click the interface without the beach ball showing up?

2.8: ✅ Informative progress shown / ❌ No progress

2.10: ✅ UI stays responsive / ❌ Beach ball or freeze

1.6Pause/cancel scan

Why it matters: You should be able to stop or pause a long scan.

Start scanning Dataset 04
Is there a Pause button? Does it work?
Is there a Cancel/Stop button? Does it work?
After canceling, can you see partial results?

2.9: ✅ Pause + cancel / ⚠️ Cancel only / ❌ Nothing

Analysis accuracy (Criteria 1.1, 2.11–2.13, 2.20, 2.21)

2.1Ground truth — Dataset 01CRITICAL

Why it matters: This is the main accuracy test. Folder sizes should match the known ground truth.

How to check:

Select Dataset_01_BasicDisk/ in the analyzer
For each top-level folder, compare the reported size with ground_truth.json

Folder Ground truth Analyzer Difference ±5%?

Documents/

Photos/

Videos/

Applications/

Developer/

Caches/

TOTAL
Does the total value match the sum of subfolders?

1.1: ✅ All folders ±5% / ❌ At least one > ±5%

2.2Ground truth — Dataset 02, external volume

Why it matters: The app should work correctly not just with APFS, but with ExFAT/HFS+ volumes too.

How to check:

Copy Dataset 02 to an external ExFAT or HFS+ volume
Select the external volume in the analyzer
Compare sizes with ground_truth.json
Does it see hidden files (.dotfiles)? Does it display Unicode correctly?

2.11: ✅ ±5% and sees everything / ⚠️ ±10% / ❌ > ±10% or missing items

2.3APFS traps — Dataset 03

Why it matters: APFS has quirks (clones, sparse files, hard links) where logical and physical sizes can differ. Finder and du also show logical size — that’s okay. But a good analyzer should explain the difference or offer switching between logical and physical size.

For each trap, note down the reported size and compare it with ground truth:

Trap	Logical	Rating
APFS clones		Excellent / Good / Acceptable / Poor
Hard links
Sparse files
Symlinks	Don’t count twice	Correct / Double-counted
Mixed folder

Scale:

Excellent — the tool shows physical size, explains the difference
Good — the tool shows logical size, but there’s a logical/physical toggle
Acceptable — the tool shows logical size, it matches Finder’s “Get Info”
Poor — the tool shows its own numbers that don’t match Finder or du
Fail — crash, lag, absurd values

2.12: ✅ Physical size or explanation / ⚠️ Logical size as in Finder / ❌ Own numbers or crash

2.20: ✅ Symlinks handled correctly (no loops, no double-counting) / ❌ Incorrect

2.4Stress — Dataset 04

Why it matters: With a large number of files, the tool should maintain accuracy and correctly handle Unicode, special characters, and long paths.

Does the total file number match ground truth (~120,000)?
Does the total size match (~200 MB)?
Are all 1,000 subdirectories displayed?
Does it reach the 100th nesting level?
Are files with long paths (~900 characters) found?
Are files with special characters/emoji displayed without artifacts?

2.13: ✅ Everything matches / ⚠️ Minor deviations / ❌ Major gaps

2.21: ✅ Handles Unicode correctly / ❌ No

Operation safety (Criteria 1.3–1.6, 2.18, 2.19, 2.22)

Use the Dataset 01 scan results.

3.1Confirmation before removalCRITICAL

Why it matters: Accidental removal without confirmation is a direct threat to user data.

Select a file, click Remove
Does a confirmation appear? Does it show number, size, and destination?

1.3: ✅ Confirmation appears / ❌ The tool removes files immediately

3.2Removal to TrashCRITICAL

Why it matters: Moving to Trash is your safety net against mistakes. Permanent removal by default is unacceptable.

Remove a file → check Trash
Default behavior: Trash or Permanent?

1.4: ✅ Trash by default / ❌ Permanent or no Trash option

3.3Removal matches selectionCRITICAL

Why it matters: Removing the wrong files is the most dangerous mistake an analyzer can make.

Select 3 files, keep in mind the names
Remove them → check in Finder: only those 3 removed? Everything else untouched?
Restore from Trash — do the files open?

1.5: ✅ Exactly the selected files removed / ❌ Mismatch

3.4System file protectionCRITICAL

Why it matters: Removing files from ~/Library/Preferences/ or /System can break macOS.

Scan the boot volume
Try to remove a file from ~/Library/Preferences/
Does the tool block, warn, or allow it without warning?

1.6: ✅ The tool blocks or warns / ❌ The tool allows removal without warning

3.5File selection

Why it matters: You should be able to select multiple items and see their total size before removal.

Can you select multiple items (multi-select)?
Can you see the total size of the selected items before confirmation?

2.18: ✅ Multi-select available / ❌ No

2.19: ✅ I can see the total size / ❌ No

3.6Removal error handling

Why it matters: When an error occurs (file locked, no access), the tool should make it clear and proceed with the rest.

Lock a file (Get Info → Locked)
Try to remove it through the tool
Is the message clear? Does it proceed with the remaining files?

2.22: ✅ Clear errors + proceeds / ❌ Crash or unclear

Visualization, navigation, and UI (Criteria 2.14–2.17, 3.3–3.7)

Use the scan results from Dataset 01 and Dataset 04.

4.1Visualization

Why it matters: Visualization lets you quickly see what’s taking up space. Color coding helps distinguish data types.

Available modes: treemap, sunburst, list, bar chart?
Are types/folders distinguished by color?
Does it work with a large number of items (Dataset 04)?

2.14: ✅ 2+ modes / ⚠️ 1 mode / ❌ Text list only

3.5: ✅ Color coding available / ❌ No

4.2Folder drill-down navigation

Why it matters: Drill-down is the main way to find what’s specifically eating up space.

Click a folder — can you drill down into it?
Is there back navigation / breadcrumbs?
Do proportions update when entering a subfolder?
Does it work at 100-level depth (Dataset 04)?

2.15: ✅ Drill-down with updated proportions / ❌ No

4.3Information on items

Why it matters: To decide whether to remove something, you need to see its size, path, and date.

Does each item show: size, full path, date modified?

2.16: ✅ All three (size + path + date) / ⚠️ Some of them / ❌ None

4.4Search

Why it matters: With thousands of files, you need a search to find specific items quickly.

Is there a search by file/folder name in the results?
Does the search return correct results?

2.17: ✅ Search available / ❌ No

4.5Filters and sorting

Filtering: by type (photos, videos, documents)? By size (> 100 MB)?
Sorting: by size, name, date, type?

3.3: ✅ Filtering available / ❌ No

3.4: ✅ Sorting available / ❌ No

4.6System integration

Reveal in Finder / Copy path?
Quick Look (Space bar or button)?

3.6: ✅ Reveal in Finder or Copy path available / ❌ No

3.7: ✅ Quick Look works / ❌ No

Source coverage and permissions (Criteria 2.1–2.5, 1.7, 3.1, 3.2, 3.22, 3.23)

5.1Boot APFS volume

Why it matters: The main disk is the primary analysis target for the vast majority of users.

Can the tool scan the boot volume (/)?
Does it show the correct total size?

2.1: ✅ The tool scans correctly / ❌ No

5.2External volumes

Why it matters: Users often store data on external drives with file systems other than APFS.

Connect an ExFAT volume — does it see and scan it?
Connect an HFS+ volume — does it see and scan it?

2.2: ✅ ExFAT + HFS+ / ⚠️ Only one / ❌ Neither

5.3Volume/folder selection

Does it show a list of available volumes?
Can you scan a specific folder (not the whole volume)?

2.3: ✅ Volume and folder selection available / ❌ No

5.4iCloud Drive

Why it matters: If the tool treats cloud-only files as locally occupied space, you get an inflated result.

Does it distinguish between cloud-stored and locally downloaded files?
Doesn’t it count cloud-only files as disk space used?
Does it work correctly with Desktop & Documents under iCloud sync?

2.4: ✅ The tool distinguishes and counts them correctly / ⚠️ Partially / ❌ No

5.5Full Disk Access

Why it matters: Without FDA, the tool can’t see ~/Library, Mail, Safari, and other protected areas. You need to understand why the results are incomplete.

Revoke FDA, restart the tool
Scan the boot volume
Behavior: clear warning + instructions? Or crash/silence?
Important: re-enable FDA after the test!

2.5: ✅ The tool warns + explains / ❌ Crash or silence

5.6Network drives

Connect an SMB volume — does it see and scan it?
Disconnect the network during a scan — crash or graceful exit?

3.1: ✅ The tool scans SMB / ❌ No

3.23: ✅ It is stable when disconnecting / ❌ Crash

5.7System areas

With the FDA, does it show
```
~/Library
```
Copy
and
```
/Library
```
Copy
in the results?

3.2: ✅ The tool shows them / ❌ No

5.8Network activityCRITICAL

Why it matters: A disk space analyzer can see ALL your files. Transmitting that data is a serious privacy violation.

Little Snitch or a firewall: what connections does it make? What does it send?
Is there a privacy policy? Does it describe what data is collected?
Can it be used completely offline?

1.7: ✅ No data leaks / ❌ It sends files/paths

5.9Permissions

Does it only request FDA, or also Contacts/Calendar/Camera?

3.22: ✅ Only necessary permissions / ❌ Unnecessary permissions

Bonus features (Criteria 3.8–3.21)

For each item: feature is available and works → pass, otherwise → fail.

6.1Alternative actions (3.8)

Is there Move to… or Compress?

6.2Undo and history (3.9)

Is there an undo for the last action or a removal log?

6.3Scan exclusions (3.10)

Can you specify folders or patterns (*.log, .DS_Store) to skip?

6.4Priority and background scanning (3.11)

Is there a background mode or priority setting (low / background)?

6.5Parallel scanning (3.12)

Can it scan multiple volumes at the same time?

6.6Display customization (3.13)

Grouping by type, customizable columns?

6.7Logical / Physical size (3.14)

Is there a toggle between logical and physical size? Test with Dataset 03 (APFS clones).

6.8Export results (3.15)

Can you export to CSV, JSON, PDF, or text?

6.9Scan saving (3.16)

Quit and reopen the tool — are the results still there?

6.10CLI / automation (3.17)

Is there CLI, AppleScript, or Shortcuts support?

6.11Keyboard navigation (3.18)

Keyboard navigation, hotkeys (⌘A, ⌘C, ⌘Delete)?

6.12VoiceOver / accessibility (3.19)

Cmd+F5, try navigating — are the main items announced?

6.13Dark mode (3.20)

System Settings → Appearance → Dark — does it switch?

6.14Localization (3.21)

3+ languages? Does the tool correctly display localized folder names vs “Desktop”?

Scoring formula

L1 Level 1 (×3)

#	Criterion	Result
1.1	Basic analysis accuracy (±5%)
1.2	Stability (no crashes on Dataset 01)
1.3	Confirmation before removal
1.4	Removal to Trash by default
1.5	Removal matches selection
1.6	System file protection
1.7	The tool doesn’t send data to external servers
1.8	Signed and notarized by Apple
1.9	The tool doesn’t install third-party software
	L1 total:	___ / 9
	L1 points:	___ × 3 = ___ / 27

If any L1 criterion fails → the tool is automatically “Not recommended.”

L2 Level 2 (×2)

Calculate the number of passes from criteria 2.1–2.22:

#	Criterion	Result
	L2 total:	___ / 22
	L2 points:	___ × 2 = ___ / 44

Note: ⚠️ counts as 0.5 points

L3 Level 3 (×1)

Calculate the number of passes from criteria 3.1–3.23:

#	Criterion	Result
	L3 total:	___ / 23
	L3 points:	___ × 1 = ___ / 23

Total score

L1: ___ + L2: ___ + L3: ___ = TOTAL: ___ / 94

Report template

================================================================
         DISK SPACE ANALYZER EVALUATION REPORT
================================================================
TOOL
  Name: ______________________
  Version: ______________________
  Source: App Store / Website
TEST ENVIRONMENT
  macOS: ______________________
  Processor: Intel / Apple Silicon
  RAM: ______________________
  Disk type and capacity: ______________________
  Test date: ______________________
________________________________________________________________
RESULTS BY LEVEL
________________________________________________________________
  Level 1 (Must Have):     ___ / 9   ×3  =  ___ / 27
  Level 2 (Should Have):   ___ / 22  ×2  =  ___ / 44
  Level 3 (Nice to Have):  ___ / 23  ×1  =  ___ / 23
________________________________________________________________
  TOTAL:                                     ___ / 94
________________________________________________________________
PERFORMANCE
________________________________________________________________
  01 Basic:    _____ sec / _____ MB RAM
  02 External: _____ sec / _____ MB RAM
  03 APFS:     _____ sec / _____ MB RAM
  04 Stress:   _____ sec / _____ MB RAM
________________________________________________________________
RECOMMENDATION
________________________________________________________________
  [ ] RECOMMENDED (73–94 points, no L1 failures)
  [ ] WITH CAVEATS (47–72 points)
  [ ] NOT RECOMMENDED (<47 points or L1 failure) 
________________________________________________________________
NOTES
________________________________________________________________
Strengths: ________________________________________
Key weaknesses: _____________________________________
Blockers: ____________________________________
================================================================
Methodology: Nektony v1.1.2 | March 2026 | CC BY 4.0
================================================================

Evaluation calculator

We’ve built an interactive calculator that automates the entire process:

→ Open the evaluation calculator

The calculator will:

Walk you through all 54 criteria
Calculate the score automatically
Show the recommendation
Save your progress (you can come back later)
Export results to a file

Download the full methodology

For printing and offline use:

📄 Download methodology v1.1.2 (PDF)

📦 Download DSA_QuickTest.zip

A ready-made minimal dataset for quick testing without scripts.

Includes:

All 54 criteria with detailed instructions
Test dataset creation scripts
Final report template
CSV template for results

Frequently asked questions

Why do you publish this methodology if you develop a disk space analyzer yourself?

We believe transparency builds trust. A transparent methodology allows anyone to verify our statements. If our product is good, it will show. If not, we’ll learn what needs to be improved.

Why is the maximum 94 points and not 100?

This is a result of the formula: (9×3) + (22×2) + (23×1) = 27 + 44 + 23 = 94. We didn’t want to round it up artificially. That would break the link between the number of criteria and their weights.

Why are APFS traps a separate test dataset?

Because APFS clones, sparse files, and purgeable space create a gap between logical and physical size. Even Finder and du show the logical size — that’s standard macOS behavior. A separate dataset allows you to check whether the analyzer explains this difference to users, rather than penalizing it for behaving like Finder.

Can this methodology be used for other categories of tools?

In part. The removal safety criteria (L1) and UX criteria (L3) apply to any cleaning utilities. The specific criteria (APFS clones, purgeable space, Dataset 03) apply only to disk space analyzers.

Can I suggest a new criterion?

Yes! Email us at support@nektony.com. We’ll consider your suggestion for the next version.

Disclaimer

This methodology is provided “as is” for reference only.

Limitations

Test results may vary depending on the tool version, macOS version, hardware configuration, and test data.
The methodology is not exhaustive and does not cover all possible use cases.
Scores and recommendations are advisory in nature.

Testing risks

The tools you’re testing may remove user files, damage the file system, or break macOS.
A malfunctioning test tool can lead to irreversible data loss.
Testing on a production machine with real data is risky. Use a dedicated test environment or a backup (Time Machine).

Liability

Nektony shall not be liable for any damage resulting from testing under this methodology, including but not limited to: data loss, system damage, disruption of tools or hardware.
The user shall be solely liable for choosing their software, testing it, and for the consequences of using it.
Before you start testing, we strongly recommend creating a full system backup (Time Machine or equivalent).

Conflict of interest

Nektony develops a disk space analyzer — a product in the category being tested. We publish this methodology transparently and apply it to our own products. Independent verification of results is welcome.

License

This methodology is licensed under Creative Commons Attribution 4.0 (CC BY 4.0).

You are free to:

Copy and redistribute the material in any medium.
Adapt and create derivative works.
Use it for commercial purposes.

Full license text: creativecommons.org/licenses/by/4.0/

Folder	Ground truth	Analyzer	Difference	±5%?
Documents/
Photos/
Videos/
Applications/
Developer/
Caches/
TOTAL

How to choose a disk space analyzer for Mac: A transparent evaluation methodology

TL;DR

Why this methodology exists

Who this methodology is for

Core principle

Terms and definitions

Scoring system

Scoring formula

How to interpret the result

Red flags

Evaluation criteria

Level 1: Must Have (blockers)

Level 2: Should Have (important)

Data source coverage

Performance

Accuracy

Visualization and navigation

Operations on results

Reliability

Level 3: Nice to Have (bonuses)

Test preparation

Test datasets

Testing procedure

Preparation (Criteria 1.8, 1.9)

Scanning and performance (Criteria 1.2, 2.6–2.10)

Analysis accuracy (Criteria 1.1, 2.11–2.13, 2.20, 2.21)

Operation safety (Criteria 1.3–1.6, 2.18, 2.19, 2.22)

Visualization, navigation, and UI (Criteria 2.14–2.17, 3.3–3.7)

Source coverage and permissions (Criteria 2.1–2.5, 1.7, 3.1, 3.2, 3.22, 3.23)

Bonus features (Criteria 3.8–3.21)

Scoring formula

L1 Level 1 (×3)

L2 Level 2 (×2)

L3 Level 3 (×1)

Total score

Report template

Evaluation calculator

Download the full methodology

Disclaimer

Limitations

Testing risks

Liability

Conflict of interest

License

Related materials