How Pickrack tests tools

Every comparison and best-of article on Pickrack is grounded in real testing. This page documents the exact methodology so you can verify our claims — or replicate them yourself.

TL;DR

Real-world tasks (not synthetic benchmarks). 5-10 candidates per category. Same input file across all tools. Grade on output quality, free-tier limits, signup friction, watermarks, and edge cases. Output files and test inputs published on GitHub. Ranking ignores affiliate revenue — we explicitly note when our top pick has no affiliate program at all.

Why we don't use synthetic benchmarks

A common practice in tool-review content is "benchmark a 1 MB perfectly-formed test PDF." This kind of testing makes every tool look identical because every tool handles the easy case. The interesting differences only show up on the messy real-world files most people actually have: a 12 MB scanned contract with mixed orientation, a 45 MB PowerPoint export with embedded video previews, a HEIC photo straight off an iPhone 15.

For every comparison post on Pickrack, we define a real-world task first, find or construct the kind of file you'd actually need to process for that task, and then run every candidate tool against it. The grading rubric is calibrated against that real task, not against an abstract benchmark.

Our grading rubric

Each tool we review is scored on six dimensions. Not every dimension applies to every tool category, but the ones that apply are scored consistently across competitors.

1. Output quality

Does the result match what you actually need? For a PDF compressor: is the text still readable? For an image converter: is there visible banding or color shift? Subjective but consistent — same reviewer, same monitor, same time of day.

2. Free-tier limits

Daily quota, file-size cap, file-count cap. We test until we hit the limit so we know exactly where it bites. "Free with limits" counts; "free trial" (i.e., paid in disguise) does not.

3. Signup friction

Email required? Account required for download? Captcha? OAuth dance? Anything that interrupts the "upload → process → download" flow is friction and is graded down.

4. Watermarks & branding

Watermarked output, brand stamps, "upgrade to remove" nag screens — these immediately drop a tool to the bottom tier no matter how good the conversion is.

5. Performance

Wall-clock time from file upload to download-ready. Measured on a clean Chrome window, no cache, residential US broadband (~500 Mbps). Mobile timing tested on a 3-year-old mid-range Android over LTE.

6. Edge-case behavior

What happens with a corrupt file? Encrypted PDF? File too large? Unicode filename with Vietnamese diacritics? We deliberately try to break each tool to surface failure modes the marketing pages won't tell you.
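
As an illustration of how inputs like these could be prepared, here is a minimal Python sketch that derives a truncated (corrupt) copy and two Unicode-named copies from one known-good file. The function name, the filenames, and the choice to test both NFC and NFD normalization are assumptions made for this example, not a description of the scripts Pickrack actually runs.

```python
import tempfile
import unicodedata
from pathlib import Path


def make_edge_case_fixtures(source: bytes, out_dir: Path) -> dict[str, Path]:
    """Derive a few deliberately awkward inputs from one known-good file."""
    out_dir.mkdir(parents=True, exist_ok=True)
    fixtures = {}

    # Truncated copy: keeps the header but cuts the body, mimicking a broken upload.
    corrupt = out_dir / "corrupt-truncated.pdf"
    corrupt.write_bytes(source[: len(source) // 2])
    fixtures["corrupt"] = corrupt

    # Same bytes under a filename with Vietnamese diacritics, once precomposed (NFC)
    # and once decomposed (NFD), because a server may handle one form but not the other.
    name = "hợp-đồng-thử-nghiệm.pdf"  # hypothetical filename
    for form in ("NFC", "NFD"):
        path = out_dir / unicodedata.normalize(form, f"{form.lower()}-{name}")
        path.write_bytes(source)
        fixtures[f"unicode_{form.lower()}"] = path

    return fixtures


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        made = make_edge_case_fixtures(b"%PDF-1.4\n" + b"0" * 200, Path(tmp))
        for label, path in made.items():
            print(label, path.name, path.stat().st_size)
```

Encrypted and oversized inputs are left out of the sketch because they depend on the file type and on each tool's stated limits.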

How rankings are decided

Within each comparison post, tools are ranked by their real usefulness for the defined task. This often differs from a simple sum of the six rubric scores. A tool with mediocre output quality but a generous free tier and no signup might beat a slightly-better-quality tool that watermarks every output. Context matters; we document the context for each ranking decision in the post itself.
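
To make the difference between "simple sum" and "usefulness for the task" concrete, the sketch below models one possible approach in Python: a hard gate for watermarks, followed by task-specific weights. The class, the 1-5 scale, the tool names, and the weights are all hypothetical. Actual rankings on Pickrack are editorial judgments documented in each post, not the output of a formula like this.

```python
from dataclasses import dataclass


@dataclass
class ToolScore:
    name: str
    output_quality: int   # 1-5, higher is better
    free_tier: int        # 1-5, higher is more generous
    signup_friction: int  # 1-5, 5 means no signup at all
    watermarked: bool


def rank(tools, weights):
    """Sort tools best-first; watermark-free tools always outrank watermarked ones."""
    def key(t):
        weighted = (
            weights["output_quality"] * t.output_quality
            + weights["free_tier"] * t.free_tier
            + weights["signup_friction"] * t.signup_friction
        )
        # Tuple sort: the watermark flag is compared before the weighted score.
        return (t.watermarked, -weighted)

    return sorted(tools, key=key)


tools = [
    ToolScore("sharp-but-stamped", output_quality=5, free_tier=3, signup_friction=3, watermarked=True),
    ToolScore("plain-and-open", output_quality=3, free_tier=5, signup_friction=5, watermarked=False),
    ToolScore("crisp-with-signup", output_quality=4, free_tier=3, signup_friction=2, watermarked=False),
]

# Hypothetical weights for a quick one-off task where friction matters most.
quick_task = {"output_quality": 1.0, "free_tier": 1.5, "signup_friction": 2.0}
ranking = [t.name for t in rank(tools, quick_task)]
print(ranking)
```

With these made-up numbers, an unweighted sum would place the watermarked tool second (11 points against 13 and 9), while the gated, weighted ordering places it last.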

Tools we recommend most strongly are often free or open-source with no affiliate program at all. If a paid tool earns a top spot, the post explains exactly which task it beats free alternatives on, and which tasks free alternatives still win. We explicitly flag affiliate links at the top of any post that contains them — see the Affiliate Disclosure.

Test environment

For reproducibility, here is the exact environment used for the bulk of tests on this site:

Primary test machine: MacBook Pro M1 (8 GB RAM, macOS Sonoma)
Secondary test machine: Mid-range Windows 11 desktop (Ryzen 5, 16 GB RAM) — used for any test where Windows behavior differs (HEIC, font rendering, OS clipboard)
Browser: Chrome 140+ (default), Safari 18+ (for browser-side WebKit-specific tests), Firefox 130+ spot-checked
Network: Comcast residential gigabit (down ~500 Mbps measured, up ~25 Mbps) — typical US home broadband, not a data-center connection
Mobile: Google Pixel 7 on T-Mobile LTE, used for "works on phone" checks

Each test is run at least twice to rule out network jitter or transient server issues. Results that swing widely between runs are noted explicitly in the post.
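
A minimal Python sketch of the repeat-and-compare idea follows. The helper names and the 25% tolerance are assumptions chosen for illustration; the page does not define a numeric threshold for "swing widely", and browser-based tests are timed by hand rather than by a script like this.

```python
import statistics
import time


def time_runs(task, runs=2):
    """Return wall-clock durations in seconds for `runs` calls of `task`."""
    durations = []
    for _ in range(runs):
        start = time.perf_counter()
        task()
        durations.append(time.perf_counter() - start)
    return durations


def swings_widely(durations, tolerance=0.25):
    """True when slowest and fastest runs differ by more than `tolerance` of the median."""
    spread = max(durations) - min(durations)
    return spread > tolerance * statistics.median(durations)


if __name__ == "__main__":
    # Stand-in task; a real run would wrap the upload, process, and download steps.
    durations = time_runs(lambda: time.sleep(0.05), runs=2)
    print(durations, "unstable" if swings_widely(durations) else "stable")
```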

Test files we use

Where licensing allows, the input files we test against are publicly available on the Pickrack GitHub repository so you can verify our claims yourself or replicate the comparison on your machine.

github.com/pickrack/pickrack/tree/main/test-fixtures

Common test artifacts include:

contract-12mb-scanned.pdf — a 28-page scanned contract with mixed orientation, used for compress / OCR / rotate testing
iphone-photo-heic-4mb.heic — a fresh iPhone 15 photo, used for HEIC conversion tests
deck-45mb.pptx — a PowerPoint with embedded fonts and image-heavy slides, used for PPTX↔PDF tests
article-5kw.md — a 5,000-word markdown article with code blocks, tables, and footnotes, used for markdown rendering and AI summarizer tests
messy-csv.csv — a CSV with quoted commas, embedded newlines, and mixed encodings, used for translation and parsing tests
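
To show why quoted commas and embedded newlines make a useful parsing test, the Python sketch below compares naive splitting with the standard library's csv module. The sample string is a small stand-in invented for this example, not the contents of messy-csv.csv, and it does not cover the mixed-encoding case.

```python
import csv
import io

# Three logical rows: a header, a row with a quoted comma and an embedded newline,
# and a plain row.
SAMPLE = 'name,note\n"Nguyen, An","line one\nline two"\nBinh,plain\n'

# Naive approach: split on newlines, then on commas.
naive_rows = [line.split(",") for line in SAMPLE.splitlines()]

# csv.reader respects the quoting, so commas and newlines inside quotes stay in the field.
parsed_rows = list(csv.reader(io.StringIO(SAMPLE, newline="")))

print(len(naive_rows), "rows from naive splitting")
print(len(parsed_rows), "rows from csv.reader")
print(parsed_rows[1])
```

A tool that reports four rows, or three columns on the second row, is splitting naively rather than parsing the quoting rules.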

Conflicts of interest

We do not accept payment for positive reviews. We do not publish sponsored content disguised as editorial. We do not enter paid backlink exchanges. Every recommendation reflects what the reviewer would actually pick on a Tuesday afternoon for a real task.

Some links on this site are affiliate links — clicking through and completing a paid signup earns Pickrack a commission. These are always disclosed at the top of posts that contain them, and they do not influence the ranking. The full list of affiliate programs we participate in (and the ones we've specifically declined) is on the Affiliate Disclosure page.

Pickrack is the founder's own product. When we recommend Pickrack tools alongside competitors, we're explicitly biased and we acknowledge that bias in the post. Where competitors genuinely beat our own tools on a task, we say so.

Corrections and updates

If a tool we recommend changes behavior — new pricing, removed feature, new dark pattern — we update the post and bump the "Last reviewed" date. We do not silently revise; significant corrections are noted in a footer block on the affected post.

Found a factual error or an outdated detail? Three ways to flag it:

Open an issue at github.com/pickrack/pickrack/issues with the URL and what's wrong
Click Suggest an edit at the bottom of any blog post — opens GitHub's file editor for that exact MDX file
Email [email protected]

What we deliberately don't do

Never accept payment for a positive review or ranking adjustment

Never use AI to generate review verdicts — every conclusion is from a human running the actual tests

Never recommend a paid tool just because its affiliate program pays better

Never abandon old reviews — if a tool's situation changes, we update or retire the post

Never publish synthetic benchmarks as if they were real-world tests

Never link to tools we haven't personally used for the task in question

On AI assistance in writing

Drafts may use AI tools for grammar checking, fact verification (with manual confirmation), translation of technical references, and brainstorming structure. Every published article on Pickrack is written and edited by a human, with first-hand testing of the tools described. We do not generate review verdicts with AI. We do not use AI to fabricate test results we haven't actually run.

If you suspect a specific post was AI-generated and you're right, please flag it — that would be a quality lapse worth correcting. We use our own AI grammar checker on most posts as the final pass before publish, but the substance is always human.

Who runs the tests

Right now, everything on Pickrack is tested and written by founder David Pham. If that ever changes — guest contributors, hired editors — we will add their profiles to the authors page and disclose any change in editorial structure publicly.
