How Pickrack tests tools
Every comparison and best-of article on Pickrack is grounded in real testing. This page documents the exact methodology so you can verify our claims — or replicate them yourself.
TL;DR
Real-world tasks (not synthetic benchmarks). 5-10 candidates per category. Same input file across all tools. Grade on output quality, free-tier limits, signup friction, watermarks, and edge cases. Output files and test inputs published on GitHub. Ranking ignores affiliate revenue — we explicitly note when our top pick has no affiliate program at all.
Why we don't use synthetic benchmarks
A common practice in tool-review content is "benchmark a 1 MB perfectly-formed test PDF." This kind of testing makes every tool look identical because every tool handles the easy case. The interesting differences only show up on the messy real-world files most people actually have: a 12 MB scanned contract with mixed orientation, a 45 MB PowerPoint export with embedded video previews, a HEIC photo straight off an iPhone 15.
For every comparison post on Pickrack, we define a real-world taskfirst, find or construct the kind of file you'd actually need to process for that task, and then run every candidate tool against it. The grading rubric is calibrated against that real task, not against an abstract benchmark.
Our grading rubric
Each tool we review is scored on six dimensions. Not every dimension applies to every tool category, but the ones that apply are scored consistently across competitors.
1. Output quality
Does the result match what you actually need? For a PDF compressor: is the text still readable? For an image converter: is there visible banding or color shift? Subjective but consistent — same reviewer, same monitor, same time of day.
2. Free-tier limits
Daily quota, file-size cap, file-count cap. We test until we hit the limit so we know exactly where it bites. "Free with limits" counts; "free trial" (i.e., paid in disguise) does not.
3. Signup friction
Email required? Account required for download? Captcha? OAuth dance? Anything that interrupts the "upload → process → download" flow is friction and is graded down.
4. Watermarks & branding
Watermarked output, brand stamps, "upgrade to remove" nag screens — these immediately drop a tool to the bottom tier no matter how good the conversion is.
5. Performance
Wall-clock time from file upload to download-ready. Measured on a clean Chrome window, no cache, residential US broadband (~500 Mbps). Mobile timing tested on a 3-year-old mid-range Android over LTE.
6. Edge-case behavior
What happens with a corrupt file? Encrypted PDF? File too large? Unicode filename with Vietnamese diacritics? We deliberately try to break each tool to surface failure modes the marketing pages won't tell you.
How rankings are decided
Within each comparison post, tools are ranked by their real usefulness for the defined task. This often differs from a simple sum of the six rubric scores. A tool with mediocre output quality but a generous free tier and no signup might beat a slightly-better-quality tool that watermarks every output. Context matters; we document the context for each ranking decision in the post itself.
Tools we recommend most strongly are often free or open-source with no affiliate program at all. If a paid tool earns a top spot, the post explains exactly which task it beats free alternatives on, and which tasks free alternatives still win. We explicitly flag affiliate links at the top of any post that contains them — see the Affiliate Disclosure.
Test environment
For reproducibility, here is the exact environment used for the bulk of tests on this site:
- Primary test machine: MacBook Pro M1 (8 GB RAM, macOS Sonoma)
- Secondary test machine: Mid-range Windows 11 desktop (Ryzen 5, 16 GB RAM) — used for any test where Windows behavior differs (HEIC, font rendering, OS clipboard)
- Browser: Chrome 140+ (default), Safari 18+ (for browser-side WebKit-specific tests), Firefox 130+ spot-checked
- Network: Comcast residential gigabit (down ~500 Mbps measured, up ~25 Mbps) — typical US home broadband, not a data-center connection
- Mobile: Google Pixel 7 on T-Mobile LTE, used for "works on phone" checks
Each test is run at least twice to rule out network jitter or transient server issues. Results that swing widely between runs are noted explicitly in the post.
Test files we use
Where licensing allows, the input files we test against are publicly available on the Pickrack GitHub repository so you can verify our claims yourself or replicate the comparison on your machine.
github.com/pickrack/pickrack/tree/main/test-fixtures
Common test artifacts include:
contract-12mb-scanned.pdf— a 28-page scanned contract with mixed orientation, used for compress / OCR / rotate testingiphone-photo-heic-4mb.heic— a fresh iPhone 15 photo, used for HEIC conversion testsdeck-45mb.pptx— a PowerPoint with embedded fonts and image-heavy slides, used for PPTX↔PDF testsarticle-5kw.md— a 5,000-word markdown article with code blocks, tables, and footnotes, used for markdown rendering and AI summarizer testsmessy-csv.csv— a CSV with quoted commas, embedded newlines, and mixed encodings, used for translation and parsing tests
Conflicts of interest
We do not accept payment for positive reviews. We do not publish sponsored content disguised as editorial. We do not enter paid backlink exchanges. Every recommendation reflects what the reviewer would actually pick on a Tuesday afternoon for a real task.
Some links on this site are affiliate links — clicking through and completing a paid signup earns Pickrack a commission. These are always disclosed at the top of posts that contain them, and they do not influence the ranking. The full list of affiliate programs we participate in (and the ones we've specifically declined) is on the Affiliate Disclosure page.
Pickrackis the founder's own product. When we recommend Pickrack tools alongside competitors, we're explicitly biased and we acknowledge that bias in the post. Where competitors genuinely beat our own tools on a task, we say so.
Corrections and updates
If a tool we recommend changes behavior — new pricing, removed feature, new dark pattern — we update the post and bump the "Last reviewed" date. We do not silently revise; significant corrections are noted in a footer block on the affected post.
Found a factual error or an outdated detail? Three ways to flag it:
- Open an issue at github.com/pickrack/pickrack/issues with the URL and what's wrong
- Click Suggest an edit at the bottom of any blog post — opens GitHub's file editor for that exact MDX file
- Email [email protected]
What we deliberately don't do
Never accept payment for a positive review or ranking adjustment
Never use AI to generate review verdicts — every conclusion is from a human running the actual tests
Never recommend a paid tool just because its affiliate program pays better
Never abandon old reviews — if a tool's situation changes, we update or retire the post
Never publish synthetic benchmarks as if they were real-world tests
Never link to tools we haven't personally used for the task in question
On AI assistance in writing
Drafts may use AI tools for grammar checking, fact verification (with manual confirmation), translation of technical references, and brainstorming structure. Every published article on Pickrackis written and edited by a human, with first-hand testing of the tools described. We do not generate review verdicts with AI. We do not use AI to fabricate test results we haven't actually run.
If you suspect a specific post was AI-generated and you're right, please flag it — that would be a quality lapse worth correcting. We use our own AI grammar checker on most posts as the final pass before publish, but the substance is always human.
Who runs the tests
Right now, everything on Pickrack is tested and written by founder David Pham. If that ever changes — guest contributors, hired editors — we will add their profiles to the authors page and disclose any change in editorial structure publicly.