How we work

Our methodology

Every score on StackArbiter comes from the same rubric, applied the same way, to every tool we test. Here's exactly how we do it — no black boxes.

6

Scoring axes

137

Tools tested

Repriced quarterly

0

Paid rankings

The process

How we test every tool

The same six steps, every time. No skipping, no reviewing from the brochure.

Step 01

We open a real account

We sign up using a real email, go through the actual onboarding, and run realistic workflows — invoicing, project setup, reporting. We never review a tool from its marketing page alone.

Step 02

We score on 6 axes

Each tool is graded 0–5 on six weighted criteria. The same rubric, the same questions, every time. We document our notes during testing so scores are traceable — not recalled from memory.

Step 03

We cross-check user data

Our hands-on score is validated against aggregated user reviews from G2, Capterra, and Trustpilot. Where our experience diverges significantly from user consensus, we investigate and note why.
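
The cross-check in this step can be pictured with a minimal sketch. It is illustrative only: it assumes the three platforms' ratings are on a 5-star scale, weights them equally, and uses a hypothetical divergence threshold of 1.5 points on the 10-point scale. None of those specifics are stated in our methodology.

```python
from statistics import mean

# Hypothetical threshold (10-point scale) for what counts as a
# "significant" divergence; an assumption, not a published figure.
DIVERGENCE_THRESHOLD = 1.5


def user_consensus(ratings_out_of_5: dict[str, float]) -> float:
    """Average the platform ratings (assumed 5-star) and rescale to 10 points."""
    return mean(ratings_out_of_5.values()) * 2


def needs_investigation(hands_on_score: float,
                        ratings_out_of_5: dict[str, float],
                        threshold: float = DIVERGENCE_THRESHOLD) -> bool:
    """True when the hands-on score and user consensus differ by the threshold or more."""
    return abs(hands_on_score - user_consensus(ratings_out_of_5)) >= threshold


# Made-up ratings, purely for illustration.
ratings = {"G2": 4.5, "Capterra": 4.4, "Trustpilot": 4.0}
print(needs_investigation(6.1, ratings))  # large gap, flagged for a closer look
print(needs_investigation(8.2, ratings))  # close to consensus, not flagged
```

A flag here only means "look closer"; the hands-on score is not adjusted automatically to match user ratings.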

Step 04

We name a winner

Every comparison ends with one named pick for a specific use case. If we genuinely can't separate two tools, we say so explicitly — we don't hide behind "it depends" and call it done.

Step 05

We verify pricing quarterly

Prices change. We re-check every pricing page every quarter and update the scores if value-for-money shifts materially. Each page shows the last verification date so you always know how fresh the data is.

Step 06

We publish corrections

When a tool changes significantly, when readers flag an error, or when a new competitor shifts our verdict — we update and note it publicly. Scores aren't set in stone.

The rubric

The 6 scoring criteria

Each axis is scored 0–5. The final score is a weighted average. Weights reflect how much each factor affects real-world value for a business user.

Setup & Onboarding

20%

How fast can a new user go from signup to a first meaningful output, such as an invoice sent, a project created, or a report run? Complexity and friction are penalized.

Time from signup to first useful action
Data import tools (CSV, from competitors)
Quality of onboarding wizard or in-app guidance
Complexity of initial configuration

Day-to-Day UX

25%

The heaviest-weighted axis. A tool you hate using on Tuesday will be abandoned by Thursday. We test the core daily workflows the product is built for.

Navigation clarity and information density
Mobile app quality (not just "mobile-friendly")
Speed of the most common actions
Consistency and polish across the interface

Feature Depth

20%

Does the tool do what it claims — and does it do it completely? We test edge cases, not just the happy path. Integrations and API quality count here.

Core feature completeness vs. category standards
Integration ecosystem (native and via Zapier/API)
Reporting and analytics depth
Handling of edge cases and complex workflows

Customer Support

15%

We test support channels directly — submitting real tickets and measuring response time, quality, and resolution. "24/7 support" claims are verified, not taken at face value.

Available channels (chat, phone, email, community)
Measured first-response time
Quality of the help documentation
Support availability by plan tier

Price-to-Value

12%

Not "is it cheap" but "is what you get worth what you pay." Scored relative to the category median. A $200/month tool can score higher than a $20/month tool if it delivers proportionate value.

Features per dollar vs. category median
Free tier or trial generosity
Pricing transparency (no hidden fees)
Scaling cost as team or usage grows

Data Portability

8%

Can you leave? Vendor lock-in is a real cost. We test data exports on day one — before we're emotionally invested. A tool that traps your data loses points regardless of everything else.

Export completeness (all data, not just summaries)
Standard formats (CSV, JSON, industry-specific)
Migration path to common competitors
Account deletion and data deletion process

Score formula

How the final number is calculated

Each axis score (0–5) is multiplied by its weight, summed, then scaled to a 10-point final score. The formula is the same for every tool in every category.

Axis                 Weight distribution                   Weight

Day-to-Day UX        Heaviest: this is what you live in    25%
Setup & Onboarding   First impression shapes retention     20%
Feature Depth        Does it actually do what it claims    20%
Customer Support     When things break, who answers        15%
Price-to-Value       Relative to category median           12%
Data Portability     Can you leave without pain            8%
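
The calculation described above can be sketched in a few lines. This is a minimal illustration under stated assumptions: the Data Portability weight of 8% is taken as the remainder that brings the six weights to 100%, the scaling from the 0–5 range to the 10-point score is assumed to be a simple multiplication by 2, and rounding to one decimal place is our choice for the example. The function name `final_score` and the sample scores are hypothetical.

```python
# Axis weights from the rubric. The 8% for Data Portability is inferred
# as the remainder that makes the weights sum to 100% (an assumption).
WEIGHTS = {
    "Day-to-Day UX": 0.25,
    "Setup & Onboarding": 0.20,
    "Feature Depth": 0.20,
    "Customer Support": 0.15,
    "Price-to-Value": 0.12,
    "Data Portability": 0.08,
}


def final_score(axis_scores: dict[str, float]) -> float:
    """Weighted average of 0-to-5 axis scores, scaled to a 10-point score."""
    if set(axis_scores) != set(WEIGHTS):
        raise ValueError("scores must cover exactly the six rubric axes")
    for axis, score in axis_scores.items():
        if not 0 <= score <= 5:
            raise ValueError(f"{axis}: score {score} is outside 0 to 5")
    weighted = sum(WEIGHTS[axis] * score for axis, score in axis_scores.items())
    # Assumed linear scaling: a 0-to-5 average times 2 gives a 0-to-10 score.
    return round(weighted * 2, 1)


# Made-up axis scores for a hypothetical tool.
example = {
    "Day-to-Day UX": 4,
    "Setup & Onboarding": 3,
    "Feature Depth": 4,
    "Customer Support": 3,
    "Price-to-Value": 5,
    "Data Portability": 2,
}
print(final_score(example))  # weighted average 3.61 out of 5, so 7.2 out of 10
```

Because Day-to-Day UX carries a quarter of the weight, a one-point change on that axis moves the final score by 0.5, while the same change on Data Portability moves it by 0.16.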

Independence policy

What we will and won't do

Our affiliate relationships fund the site. Here's exactly where the line is.

What we do

Earn affiliate commission when readers sign up through some of our links
Disclose affiliate relationships clearly on every page where they exist
Apply the identical scoring rubric to all tools regardless of affiliate status
Rank tools by score — affiliate tools rank lower if they score lower
Update scores when tools improve or deteriorate, regardless of relationship
Accept correction requests from vendors if they include verifiable facts

What we never do

Accept payment to improve a tool's score or ranking position
Give a tool a higher score because it has a higher affiliate commission
Allow vendors to see or influence our scores before publication
Add "Editor's Choice" or similar labels in exchange for compensation
Remove negative findings from a review at a vendor's request
Recommend a tool we believe is genuinely worse for the reader's use case

Staying current

How we keep data fresh

A review written in 2023 and never updated is a liability, not an asset. Here's our update schedule.

Price checks

Every pricing page is re-verified every quarter. When a price has changed, we update the review immediately.

6mo

Full re-tests

Every tool gets a hands-on re-test every six months to catch UX and feature changes.

48h

Error corrections

Verified factual errors reported by readers or vendors are corrected within 48 hours.

Live

Date stamps

Every review shows the exact date prices and features were last verified.
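
The schedule above can be expressed as a small freshness check. This is a rough sketch, not a description of our internal tooling: "every quarter" is approximated as 91 days and "every six months" as 182 days, the 48-hour correction window is assumed to start when an error is verified, and all function names are hypothetical.

```python
from datetime import date, datetime, timedelta

# Approximations of the published schedule (assumptions for illustration).
PRICE_CHECK_INTERVAL = timedelta(days=91)   # "every quarter"
RETEST_INTERVAL = timedelta(days=182)       # "every six months"
CORRECTION_WINDOW = timedelta(hours=48)     # "within 48 hours"


def price_check_due(last_verified: date, today: date) -> bool:
    """True once a quarter (approximated) has passed since the last price check."""
    return today - last_verified >= PRICE_CHECK_INTERVAL


def retest_due(last_tested: date, today: date) -> bool:
    """True once six months (approximated) have passed since the last hands-on test."""
    return today - last_tested >= RETEST_INTERVAL


def correction_deadline(verified_at: datetime) -> datetime:
    """Latest time a verified error should be corrected, counted from verification."""
    return verified_at + CORRECTION_WINDOW


today = date(2024, 5, 1)
print(price_check_due(date(2024, 1, 1), today))        # 121 days elapsed: True
print(retest_due(date(2024, 1, 1), today))             # 121 days elapsed: False
print(correction_deadline(datetime(2024, 3, 4, 9, 30)))
```

The date stamp on each review is what makes a check like this possible for readers too: compare the "last verified" date with today's date.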

Questions

Methodology FAQ

Do vendors pay to be listed on StackArbiter?

No. Any tool in the categories we cover can be reviewed. We choose what to cover based on market relevance and reader interest, not vendor relationships. We have affiliate partnerships with some tools, but those partnerships have zero influence on whether a tool is reviewed or how it scores.

What if a vendor disagrees with our score?

We welcome factual corrections. If a vendor believes we've made a factual error — a wrong price, a feature we missed, a capability we mischaracterized — they can contact us with documentation. We investigate and update if the correction is valid. We don't change scores based on a vendor's preference or commercial pressure.

How do you handle tools with no affiliate program?

We review them anyway and don't mark them differently. Scores are calculated identically. The only difference is that there's no affiliate link — we link to the tool's homepage directly instead. Our coverage decisions are not tied to whether we can earn from a tool.

Can I submit a tool for review?

Yes. Contact us via the About page. We prioritize tools with meaningful market presence in the categories we cover. Submitting a tool doesn't guarantee a review, and it has no influence on the score if we do review it.

Why are your scores sometimes different from G2 or Capterra?

G2 and Capterra aggregate user ratings, which capture broad sentiment but can be skewed by review campaigns and vary wildly by use case. Our scores reflect our specific rubric applied to a specific business profile (SMB B2B). A tool loved by enterprise users might score lower with us if it's genuinely harder to set up and more expensive for a 5-person team.