How we work

Our methodology

Every score on StackArbiter comes from the same rubric, applied the same way, to every tool we test. Here's exactly how we do it — no black boxes.

6
Scoring axes
137
Tools tested
Q
Quarterly repriced
0
Paid rankings

The process

How we test every tool

The same four steps, every time. No skipping, no reviewing from the brochure.

Step 01

We open a real account

We sign up using a real email, go through the actual onboarding, and run realistic workflows — invoicing, project setup, reporting. We never review a tool from its marketing page alone.

Step 02

We score on 6 axes

Each tool is graded 0–5 on six weighted criteria. The same rubric, the same questions, every time. We document our notes during testing so scores are traceable — not recalled from memory.

Step 03

We cross-check user data

Our hands-on score is validated against aggregated user reviews from G2, Capterra, and Trustpilot. Where our experience diverges significantly from user consensus, we investigate and note why.

Step 04

We name a winner

Every comparison ends with one named pick for a specific use case. If we genuinely can't separate two tools, we say so explicitly — we don't hide behind "it depends" and call it done.

Step 05

We verify pricing quarterly

Prices change. We re-check every pricing page every quarter and update the scores if value-for-money shifts materially. Each page shows the last verification date so you always know how fresh the data is.

Step 06

We publish corrections

When a tool changes significantly, when readers flag an error, or when a new competitor shifts our verdict — we update and note it publicly. Scores aren't set in stone.


The rubric

The 6 scoring criteria

Each axis is scored 0–5. The final score is a weighted average. Weights reflect how much each factor affects real-world value for a business user.

01

Setup & Onboarding

20%

How fast can a new user go from signup to first meaningful output — invoice sent, project created, report run. Complexity and friction are penalized.

  • Time from signup to first useful action
  • Data import tools (CSV, from competitors)
  • Quality of onboarding wizard or in-app guidance
  • Complexity of initial configuration
02

Day-to-Day UX

25%

The heaviest-weighted axis. A tool you hate using on Tuesday will be abandoned by Thursday. We test the core daily workflows the product is built for.

  • Navigation clarity and information density
  • Mobile app quality (not just "mobile-friendly")
  • Speed of the most common actions
  • Consistency and polish across the interface
03

Feature Depth

20%

Does the tool do what it claims — and does it do it completely? We test edge cases, not just the happy path. Integrations and API quality count here.

  • Core feature completeness vs. category standards
  • Integration ecosystem (native and via Zapier/API)
  • Reporting and analytics depth
  • Handling of edge cases and complex workflows
04

Customer Support

15%

We test support channels directly — submitting real tickets and measuring response time, quality, and resolution. "24/7 support" claims are verified, not taken at face value.

  • Available channels (chat, phone, email, community)
  • Measured first-response time
  • Quality of the help documentation
  • Support availability by plan tier
05

Price-to-Value

12%

Not "is it cheap" but "is what you get worth what you pay." Scored relative to the category average. A $200/month tool can score higher than a $20/month tool if it delivers proportionate value.

  • Features per dollar vs. category median
  • Free tier or trial generosity
  • Pricing transparency (no hidden fees)
  • Scaling cost as team or usage grows
06

Data Portability

8%

Can you leave? Vendor lock-in is a real cost. We test data exports on day one — before we're emotionally invested. A tool that traps your data loses points regardless of everything else.

  • Export completeness (all data, not just summaries)
  • Standard formats (CSV, JSON, industry-specific)
  • Migration path to common competitors
  • Account deletion and data deletion process
Score formula

How the final number is calculated

Each axis score (0–5) is multiplied by its weight, summed, then scaled to a 10-point final score. The formula is the same for every tool in every category.

Axis
Weight distribution
Weight
Day-to-Day UX
Heaviest — this is what you live in
25%
Setup & Onboarding
First impression shapes retention
20%
Feature Depth
Does it actually do what it claims
20%
Customer Support
When things break, who answers
15%
Price-to-Value
Relative to category median
12%
Data Portability
Can you leave without pain
8%

Independence policy

What we will and won't do

Our affiliate relationships fund the site. Here's exactly where the line is.

What we do

  • Earn affiliate commission when readers sign up through some of our links
  • Disclose affiliate relationships clearly on every page where they exist
  • Apply the identical scoring rubric to all tools regardless of affiliate status
  • Rank tools by score — affiliate tools rank lower if they score lower
  • Update scores when tools improve or deteriorate, regardless of relationship
  • Accept correction requests from vendors if they include verifiable facts

What we never do

  • Accept payment to improve a tool's score or ranking position
  • Give a tool a higher score because it has a higher affiliate commission
  • Allow vendors to see or influence our scores before publication
  • Add "Editor's Choice" or similar labels in exchange for compensation
  • Remove negative findings from a review at a vendor's request
  • Recommend a tool we believe is genuinely worse for the reader's use case

Staying current

How we keep data fresh

A review written in 2023 and never updated is a liability, not an asset. Here's our update schedule.

Q
Price checks
Every pricing page is re-verified every quarter. Changed prices update immediately.
6mo
Full re-tests
Every tool gets a hands-on re-test every six months to catch UX and feature changes.
48h
Error corrections
Verified factual errors reported by readers or vendors are corrected within 48 hours.
Live
Date stamps
Every review shows the exact date prices and features were last verified.

Questions

Methodology FAQ

Do vendors pay to be listed on StackArbiter?
No. Any tool in our category can be reviewed — we choose what to cover based on market relevance and reader interest, not vendor relationships. We have affiliate partnerships with some tools, but that relationship has zero influence on whether a tool is reviewed or how it scores.
What if a vendor disagrees with our score?
We welcome factual corrections. If a vendor believes we've made a factual error — a wrong price, a feature we missed, a capability we mischaracterized — they can contact us with documentation. We investigate and update if the correction is valid. We don't change scores based on a vendor's preference or commercial pressure.
How do you handle tools with no affiliate programme?
We review them anyway and don't mark them differently. Scores are calculated identically. The only difference is that there's no affiliate link — we link to the tool's homepage directly instead. Our coverage decisions are not tied to whether we can earn from a tool.
Can I submit a tool for review?
Yes — contact us via the About page. We prioritise tools with meaningful market presence in the categories we cover. Submitting a tool doesn't guarantee a review, and it has no influence on the score if we do review it.
Why are your scores sometimes different from G2 or Capterra?
G2 and Capterra aggregate user ratings, which capture broad sentiment but can be skewed by review campaigns and vary wildly by use case. Our scores reflect our specific rubric applied to a specific business profile (SMB B2B). A tool loved by enterprise users might score lower with us if it's genuinely harder to set up and more expensive for a 5-person team.