Most Shopify® projects that go wrong do not go wrong because of Shopify. The platform is stable, well-documented, and increasingly capable. They go wrong because the merchant picked the wrong development company.
This guide gives you a framework for avoiding that mistake: 7 evaluation criteria, the red flags that signal a bad fit before you sign a contract, and a realistic view of what Shopify development actually costs in 2026.
Development company vs agency vs freelancer
These 3 labels get used interchangeably. They are not the same thing.
A freelancer is 1 developer. They can move fast on well-scoped tasks, cost less per hour, and are ideal when you need a specific thing built and you can specify it precisely. The risk is single-point-of-failure: vacation, illness, a better offer, and your project stalls.
An agency is a firm with multiple developers, a project manager, and often a strategy or design function. They handle scope ambiguity better and can absorb team changes mid-project. The risk is overhead: you pay for coordination, account management, and margin on top of the developers actually doing the work.
A development company in the Shopify context typically means a technical firm specializing in Shopify implementations, often with a focus on a specific segment (mid-market, Plus, headless). The best ones look like a hybrid: agency accountability with embedded-team economics.
Which you need depends on your project size and how well you can define scope upfront.
Why vendor selection beats platform selection
When Shopify Plus merchants survey post-implementation failures, the platform almost never appears in the root-cause list. What does appear:
- Vendor lacked Plus experience. Standard Shopify and Plus look similar in a demo. In production, Plus means checkout extensions, B2B Company model, Shopify Functions in Rust/AssemblyScript, multi-market currency handling, and Shopify Flow at scale. A team with 50 standard stores and 0 Plus stores is not a Plus team, regardless of what their website says.
- Scope was set by the vendor, not by business requirements. Every project pitched as "we need to rewrite this in Hydrogen" should be interrogated. Headless commerce solves specific problems. When the vendor leads with the solution before understanding the constraint, you are buying their competency demonstration, not your business outcome.
- Handoff failed. The build finished. Nobody on the merchant's side can maintain it. The agency is gone. This is the most common long-tail cost and the one least discussed during vendor selection.
7 criteria for evaluating a Shopify development company
1. Plus production experience
Ask for 3 Plus stores they have shipped in the last 18 months. Check whether those stores are live. Ask specifically: "What checkout extensions did you build and what did they replace?" A team with real Plus work can answer this immediately.
2. Checkout extensions and Shopify Functions
Checkout extensions (replacing checkout.liquid, which Shopify deprecated for Plus in 2024) and Shopify Functions (custom logic in Rust/AssemblyScript running at Shopify's infrastructure level) are now core Plus capabilities. If the team describes these vaguely or defaults to "we use apps for that," they are not current.
3. Hydrogen and headless capability
Not every project needs Hydrogen. But a team that cannot evaluate whether your project needs it — or worse, always recommends it — is missing judgment. Ask: "For a merchant in our situation, what would make you recommend Hydrogen vs a standard Online Store 2.0 theme?" The quality of the answer tells you more than the answer itself.
4. Integration track record
Most Shopify projects involve at least one major integration: ERP (NetSuite, SAP), OMS, PIM, marketing platform, loyalty system, or 3PL. Ask for a specific integration they have shipped, the approach (Shopify webhooks vs polling vs middleware), and what broke in production and how they fixed it. Real experience surfaces in the edge cases.
5. Performance budgets
A well-built Shopify theme should hit Core Web Vitals consistently: LCP under 2.5s, INP under 200ms, CLS under 0.1. Ask the team for their process for setting and hitting performance budgets. If they say "we run Lighthouse at launch," that is not a process. If they describe image optimization strategy, third-party script governance, and theme section lazy-loading, that is a process.
6. Post-launch support model
Shopify pushes platform updates. Checkout UI changes. Apps conflict. A merchant with no post-launch support is exposed. Ask: What does the engagement look like after go-live? Is there a retainer option? What is the SLA for a production-down issue? A vendor with no answer to this question has never been responsible for a live store.
7. Contract exit terms
The engagement will sometimes need to end before the original scope is complete. Clear exit terms protect both sides. Ask: What happens to IP if we end the engagement early? Can we get the code in a deployable state at any point? Is there a project wind-down clause? Vague answers here are a pre-signature preview of disputes post-signature.
Red flags worth walking away over
Every project pitched as a headless rewrite. If the first call leads with Hydrogen before any discovery of your current architecture, traffic patterns, and team capability, the vendor is optimizing for a larger contract, not your outcome.
No Plus references on a Plus project. References from standard Shopify merchants do not validate Plus capability. Ask specifically for Plus merchant references and call them.
Hourly rates with no senior developer accountability. Offshore rates of $30-60/hr exist. Some teams at those rates are excellent. Many are not. The differentiator is not the rate — it is whether there is a senior technical lead on your project who owns quality and can be held accountable.
Prototype demos instead of production examples. A demo built for a pitch uses clean data, no third-party scripts, and no real user traffic. Ask to see the Lighthouse score of a live store they built, not a staging environment. Real is different from demo.
Vague discovery process. A good development company runs a discovery phase before proposing scope and cost. If the vendor gives you a price estimate on a 30-minute call without asking about your current tech stack, integrations, SKU count, or team structure, the estimate is fiction.
Pricing reality in 2026
Pricing varies significantly by geography and seniority.
| Tier |
Rate range |
Best for |
| Senior onshore (US, AU, UK) |
$150-300/hr |
High-complexity Plus work, audit engagement |
| Senior nearshore (Eastern Europe, LatAm) |
$80-140/hr |
Mid-market Plus, long-term embedded team |
| Senior offshore (Southeast Asia, South Asia) |
$40-80/hr |
Clear-scope work, strong senior lead on project |
| Junior offshore (any region) |
$20-50/hr |
Not recommended for Plus-complexity work |
These are market rates, not aspirational rates. A vendor quoting significantly below these ranges is pricing in corners: junior team, no senior review, or delivery risk transferred to you.
Project-level costs for a typical Plus migration from another platform or a major theme rebuild: $40,000-$150,000 depending on integration complexity, custom functionality, and migration data volume.
Fixed-bid vs T&M vs embedded team
Fixed-bid works when scope is genuinely fixed and you have the technical knowledge to verify it is complete. Most mid-market Shopify projects do not have genuinely fixed scope. The vendor's incentive is to minimize hours; your incentive is to get everything in scope. This misalignment produces disputes and quality shortcuts.
Time and materials works when you have a technical project manager who can evaluate output and control scope. Without that accountability, T&M drift is expensive. You need someone on your side who can challenge a developer on whether a task genuinely required 12 hours.
Embedded team is the model that works best for merchants without a strong in-house technical function. A dedicated senior developer (or small team) works on your roadmap, is accountable to you directly, and is interruptible for production issues. The economics are closer to a contractor than a project engagement, but without the overhead of hiring and equipment.
The Sapota team operates on an embedded model: senior engineers, a 2-week paid trial before committing to a long engagement, and a flat monthly rate. If the trial does not work for either side, both parties exit cleanly.
How to run the vendor evaluation process
- Define your project scope and success criteria before talking to vendors. If you cannot articulate what done looks like, the vendor will define it for you.
- Send the same brief to 3-4 vendors. Compare not just the proposals but the quality of the discovery questions they ask back.
- Ask for 2-3 references for projects similar to yours. Call them. Ask specifically what went wrong and how the vendor handled it.
- Request a small paid pilot — a well-scoped 2-week piece of work — before committing to the full engagement.
- Review exit terms before signing. Do not sign a contract with no exit clause.
The vendor who earns the work will welcome these steps. The one who pushes back on the pilot or references is telling you something.
Thảo at Sapota has been part of Plus migrations, performance audits, and embedded team engagements across retail, B2B, and DTC verticals. If you're evaluating Shopify development companies and want a technical second opinion, reach out to the Sapota team.