Blog · Published April 30, 2026 · 11-minute read
The flat-fee solo's leak: different shape, same arithmetic
The first leak post we wrote — why US solo lawyers leak $30,000 a year in unbilled hours — was about hourly billers under-recording time. The most common pushback we got was the most reasonable one: "I bill flat-fee specifically so I do not have to do that." Fair. This post is about why flat-fee solos leak too, in a different shape, with the same underlying arithmetic — and why measuring still helps, even when no client will ever see a billable entry on an invoice.
TL;DR
Flat-fee solos — immigration, criminal defense, family-law uncontested, small estate planning, modest-fee transactional — do not leak revenue through unbilled hours. They leak it through five other holes: free intake calls that never convert, engagement letters priced from gut feel rather than data, undocumented scope creep that should have triggered an additional-fee letter, post-engagement work the practice swallows because the matter is "closed," and the long-tail bad-fit client who consumes three times the average effort for the same flat fee. Across the practices we have studied, the realized hourly rate runs 18–32% below the engagement letter's implied rate — equivalent to $50–$100 of margin erosion per matter on a typical $3,000–$5,000 flat fee, or $30,000–$80,000 a year for a solo handling 60–100 matters. The lever is the same as in the hourly post — passive metadata-only capture of actual time-per-matter — but the downstream use is different. Hourly solos use the data to bill more accurately. Flat-fee solos use the data to price more accurately and to flag scope creep before it metastasizes. Same instrument; different output. This post is the arithmetic, by practice area, with concrete numbers.
Why this post had to be a separate post
Solo practice in the United States is more flat-fee than the legal-press narrative implies. Per ABA TechReport and the Clio Legal Trends data over the past three years, somewhere between 38% and 44% of US solo matter volume is billed on a flat-fee basis at intake — concentrated in immigration (typically 70–85% flat-fee), criminal defense (60–75%), uncontested family-law work like simple divorces and prenups (50–65%), small estate planning (80%+), and modest-fee transactional work like real-estate closings, business-formation packages, and simple contracts. Hybrid is also common — a flat fee for the main matter, hourly for materially out-of-scope work, contingent for the upside in injury cases.
The hourly leak post implicitly assumed the reader was selling time. Half the audience was not. Several of the substantive replies on r/Lawyertalk and on LinkedIn made the same point: the leak we described did not feel like the leak they had. They were right. The leak in flat-fee practice is not under-recording time; it is under-pricing engagements relative to the actual cost of producing them. Different mechanism, different downstream artifact, but the same underlying problem — the practice does not know how long its work actually takes — and the same lever — measurement that makes the truth visible.
Five shapes of the flat-fee leak
1. The free intake call
Most flat-fee solos run a free initial consultation — fifteen to forty-five minutes by phone or video. The pitch is reasonable: low friction for the prospect, the lawyer screens for fit and conflicts, the conversion is somewhere between 30% and 60% depending on practice area. What gets quietly omitted from the cost-of-goods is the unconverted side. A 35-minute consult that does not convert costs roughly $145 of senior-lawyer time at a $250/hour implicit rate. At ten unconverted intakes a month — a low number for a practice doing any volume of inbound — that is $17,400 a year of senior-lawyer time the practice never charges for and never sees on the books.
This number is recoverable in two directions. Some of it is a real cost of doing business — sales has to happen somewhere, and the conversion rate is the sales-conversion metric every business runs. The other part is fixable: by tightening intake to a fifteen-minute call, by gating consultations through a written intake form, or by raising the consultation fee from zero to $75 with credit on engagement. The practice that does not measure the unconverted-intake load cannot tune any of those levers — it just absorbs the cost.
2. The engagement letter priced from gut feel
This is the largest leak shape and the easiest to fix. A solo immigration practitioner sets the I-485 + I-130 family-petition flat fee at $4,500. The number was set in 2022 against a recollection of how long the work used to take and a glance at what other practitioners in the metro charge. It has not been revisited. Three years later USCIS processing has changed, RFE rates are up, the typical matter takes 22 hours of actual work instead of the 18 the original price assumed, and the implicit hourly rate has dropped from $250 to $204 without anyone noticing.
The leak compounds because flat-fee pricing is sticky. Raising prices feels confrontational; clients and referral sources expect the same number this year as last year; bar-association directories list quoted ranges that calcify the market. The practice that raises its number from $4,500 to $5,200 once a year — well within the natural drift — recovers $42,000 a year on 60 matters, or $70,000 on 100. The practice that does not is the practice that drifts toward unprofitable equilibrium.
3. Undocumented scope creep
Most modern flat-fee engagement letters in immigration, family law, and criminal defense contain an additional-fee clause for materially out-of-scope work. RFE responses beyond the first one. Post-judgment motions in family law. Supplemental discovery in criminal cases. Probate estate appraisals that turn into contested fee disputes. The clauses are enforceable; we have read dozens of them and they are typically well-drafted. They are also nearly never triggered. The reason is mechanical: triggering the additional-fee clause requires sending the client a written notice, which requires producing contemporaneous evidence that the scope expanded, which requires having recorded the work that produced the expansion. Without time records, that conversation is a nine-out-of-ten loss for the lawyer because the lawyer has nothing to point at.
The dollar size of the leak depends on practice mix. In an immigration practice taking 20% of matters into RFE territory, with each RFE response averaging 4 hours over the included scope, the unbilled additional work runs $1,000 per RFE matter at $250/hour implicit — or $20,000 a year on 100 matters. In a family-law practice with 15% of matters going post-judgment, the leak is comparable. In criminal defense it can be larger because the additional work happens at trial readiness when the lawyer is most time-pressed. None of it gets billed; all of it gets done.
4. Post-engagement work after the fee is collected
The matter is "closed." The fee is paid. The client's file is in storage. Then they call. Status check on the green card. Question about a custody-modification deadline. Forwarding paperwork to a new employer. Quick consult on whether last year's plea affects something they are doing now. The lawyer takes the call out of professional courtesy, spends twelve to forty minutes on it, and books nothing — because the matter is closed and the engagement letter contemplated no post-engagement work.
This is the most defensible shape of leak in the sense that it builds long-term referral equity, and we are not going to argue against doing it. But the practice that does not measure post-engagement load underestimates how much of its capacity goes to closed matters. We have seen flat-fee solos who learn that 8–12% of their working hours go to matters that are formally over. That number drives a different decision — whether to add a $500/year alumni-retainer tier, whether to bake a six-month post-engagement window into the engagement letter, whether to migrate certain practice areas to a subscription model. None of those decisions can be made without the data.
5. The bad-fit client
Every flat-fee practice has a long tail. The DUI matter that goes to a rare suppression hearing on novel facts and consumes 80 hours instead of 28. The uncontested divorce that turns out not to be uncontested because one spouse hires counsel late. The simple business formation where the founders' operating-agreement disagreements turn into ten weeks of mediation. The flat fee was set against the average matter, and the long tail of three-times-the-average matters is where the margin disappears.
The standard response is to write better engagement letters — to disqualify the worst-fit clients at intake, to convert obviously-complex matters to hourly, to add explicit triggers for additional-fee letters. All of that helps. None of it works without measurement, because the practice that does not know which matters consumed three times the average cannot identify the pattern of who the bad-fit clients are. After two years of measurement, most flat-fee solos can describe the bad-fit profile with eerie specificity — a particular kind of opposing party, a particular kind of family situation, a particular kind of business partner. The disqualification at intake gets twenty percentage points more accurate, and the long tail compresses materially.
The arithmetic, in immigration practice
Take a representative US solo immigration practice. Family-based petitions and adjustments are the bread and butter; about 80 matters a year, average flat fee of $4,500, simple matters cleared in 14–18 actual senior-lawyer hours, complex matters in 26–40. The practice does not currently measure time-per-matter; the engagement letter has not been repriced since 2023. Here is what a year looks like with measurement applied:
| Line item | Without measurement | With measurement | Annual delta |
|---|---|---|---|
| Average matter hours | (Unknown — assumed 18) | 22 (measured) | — |
| Implicit hourly rate | $250 (intended) | $204 (realized) | — |
| Engagement letter price (next cycle) | $4,500 (frozen) | $5,000 (data-justified bump) | +$40,000 |
| RFE matters at additional-fee tier | 0 (not triggered) | 14 (measured + invoiced via clause) | +$14,000 |
| Unconverted intake hours | ~85 hrs (invisible) | ~85 hrs (visible) | $0 directly, informs intake redesign |
| Post-engagement hours | ~120 hrs (invisible) | ~120 hrs (visible) | $0 directly, informs alumni-retainer offer |
| Bad-fit matters identified | 0 (anecdotal) | 9 (measured) | $0 directly, informs disqualification at intake |
| First-year net revenue impact | — | — | ≈ +$54,000 |
The bulk of the recovered margin is just from the engagement-letter reprice. The additional-fee triggering on RFE matters is real but modest. The other three shapes — intake load, post-engagement work, bad-fit identification — are second-order: they do not directly add revenue in year one but they shape decisions that compound in years two and three. After three years of measurement most practices in this profile see another $40,000–$80,000 of avoided erosion from intake-redesign and bad-fit disqualification. None of it requires changing the fee structure clients see; the engagement letter is still flat-fee, the invoicing is still flat-fee, the marketing is still flat-fee.
The arithmetic, in flat-fee criminal defense
The criminal-defense flat-fee profile is shaped differently. A representative DUI/misdemeanor practice runs 90 matters a year at $3,500 average, with about 12% going to substantive litigation that exceeds the assumed scope. The intended hourly rate is $300; the realized rate is closer to $232 because the long-tail trial-readiness matters consume 60–80 hours of work for the same fee. The leak shape is concentrated almost entirely in the long-tail bad-fit matter rather than spread across the book.
The lever is different too. Criminal-defense engagement letters more typically split flat-fee for pretrial work with a contingent or fixed escalator if the matter goes to trial — but the escalator is rarely triggered cleanly without time records. In a 90-matter year with 11 trial-track matters, the practice that measures the actual hours-per-trial-matter and triggers the escalator on each one recovers roughly $19,000–$27,000 of fee that the practice that does not measure simply absorbs. The recovered amount is comparable to the salary of a part-time paralegal — large enough to materially change the practice's hiring math.
Why metadata-only capture serves flat-fee solos differently
This is where the post diverges from the hourly version. In an hourly practice, the captured metadata flows into a digest, the lawyer reviews and approves, and the entries become invoice line items. The client sees them. In a flat-fee practice, the captured metadata flows into the same digest, the lawyer reviews, and the data stays inside the firm — the client never sees a time entry, no invoice line item is generated. The data is used three ways:
- Engagement-letter pricing intelligence. At quarter-end, the practice runs a report: average hours per matter type. Compare against the implicit hourly rate. Adjust the engagement letter's standard price for next quarter's intakes.
- Scope-creep flagging. When a specific matter exceeds 1.5× the average for its type, the system surfaces it. The lawyer reviews and decides whether to send an additional-fee letter while there is still contemporaneous evidence to anchor the conversation.
- Bad-fit pattern detection. At year-end, the matters that ran 2× or more the average get a structured review — what was the pattern, what was the disqualifier missed at intake, how does the next iteration of the intake script catch it.
None of this requires the client to see a billable entry. None of it requires the engagement structure to change. None of it requires informing the client that time is being measured at all — the metadata is firm-side intelligence, not client-side billing. The privilege architecture is the same as in the hourly case: no audio, no email bodies, no document content, only metadata fields the lawyer would write into a paper notebook anyway. The data lives on the lawyer's machine; the practice owns it.
What the digest looks like for a flat-fee solo
In an hourly practice the end-of-day digest is "you spent 47 minutes on six billable-looking events; confirm or edit." In a flat-fee practice the same digest is functionally similar but the categorization is different. A typical end-of-day for a family-law solo on Tuesday might read:
- Garcia divorce (uncontested $3,500): 38 min today (3.2 hrs cumulative; average for matter type 14 hrs)
- Williams custody modification (out-of-scope hourly $300/hr): 1 hr 12 min today (10.4 hrs cumulative; logged for hourly invoice)
- Lee prenup (flat-fee $1,800): 24 min today (1.7 hrs cumulative; average 4 hrs)
- Free intake — Patel: 38 min (logged; awaiting conversion decision)
End-of-week and end-of-quarter views surface the matter-type averages, the matters tracking 1.5× over, the unconverted intake load, and the post-engagement work that would otherwise be invisible. None of it gets sent to a client. The Williams entry becomes an invoice; the others stay inside the firm.
When this is not your problem
Three flat-fee profiles for whom the arithmetic in this post does not apply cleanly.
Pure-contingency PI lawyers. The leak shape is different — it is mostly underestimating time-to-settlement and the cost of carrying advance expenses on cases that take three years instead of one. Time-per-matter is still useful but the bigger lever is portfolio diversification and case-selection rigor at intake. Worth a future post; not this one.
Lawyers with very narrow flat-fee scopes. A practice doing only simple wills at $400 a piece, with strict intake disqualification of anything complex, has effectively engineered the leak out of the model already. The marginal value of measurement is low. If your average matter consumes under 90 minutes from intake to closing, you are probably fine.
Subscription-model practices. A practice running a per-month retainer with a stated capacity cap (say, $250/month for two hours of work, with overflow at hourly rates) has a different mechanism — the cap itself is the scope-creep enforcement device. Measurement still helps tune the cap, but the urgency is lower because the cap creates a forcing function the engagement letter on flat-fee single matters does not.
The hourly-vs-flat-fee question many solos get stuck on
Once a flat-fee solo sees the leak math, the obvious move is "should I just switch to hourly?" Almost always the answer is no, for three reasons.
First, flat-fee clients chose flat-fee for cost certainty. That selection bias is real and durable. The clients who chose you for predictable cost will not become hourly clients; they will become someone else's flat-fee clients. The market is segmented; switching pricing models is a positioning change, not just a billing change.
Second, hourly billing has its own leak — under-recording the small-moment work — which we covered in the original leak post. Switching from a measured flat-fee model to an unmeasured hourly model can easily make the leak worse, not better. The lever is measurement, not the fee structure.
Third, flat-fee pricing is a competitive advantage in several practice areas. Immigration in particular is so dominated by flat-fee that hourly billing reads as a red flag in client referral conversations. The practice that wants to keep its referral pipeline intact should keep its fee structure stable while measuring quietly.
The right move is to keep flat-fee pricing while measuring actual time-per-matter privately, then reprice once a year against the data. The fee structure stays unchanged from the client's perspective; the practice's margins quietly recover. Hourly conversion is only worth considering for the specific subset of matters where flat-fee pricing has structurally failed — typically the 5–10% of work where outcomes vary by an order of magnitude on facts the lawyer cannot assess at intake.
The recap
Flat-fee solos do not leak revenue through unbilled hours. They leak it through five other holes: free intakes that never convert, engagement letters priced from gut feel, undocumented scope creep, post-engagement work absorbed without compensation, and bad-fit clients who consume three times the average matter. The realized hourly rate runs 18–32% below the engagement letter's intended rate; the annual margin erosion lands in the $30,000–$80,000 range for a typical 60–100-matter solo practice. The lever is the same as in the hourly post — passive metadata-only capture of actual time-per-matter — but the downstream output is different. Hourly solos use the data to bill more accurately; flat-fee solos use the data to price more accurately, flag scope creep before it metastasizes, and disqualify bad-fit clients at intake. None of it requires the client to see a billable entry. None of it requires the engagement structure to change. The only thing it requires is knowing how long the work actually takes — which, until now, almost no flat-fee solo has had a credible way to measure without surrendering to a $89-a-month practice management system that was built for a different problem entirely.
Frequently asked
I bill flat-fee precisely so I don't have to track time. Why would I track it?
Tracking is for pricing, not billing. Flat-fee solos who measure actual hours-per-matter discover the difference between the engagement letter's implicit hourly rate and the realized one — usually a $40–$80/hour gap once free intakes, scope creep, and post-engagement work are counted. The data informs next year's engagement letter, not next month's invoice. The client never sees a billable entry; you never send one. The measurement is private firm-side intelligence.
What does the typical flat-fee leak actually cost?
In our reading of immigration, criminal-defense, and family-law uncontested practices, realized hourly rate runs 18–32% below the engagement letter's implied rate — equivalent to roughly $50–$100 per matter on a $3,000–$5,000 flat fee. For a solo handling 60–100 matters a year that compounds to $30,000–$80,000 of margin erosion that the books cannot see, because the fee was collected in full and the work was completed without an unhappy client. The leak is invisible to ordinary accounting; it shows up only when actual time-per-matter is measured.
Should I switch from flat-fee to hourly to plug the leak?
Almost never. Flat-fee clients chose flat-fee for cost certainty; that selection bias is real and durable. The right move is to keep flat-fee pricing while measuring actual time-per-matter privately, then reprice the engagement letter once a year against the data. The fee structure stays unchanged from the client's perspective; the practice's margins quietly recover. Hourly conversion is only worth considering for the specific subset of matters where flat-fee pricing has structurally failed.
How is this different from the hourly leak post?
The hourly leak is a measurement-to-billing problem: hours worked but not invoiced. The flat-fee leak is a measurement-to-pricing problem: matters that took longer than the engagement letter assumed. Same instrument (passive metadata-only capture), different downstream use. The hourly version produces invoice line items; the flat-fee version produces pricing intelligence and scope-creep flags. Both rely on the lawyer knowing how long their work actually takes — which neither audience currently does without a tool.
Does scope creep ever justify additional fees on a flat-fee matter?
Yes, when the engagement letter contemplates it. Most modern flat-fee letters in immigration, family law, and criminal defense include an additional-fee clause for materially out-of-scope work — RFE responses beyond the first one, post-judgment motions, supplemental discovery. The clause is enforceable but only if the additional work was actually done and recorded. Without time records, the additional-fee conversation is a nine-out-of-ten loss for the lawyer because there is no contemporaneous evidence of the scope expansion. With them, it is a far more even conversation.
What about bar-discipline rules? Does flat-fee tracking touch any of them?
ABA Model Rule 1.5 and most state corollaries require that fees be reasonable and that the basis for them be communicated. Flat-fee tracking strengthens both — the lawyer can demonstrate the fee is reasonable relative to actual work performed, and the engagement letter is more accurate next time. ABA Formal Opinion 512 (2024) on AI tools in legal practice is the current standard for any data-handling claim; metadata-only capture sits well inside its permitted-use envelope. We walked through the privilege architecture in detail in the technical companion post.
Further reading
- Why US solo lawyers leak $30,000 a year in unbilled hours — the hourly companion to this post
- The $1,250-a-week math: hire a second associate, or recover the time you're already missing? — the unit-economic comparison
- Privilege-preserving time tracking: a metadata-only architecture, explained — the technical companion
- Clio vs Smokeball vs MyCase: the 2026 honest solo-lawyer ranking — for the solos who are evaluating PMS as the alternative to measurement
- Immigration lawyer time tracking: the flat-fee-aware playbook — vertical deep-dive
- Criminal defense time tracking: pretrial scope, escalators, Brady visibility — vertical deep-dive
- Family law time tracking: hourly + flat-fee + bad-fit detection — vertical deep-dive
- Billable hour capture without a PMS subscription — the unit-economics deep dive
- Automatic time tracking for attorneys — the capture mechanism, in detail
- The lean-app argument — why a tool should do one job well, not eight jobs adequately
- The ClaimHour launch essay — the 1,600-word opening argument