Sapience Med

Push-to-Talk Dictation vs Always-Listening AI Scribes for Therapists

Quick answer

Push-to-talk dictation lets the clinician speak their own notes between sessions — the client is never on the microphone and no audio is recorded. AI scribes record the full session, send audio to a cloud server, and use a language model to generate a note. The differences involve client consent, state recording laws, subpoena exposure, and what the patient agreed to.

Download for MacDownload for Windows14-day free trial · No card required

What is the difference between push-to-talk dictation and an AI scribe?

They are different products solving overlapping problems. The distinction matters because the compliance, ethical, and clinical implications are not the same.

Push-to-talk dictation is a tool the clinician uses to convert their own speech into text. The clinician presses a hotkey, speaks the note in their own words — usually between sessions or after the session ends — and the words appear as text in the EHR or note app. The client is not on the microphone. Nothing is recorded. The clinician is the author of the note in every meaningful sense.

AI scribes (Heidi, Freed, Mentalyc, Supanote, Abridge, Nabla, DeepScribe, SimplePractice Note Taker) record the entire session, send the audio to a cloud server, run speech recognition on both speakers’ voices, and use a large language model to generate a structured clinical note — SOAP, BIRP, DAP, or modality-specific. The clinician reviews and signs the LLM’s draft. The audio of the patient and the generated transcript exist on the vendor’s infrastructure.

Both reduce documentation time. The trade-offs they ask the clinician to accept are very different.

Which states require all-party consent for session recording?

As of 2026, the states that require all-party (also called two-party or universal) consent for recording private conversations are: California, Connecticut, Delaware, Florida, Illinois, Maryland, Massachusetts, Michigan, Montana, Nevada, New Hampshire, Oregon, Pennsylvania, Vermont, and Washington. Specific statutory language varies; a few states have nuanced carve-outs.

A therapist licensed in any of these states who uses an AI scribe must document explicit consent from every client before the recording starts. Many clinics handle this with a one-time consent form added to intake paperwork; others handle it with a verbal consent recorded at the top of each session. Both methods work if implemented consistently. The risk arises when a new client signs the intake but later withdraws consent verbally, or when a returning client recorded before is asked again under a new vendor or retention policy.

A clinician who uses push-to-talk dictation has none of this surface area. The clinician records themselves, between sessions, stating their own clinical reasoning. There is no patient voice on the microphone, no recording to consent to, and no two-party consent question to manage. This is not a minor advantage in two-party-consent states.

Why do many therapists prefer push-to-talk dictation for clinical work?

Several reasons recur in clinician discussions on Reddit, in private-practice newsletters, and in the AI-scribe trust literature published since late 2025.

Authorship clarity. Push-to-talk produces the clinician’s own words, in their own clinical voice. AI scribe output is an LLM rendering of the session, which the clinician must review for accuracy and tone. Hallucinations — small but real — can place words in the chart that the clinician would not have written. The discovery liability is the same either way: the clinician signs the note and owns it. Push-to-talk minimizes the gap between intent and output.

Patient trust. Many clients, especially those seeking therapy specifically for privacy-sensitive issues (trauma, sexual orientation, immigration status, custody disputes), explicitly do not want their sessions recorded. A clinician who avoids ambient recording communicates a posture; one who records and discloses it communicates a different posture. Both can be ethical; only one signals maximum-restraint.

Subpoena minimization. Audio of a session is more powerful evidence than a written note and is harder to redact or contextualize. Therapists in custody cases, immigration matters, and personal-injury claims report discovery requests for AI scribe recordings; the existence of the recording is the discoverable artifact.

Cost and complexity. AI scribes typically run $79–150 per month per clinician for therapy-grade products with BAAs. Push-to-talk dictation is a fraction of that cost and does not require a recording-consent workflow.

When does an AI scribe still make sense?

AI scribes are not the wrong tool everywhere. There are legitimate clinical contexts where ambient recording with explicit consent makes sense:

  • Inpatient or hospital settings where every encounter is already documented in detail and standardized SOAP output is the deliverable.
  • High-volume primary care or behavioral health intake where ambient drafting saves enough time to be the difference between sustainable practice and burnout.
  • Group practices that have already implemented consent workflows, BAA review, and EHR integrations with the AI scribe vendor.
  • Clinicians who specifically want LLM-generated structured notes (e.g. PCL-5 or PHQ-9 templated extraction).

For solo and small-group mental health practices where the clinician already writes thoughtful notes and just wants to write them faster, push-to-talk dictation is the better fit. That is the audience Sapience Med is built for. Read the HIPAA architectural rationale for the compliance dimension, or see how Sapience Med works in SimplePractice for setup.

Frequently asked questions

Is push-to-talk dictation the same as an AI scribe?
No. Push-to-talk dictation converts your own speech into text — you press a key, speak your note, the text appears. The client is not recorded, no AI generates the note, and the words are entirely your own. An AI scribe records the full session, sends audio to a server, and uses an LLM to draft the note for you to review and sign.
Do I need my client's consent to use Sapience Med?
No, because Sapience Med does not record the session or capture the client's voice. The clinician dictates their own notes between or after sessions. This is no different from a clinician speaking notes into a digital recorder for their own use, which has never required client consent under HIPAA or state wiretap laws.
Are AI scribes legal in two-party consent states like California?
Yes, with documented consent from every party. A clinician practicing in California or another all-party consent state who uses an AI scribe must obtain and document explicit, informed consent from every client before recording. Without that consent, recording the session is unlawful under state wiretap law and may also violate professional ethics standards.
Can recordings from AI scribes be subpoenaed in court?
Yes. Session audio held by an AI scribe vendor is discoverable in custody disputes, malpractice cases, board complaints, and other legal proceedings. Many therapists who switched from AI scribes back to manual notes or push-to-talk dictation cite subpoena exposure as a primary reason.
Does Sapience Med produce SOAP, BIRP, or DAP notes automatically?
No. Sapience Med types your own words into your EHR's note field. If you dictate a SOAP note structure, that's what appears. If you dictate free-form, that's what appears. There is no LLM rewriting your words into a structured template — that's the trade-off you make for not having an AI listen to your sessions.
Why is push-to-talk slower than ambient AI scribing?
On paper, ambient is faster because you don't write the note at all — the AI does. In practice, the time difference shrinks once you account for review and editing of the AI's output, consent management, and the clinician's clinical judgment that often catches hallucinations or missing nuance. Push-to-talk dictation completes a clean note in 3-5 minutes for most therapists, which is fast enough to finish between sessions rather than after dinner.

Try Sapience Med free for 14 days.

$45/month or $399/year (save 24%) after the trial. No card required to start.

Related