Push-to-Talk Dictation vs Always-Listening AI Scribes for Therapists
Push-to-talk dictation lets the clinician speak their own notes between sessions — the client is never on the microphone and no audio is recorded. AI scribes record the full session, send audio to a cloud server, and use a language model to generate a note. The differences involve client consent, state recording laws, subpoena exposure, and what the patient agreed to.
What is the difference between push-to-talk dictation and an AI scribe?
They are different products solving overlapping problems. The distinction matters because the compliance, ethical, and clinical implications are not the same.
Push-to-talk dictation is a tool the clinician uses to convert their own speech into text. The clinician presses a hotkey, speaks the note in their own words — usually between sessions or after the session ends — and the words appear as text in the EHR or note app. The client is not on the microphone. Nothing is recorded. The clinician is the author of the note in every meaningful sense.
AI scribes (Heidi, Freed, Mentalyc, Supanote, Abridge, Nabla, DeepScribe, SimplePractice Note Taker) record the entire session, send the audio to a cloud server, run speech recognition on both speakers’ voices, and use a large language model to generate a structured clinical note — SOAP, BIRP, DAP, or modality-specific. The clinician reviews and signs the LLM’s draft. The audio of the patient and the generated transcript exist on the vendor’s infrastructure.
Both reduce documentation time. The trade-offs they ask the clinician to accept are very different.
What are the consent risks of recording therapy sessions with an AI scribe?
Therapy session recording is governed by a combination of state wiretap law, professional ethics codes, and clinic policy. Each layer adds friction.
Under state wiretap law, the United States is a patchwork of one-party and two-party consent states. In one-party consent states (most of the country), a single party to the conversation can authorize recording. In two-party (or all-party) consent states — including California, Florida, Illinois, Maryland, Massachusetts, Montana, Pennsylvania, and Washington — every participant must agree before recording is lawful. Recording a client without their consent in those states is a criminal matter, not a paperwork lapse.
Under professional ethics, the APA Ethics Code Standard 4.03 explicitly requires informed consent before recording images or voices in any clinical encounter. The ACA Code (B.1.b) and NASW Code (1.07) impose parallel obligations. Adopting an AI scribe means securing documented consent from every client, every time, with renewed consent if the recording purpose, vendor, or retention changes.
Under subpoena exposure, recorded session audio is discoverable. Once the recording exists, it can be ordered produced in custody disputes, malpractice claims, board complaints, or criminal cases. Many therapists are comfortable producing their own notes; producing audio of a patient’s words is a meaningfully different exposure.
Which states require all-party consent for session recording?
As of 2026, the states that require all-party (also called two-party or universal) consent for recording private conversations are: California, Connecticut, Delaware, Florida, Illinois, Maryland, Massachusetts, Michigan, Montana, Nevada, New Hampshire, Oregon, Pennsylvania, Vermont, and Washington. Specific statutory language varies; a few states have nuanced carve-outs.
A therapist licensed in any of these states who uses an AI scribe must document explicit consent from every client before the recording starts. Many clinics handle this with a one-time consent form added to intake paperwork; others handle it with a verbal consent recorded at the top of each session. Both methods work if implemented consistently. The risk arises when a new client signs the intake but later withdraws consent verbally, or when a returning client recorded before is asked again under a new vendor or retention policy.
A clinician who uses push-to-talk dictation has none of this surface area. The clinician records themselves, between sessions, stating their own clinical reasoning. There is no patient voice on the microphone, no recording to consent to, and no two-party consent question to manage. This is not a minor advantage in two-party-consent states.
What happens when a client refuses consent to an AI scribe?
This is a routine occurrence in 2026, and it is increasing as public awareness of AI in healthcare grows. NPR ran a well-circulated story in May 2026 on trust erosion in AI-assisted therapy; client questions about whether their sessions are being recorded are now a common intake conversation.
When a client declines AI scribe use, the clinician has two options: revert to typing the note themselves for that client, or use a non-recording dictation tool. Most clinics that adopt AI scribes maintain a mixed workflow: scribe-enabled for clients who consent, manual for the rest. Push-to-talk dictation makes the manual case faster without changing the clinician’s ethical posture.
Sapience Med is designed to be the same tool a therapist uses for every client, regardless of consent posture, because the tool does not record the client at any time.
Why do many therapists prefer push-to-talk dictation for clinical work?
Several reasons recur in clinician discussions on Reddit, in private-practice newsletters, and in the AI-scribe trust literature published since late 2025.
Authorship clarity. Push-to-talk produces the clinician’s own words, in their own clinical voice. AI scribe output is an LLM rendering of the session, which the clinician must review for accuracy and tone. Hallucinations — small but real — can place words in the chart that the clinician would not have written. The discovery liability is the same either way: the clinician signs the note and owns it. Push-to-talk minimizes the gap between intent and output.
Patient trust. Many clients, especially those seeking therapy specifically for privacy-sensitive issues (trauma, sexual orientation, immigration status, custody disputes), explicitly do not want their sessions recorded. A clinician who avoids ambient recording communicates a posture; one who records and discloses it communicates a different posture. Both can be ethical; only one signals maximum-restraint.
Subpoena minimization. Audio of a session is more powerful evidence than a written note and is harder to redact or contextualize. Therapists in custody cases, immigration matters, and personal-injury claims report discovery requests for AI scribe recordings; the existence of the recording is the discoverable artifact.
Cost and complexity. AI scribes typically run $79–150 per month per clinician for therapy-grade products with BAAs. Push-to-talk dictation is a fraction of that cost and does not require a recording-consent workflow.
When does an AI scribe still make sense?
AI scribes are not the wrong tool everywhere. There are legitimate clinical contexts where ambient recording with explicit consent makes sense:
- Inpatient or hospital settings where every encounter is already documented in detail and standardized SOAP output is the deliverable.
- High-volume primary care or behavioral health intake where ambient drafting saves enough time to be the difference between sustainable practice and burnout.
- Group practices that have already implemented consent workflows, BAA review, and EHR integrations with the AI scribe vendor.
- Clinicians who specifically want LLM-generated structured notes (e.g. PCL-5 or PHQ-9 templated extraction).
For solo and small-group mental health practices where the clinician already writes thoughtful notes and just wants to write them faster, push-to-talk dictation is the better fit. That is the audience Sapience Med is built for. Read the HIPAA architectural rationale for the compliance dimension, or see how Sapience Med works in SimplePractice for setup.
Frequently asked questions
Is push-to-talk dictation the same as an AI scribe?
Do I need my client's consent to use Sapience Med?
Are AI scribes legal in two-party consent states like California?
Can recordings from AI scribes be subpoenaed in court?
Does Sapience Med produce SOAP, BIRP, or DAP notes automatically?
Why is push-to-talk slower than ambient AI scribing?
Try Sapience Med free for 14 days.
$45/month or $399/year (save 24%) after the trial. No card required to start.