If you have ever stared at a stack of evaluation forms and wondered which standardized tool will actually answer your clinical question, you are in the right place. This guide walks you through the assessments OTs use most, organized the way the Occupational Therapy Practice Framework, 4th Edition (OTPF-4) structures the evaluation itself: occupational profile first, then a focused analysis of occupational performance.
We will cover the four categories of assessments, the most-used standardized tools by domain (with links to the developer or to peer-reviewed psychometrics on the Shirley Ryan AbilityLab Rehabilitation Measures Database), telehealth-friendly options, current CMS billing context for 2026, free vs paid choices, and a decision aid you can keep next to your desk.
An occupational therapy assessment is a structured, evidence-informed process you use to figure out what a person wants and needs to do, what they can do right now, and what is getting in the way. The American Occupational Therapy Association (AOTA) defines evaluation as "obtaining and interpreting data necessary for intervention." Assessment is the toolkit you use to gather that data.
Two things matter here. First, "assessment" is not the same as "evaluation." Evaluation is the entire process. Assessments are the specific instruments (interviews, observations, standardized tests) you choose along the way. Second, every assessment you pick should map back to a clinical question. If it does not, it is paperwork, not practice.
The framing comes from OTPF-4 and from the WHO International Classification of Functioning, Disability and Health (ICF): occupation sits at the intersection of body functions, activity, participation, and environment. A good assessment battery hits multiple levels.
One more clarifying note. The World Federation of Occupational Therapists (WFOT) ties OT's identity to the use of occupation as both the means and the end of intervention. That has practical consequences for assessment: a test that measures grip strength tells you something about an impairment, but only an occupation-grounded measure tells you whether the person can open a jar of pasta sauce, button a child's coat, or hold a pen long enough to finish a shift report. Both kinds of data belong in your eval; the second kind is what makes the eval occupational therapy and not generic rehabilitation.
If you are an OT student or new grad preparing for boards through NBCOT, this distinction shows up everywhere on the exam. The "right answer" almost always involves an assessment that captures occupational performance, not just an isolated body function.
The OTPF-4 splits evaluation into two steps. You start with an occupational profile (the client's history, patterns, values, and what they want from therapy), then move to an analysis of occupational performance (the standardized testing and observation that pinpoints supports and barriers). The AOTA evaluation and assessment hub is the canonical reference.
For Medicare Part B and most commercial payers in the United States, you bill an OT evaluation using one of three CPT codes based on complexity:
CPT 97168 is the re-evaluation code. All four are untimed and billed once per episode. For the official CMS guidance on outpatient OT and PT billing rules, see CMS Therapy Services.
The KX modifier threshold for calendar year 2026 is $2,480 for occupational therapy services, separate from the combined PT and SLP threshold (also $2,480). Once a beneficiary's incurred expenses pass that amount in the year, you append the KX modifier to attest that continued services are medically necessary. The targeted medical review threshold is $3,000 and is statutorily frozen at that level through calendar year 2027, so it does not adjust annually the way the KX threshold does. The APTA payment-thresholds page explains the mechanics, which apply equally to OT.
The KX modifier is not a permission slip. It is a clinician's attestation. Documentation has to support medical necessity for every claim above the threshold.
A note on selecting your evaluation complexity level: the difference between 97165 and 97167 is not just minutes spent. It is the number of performance deficits you identified, the complexity of the clinical reasoning, and whether the client's history is straightforward or layered with comorbidities. CMS expects your documentation to support whichever level you bill. Auditors look at the occupational profile, the assessments administered, the deficits enumerated, and the clinical reasoning narrative. If those four elements line up with the code's definition, the claim holds.
If you only remember one taxonomy, make it this one. It maps cleanly onto the OTPF-4 domain and gives you a way to scan a fresh referral and decide what to grab.
| Category | What it measures | Example tools |
|---|---|---|
| Occupation-based | Performance and satisfaction in the actual occupations the client cares about | COPM, AMPS, OT-PAL |
| Performance skills | Motor, process, and social interaction skills observed during task performance | Fugl-Meyer, Box and Block, 9-Hole Peg, Jebsen-Taylor, ARAT |
| Client factors | Body functions and structures (cognition, sensation, ROM, strength, mood) | MoCA, MMSE, Sensory Profile 2, PHQ-9, GAD-7, goniometry |
| Environment and context | Physical, social, cultural, and virtual context shaping performance | Home FAST, HOME, SAFER-HOME, Craig Hospital CHIEF |
A good battery for any client touches at least two of these. Picking only client-factor tests is a common trap; it produces a reductionist picture that misses the actual occupational question. AOTA's Quality Toolkit reinforces this with outcome-driven measurement recommendations.
Below are the tools you will see most often in U.S. clinical practice, organized by the question they answer. Each entry tells you what it measures, who it is for, whether it is free or paid, and where to find the canonical source.
Telehealth flexibilities that started during the COVID-19 public health emergency have largely been extended. Under the Consolidated Appropriations Act of 2026, OTs remain eligible as distant-site telehealth providers through December 31, 2027, with no geographic restrictions and audio-only permitted when clinically appropriate. The authoritative reference is the CMS List of Telehealth Services for the current calendar year.
Not every assessment translates cleanly to a video visit. The ones that do best are interview-based or rely on simple movements the client can perform in their own space with a camera and a household item or two. Strong telehealth options include:
For upper-extremity motor assessments like the Fugl-Meyer, remote-administration studies have emerged but interrater reliability is lower than in-person. Document the modification and use clinical judgment.
Two practical tips for video evaluations. First, build a "telehealth eval kit" worksheet that lists the household objects the client should have within reach (chair, full water bottle for grip resistance, ruler for goniometry estimation, etc.) and email it before the visit. Second, when an assessment has both an in-person and a published telehealth version, name the version you administered in your note. "Berg Balance Scale (video-administered, caregiver supervision)" reads differently to a reviewer than a generic "Berg Balance Scale" entry, and it protects you if a fall happens later.
Three things to keep straight for U.S. Medicare and most commercial payers:
Your eval note has to map back to the OTPF-4 framework. AOTA's Quality Toolkit includes documentation guidance and outcome-measure templates. Most denials we see are not about the test you chose; they are about a thin occupational profile or missing tie-back from the assessment finding to the functional goal.
A clinician with a tight budget can build a credible adult-rehab battery using almost entirely free instruments. Here is the lay of the land.
| Free or public domain | Paid (manual or kit required) |
|---|---|
| Barthel Index (free for non-commercial clinical use with citation), Katz ADL, Lawton IADL, PHQ-9, GAD-7, Berg Balance, TUG, 30-Second Chair Stand, Tinetti POMA, Fugl-Meyer, ARAT, Box and Block, 9-Hole Peg, Role Checklist v3, Home FAST | COPM, AMPS (certification required), AM-PAC (licensed for clinical use via CREcare), MoCA (certification approx. $125), Sensory Profile 2, SIPT, Bayley-4, PDMS-2, BOT-3 (and predecessor BOT-2), MABC-2, Jebsen-Taylor (kit), MMSE (commercial license), most FCE protocols |
Before you spend departmental money on a paid assessment, ask four questions: Does it answer a clinical question your free tools cannot? Are the norms current and relevant to your population? Will reimbursement actually change based on the score? Can the rest of the interdisciplinary team interpret the result?
If the answer is yes to two or more, buy it. If not, the free instrument is almost certainly fine.
One more consideration that does not show up on the spreadsheet: clinical credibility with the interdisciplinary team. In acute care, the rehab team is often using the AM-PAC "6-Clicks" to make discharge recommendations. If you walk in with a different ADL measure, your data will not feed into the team's huddle. Match the local norms when you can; deviate when the clinical question genuinely demands it.
You do not need a stats PhD to read an assessment manual. You need four words.
The Shirley Ryan AbilityLab Rehabilitation Measures Database publishes MDC and MCID values for hundreds of instruments, organized by population. Bookmark it.
Pick assessments with published MDC and MCID values for your patient's diagnosis. If those numbers do not exist, you cannot reliably say whether your intervention helped.
A worked example: a chronic-stroke patient's Box and Block score moves from 22 to 31 blocks per minute over six weeks. The published MDC for chronic stroke is roughly 5.5 blocks per minute (Chen et al. 2009); the change of 9 blocks exceeds it, so it is real. MCID estimates for the BBT in stroke vary in the literature and appear to scale with baseline severity, but a 9-block change comfortably exceeds the commonly cited 6-block estimate, so it is also likely to feel meaningful to the patient. Now you have a paragraph for your progress note that maps a number to a story.
The flip side: a moderate-stroke patient's Fugl-Meyer UE score moves from 38 to 41 over four weeks. The MDC in chronic stroke is around 5 points. Your 3-point change has not cleared measurement error. The honest interpretation is "stable" and the honest action is to revisit dosage, modality, or the goal itself rather than report a win.
Start from the clinical question, not the instrument. The decision aid below maps common questions to the tools most clinicians reach for first.
| Clinical question | First-line assessment | When to add more |
|---|---|---|
| What does this client want from therapy? | COPM | Role Checklist v3 if occupational identity is unclear |
| How independent is this older adult in ADLs? | Barthel Index or Katz | Lawton IADL if the question extends to community living |
| Is this stroke survivor's UE recovering? | Fugl-Meyer UE | ARAT or Box and Block for activity-level performance |
| Is this older adult at risk of falling? | TUG plus 30-Second Chair Stand (CDC STEADI) | Berg Balance or Tinetti POMA for greater sensitivity |
| Does this client have cognitive impairment? | MoCA | ACLS or Cognistat for occupational-cognition profile |
| Does this child have a sensory processing difference? | Sensory Profile 2 | SPM-2 if school context is the question |
| What is this preschooler's developmental motor level? | PDMS-2 or Bayley-4 | BOT-2/BOT-3 once over age 4 |
| Is this client safe to return to work? | FCE (WorkWell, Matheson, or ErgoScience) | DASH or QuickDASH for self-reported UE disability |
One last principle: triangulate. A standardized motor score, a self-reported function score, and a direct ADL observation tell you more together than any one alone.
OTs use four broad categories: occupation-based measures like the COPM and AMPS; performance-skill assessments like the Fugl-Meyer and Box and Block; client-factor assessments like the MoCA, Sensory Profile 2, PHQ-9, goniometry, and manual muscle testing; and environmental assessments like the Home FAST. A complete evaluation usually combines at least two categories, anchored by the OTPF-4 occupational profile.
This is a common search query, but it conflates two different concepts and is worth unpacking. OTPF-4 does not list "7 occupational needs." It lists 9 areas of occupation: activities of daily living (ADLs), instrumental activities of daily living (IADLs), health management, rest and sleep, education, work, play, leisure, and social participation. See the AOTA occupations and everyday activities reference.
"7 needs" sometimes appears in older textbooks or in informal lists that combine ADLs and IADLs into one category and drop health management. Use the current OTPF-4 nine-area list when you are documenting or teaching.
There is no official "5 assessment tools" list, but the five most frequently named in U.S. OT practice are:
An assessment is any structured tool you use - interview, observation, standardized test, or questionnaire - to gather data during an OT evaluation. The OTPF-4 frames assessments as part of the "analysis of occupational performance" that follows the occupational profile. A specific assessment answers a specific clinical question; the full evaluation integrates assessments, observation, and clinical reasoning into a plan of care.
Yes. Both were developed by Spitzer, Williams, and Kroenke with support from Pfizer, and Pfizer placed them in the public domain. No permission is required to reproduce, translate, or use them clinically. The University of Washington National HIV Curriculum hosts the canonical scoring guide.
Yes. The MoCA developer required certification beginning September 2019, with full access enforcement starting September 2020. Certification is one online training (roughly one hour) and currently costs about $125 as a one-time fee for lifelong certification. It covers the standard MoCA, MoCA-Basic, MoCA-Blind, and MoCA-XpressO. See the Shirley Ryan AbilityLab MoCA entry for context.
The assessment itself is bundled into the evaluation CPT code (97165, 97166, or 97167). You do not bill the COPM or the MoCA on top. The complexity of the assessment battery is one factor that helps you select the right evaluation code. The AAPC reference for CPT 97165 lays out the documentation requirements.