Occupational Therapy Assessments: The 2026 Complete Guide

If you have ever stared at a stack of evaluation forms and wondered which standardized tool will actually answer your clinical question, you are in the right place. This guide walks you through the assessments OTs use most, organized the way the Occupational Therapy Practice Framework, 4th Edition (OTPF-4) structures the evaluation itself: occupational profile first, then a focused analysis of occupational performance.

We will cover the four categories of assessments, the most-used standardized tools by domain (with links to the developer or to peer-reviewed psychometrics on the Shirley Ryan AbilityLab Rehabilitation Measures Database), telehealth-friendly options, current CMS billing context for 2026, free vs paid choices, and a decision aid you can keep next to your desk.

What Is an Occupational Therapy Assessment?

An occupational therapy assessment is a structured, evidence-informed process you use to figure out what a person wants and needs to do, what they can do right now, and what is getting in the way. The American Occupational Therapy Association (AOTA) defines evaluation as "obtaining and interpreting data necessary for intervention." Assessment is the toolkit you use to gather that data.

Two things matter here. First, "assessment" is not the same as "evaluation." Evaluation is the entire process. Assessments are the specific instruments (interviews, observations, standardized tests) you choose along the way. Second, every assessment you pick should map back to a clinical question. If it does not, it is paperwork, not practice.

The framing comes from OTPF-4 and from the WHO International Classification of Functioning, Disability and Health (ICF): occupation sits at the intersection of body functions, activity, participation, and environment. A good assessment battery hits multiple levels.

One more clarifying note. The World Federation of Occupational Therapists (WFOT) ties OT's identity to the use of occupation as both the means and the end of intervention. That has practical consequences for assessment: a test that measures grip strength tells you something about an impairment, but only an occupation-grounded measure tells you whether the person can open a jar of pasta sauce, button a child's coat, or hold a pen long enough to finish a shift report. Both kinds of data belong in your eval; the second kind is what makes the eval occupational therapy and not generic rehabilitation.

If you are an OT student or new grad preparing for boards through NBCOT, this distinction shows up everywhere on the exam. The "right answer" almost always involves an assessment that captures occupational performance, not just an isolated body function.

The OT Evaluation Process

The OTPF-4 splits evaluation into two steps. You start with an occupational profile (the client's history, patterns, values, and what they want from therapy), then move to an analysis of occupational performance (the standardized testing and observation that pinpoints supports and barriers). The AOTA evaluation and assessment hub is the canonical reference.

For Medicare Part B and most commercial payers in the United States, you bill an OT evaluation using one of three CPT codes based on complexity:

CPT 97165 - low complexity. One to three performance deficits. Typically 30 minutes face-to-face.
CPT 97166 - moderate complexity. Three to five performance deficits. Typically 45 minutes face-to-face.
CPT 97167 - high complexity. Five or more performance deficits requiring complex clinical reasoning. Typically 60 minutes face-to-face.

CPT 97168 is the re-evaluation code. All four are untimed and billed once per episode. For the official CMS guidance on outpatient OT and PT billing rules, see CMS Therapy Services.

The KX modifier threshold for calendar year 2026 is $2,480 for occupational therapy services, separate from the combined PT and SLP threshold (also $2,480). Once a beneficiary's incurred expenses pass that amount in the year, you append the KX modifier to attest that continued services are medically necessary. The targeted medical review threshold is $3,000 and is statutorily frozen at that level through calendar year 2027, so it does not adjust annually the way the KX threshold does. The APTA payment-thresholds page explains the mechanics, which apply equally to OT.

The KX modifier is not a permission slip. It is a clinician's attestation. Documentation has to support medical necessity for every claim above the threshold.

A note on selecting your evaluation complexity level: the difference between 97165 and 97167 is not just minutes spent. It is the number of performance deficits you identified, the complexity of the clinical reasoning, and whether the client's history is straightforward or layered with comorbidities. CMS expects your documentation to support whichever level you bill. Auditors look at the occupational profile, the assessments administered, the deficits enumerated, and the clinical reasoning narrative. If those four elements line up with the code's definition, the claim holds.

The 4 Categories of OT Assessments

If you only remember one taxonomy, make it this one. It maps cleanly onto the OTPF-4 domain and gives you a way to scan a fresh referral and decide what to grab.

Category	What it measures	Example tools
Occupation-based	Performance and satisfaction in the actual occupations the client cares about	COPM, AMPS, OT-PAL
Performance skills	Motor, process, and social interaction skills observed during task performance	Fugl-Meyer, Box and Block, 9-Hole Peg, Jebsen-Taylor, ARAT
Client factors	Body functions and structures (cognition, sensation, ROM, strength, mood)	MoCA, MMSE, Sensory Profile 2, PHQ-9, GAD-7, goniometry
Environment and context	Physical, social, cultural, and virtual context shaping performance	Home FAST, HOME, SAFER-HOME, Craig Hospital CHIEF

A good battery for any client touches at least two of these. Picking only client-factor tests is a common trap; it produces a reductionist picture that misses the actual occupational question. AOTA's Quality Toolkit reinforces this with outcome-driven measurement recommendations.

The Most-Used Standardized Assessments by Domain

Below are the tools you will see most often in U.S. clinical practice, organized by the question they answer. Each entry tells you what it measures, who it is for, whether it is free or paid, and where to find the canonical source.

ADL and IADL

Canadian Occupational Performance Measure (COPM) - a semi-structured interview where the client identifies and rates performance and satisfaction across self-care, productivity, and leisure on a 1-10 scale. Excellent for goal-setting and detecting change. Paid (manual purchase required); psychometrics summarized at Shirley Ryan AbilityLab.
Assessment of Motor and Process Skills (AMPS) - observational assessment of ADL performance quality across 16 motor and 20 process skill items. Requires therapist certification (around $795-$995). Gold-standard for occupation-based observation.
Functional Independence Measure (FIM) - 18-item, 7-point ordinal scale used in inpatient rehab. Now largely transitioned to the Section GG quality measure in inpatient rehab facilities under CMS rules.
AM-PAC "6-Clicks" - short-form acute-care functional measure. The two most-used inpatient short forms are Basic Mobility and Daily Activity; an Applied Cognitive short form is also available. Free for academic research; licensed (paid) for clinical use through CREcare at Boston University.
Barthel Index - 10-item ADL scale, well-validated for stroke and geriatric populations. Copyrighted by the Maryland State Medical Society; free for non-commercial clinical use with citation.
Lawton IADL Scale - 8-item IADL screen for older adults. Originally developed for caregiver/observer report (preferred for cognitively impaired clients); self-report is common but has reduced validity. Free for clinical use. Validation in Spanish-speaking older adults at PMC.
Katz Index of ADL - 6-item dichotomous ADL screen (bathing, dressing, toileting, transferring, continence, feeding). Free, brief, and useful when the question is "independent or not."

Motor Function and Upper Extremity

Fugl-Meyer Assessment (FMA) - stroke-specific impairment measure with excellent interrater reliability (ICC 0.96-0.98 depending on subscale and study). The upper-extremity subscale ranges 0-66. Free.
Action Research Arm Test (ARAT) - 19-item observational measure of grasp, grip, pinch, and gross arm movement. Free.
Box and Block Test (BBT) - bilateral gross manual dexterity test (60 seconds per hand). Cheap to build, fast to administer, sensitive to change.
Nine-Hole Peg Test (9-HPT) - timed finger-dexterity test widely used in MS, stroke, and Parkinson's disease. The official instructions PDF is free.
Jebsen-Taylor Hand Function Test - seven timed subtests simulating ADLs (writing, page turning, lifting). Reported Cronbach's alpha 0.96 dominant and 0.92 non-dominant in chronic stroke. Kit pricing varies by vendor; many clinics build their own from readily available items.
Goniometry and manual muscle testing - bedrock client-factor measures. The OTPF-4 domain classifies these as body-function assessments and they remain reimbursable on the eval.

Balance and Fall Risk

Berg Balance Scale (BBS) - 14-item, 0-56 scale assessing both static and dynamic balance and widely treated as the gold standard for functional balance assessment. Cutoffs: 0-20 high fall risk, 21-40 medium, 41-56 low.
Timed Up and Go (TUG) - patient rises from a chair, walks 10 feet, turns, returns, sits. The CDC STEADI initiative flags ≥12 seconds as increased fall risk in older community-dwelling adults.
30-Second Chair Stand - CDC STEADI-recommended lower-extremity strength and fall-risk screen. Free, 30 seconds, no equipment beyond a chair.
Tinetti Performance-Oriented Mobility Assessment (POMA) - combined balance (9 items) and gait (7 items) score out of 28. Score ≤18 indicates high fall risk.

Cognition

Montreal Cognitive Assessment (MoCA) - 30-point screen covering attention, executive function, memory, language, visuospatial, abstract reasoning, and orientation. More sensitive than the MMSE for mild cognitive impairment (Nasreddine et al. 2005). Requires certification through the official MoCA website to administer (approximately $125, one-time; required as of September 2019 with full access enforcement beginning September 2020).
Mini-Mental State Examination (MMSE) - 30-point cognitive screen developed by Folstein in 1975. Less sensitive to executive function than the MoCA. No longer in the public domain; commercial use requires a license.
Allen Cognitive Level Screen (ACLS) - leather-lacing task that estimates cognitive function on Allen's six-level scale. Common in psychiatric and dementia settings. One Korean validation study (Park and Lee 2020) reported strong convergent validity for the ACLS-5 with the K-MMSE (r = 0.778).
Cognistat - assesses consciousness, orientation, attention, language, construction, memory, calculation, and executive skills. Produces a domain-specific profile rather than a single score; sensitivity comparisons to the MMSE vary by population.

Sensory Processing

Sensory Profile 2 (SP-2) - caregiver- and teacher-reported sensory processing patterns from birth through 14 years, 11 months. Built on Dunn's four-quadrant model (Seeking, Avoiding, Sensitivity, Registration). See psychometric work in autism populations at Journal of Autism and Developmental Disorders.
Sensory Integration and Praxis Tests (SIPT) - 17-test battery for ages 4 through 8 years, 11 months. Requires advanced certification. Standard for ASI-trained pediatric OTs.
Sensory Processing Measure, Second Edition (SPM-2) - covers infants through adults across home, school, and community contexts. A validation comparison with the SP-2 is freely available at PMC.

Pediatric Development

Bayley Scales of Infant and Toddler Development, 4th Edition (Bayley-4) - cognitive, language, motor, social-emotional, and adaptive behavior, normed for 16 days through 42 months. Norm-referenced and widely used in early intervention.
Peabody Developmental Motor Scales, 2nd Edition (PDMS-2) - six subtests covering reflexes, stationary, locomotion, object manipulation, grasping, and visual-motor integration, birth through 5 years.
Pediatric Evaluation of Disability Inventory (PEDI) and PEDI-CAT - parent-report on self-care, mobility, social function, and responsibility across birth-20 years. The PEDI-CAT uses item response theory for adaptive testing.
Bruininks-Oseretsky Test of Motor Proficiency (BOT-3, released 2024; BOT-2 is the predecessor still in clinical use) - eight subtests of fine and gross motor proficiency. BOT-3 norms now extend the age range to 4 years through 25 years, 11 months.
Movement Assessment Battery for Children, 2nd Edition (MABC-2) - manual dexterity, ball skills, and balance across ages 3-16, with an observation checklist for ages 5-12.

Psychosocial and Mental Health

Patient Health Questionnaire-9 (PHQ-9) - 9-item depression severity screen scored 0-27, derived from DSM criteria. Free, public domain, takes about 2 minutes. The full instrument and scoring are available through the University of Washington National HIV Curriculum.
Generalized Anxiety Disorder-7 (GAD-7) - 7-item anxiety severity screen scored 0-21. Free, public domain, takes under 2 minutes. See the University of Washington reference page.
COPM doubles as a psychosocial assessment because it captures perceived performance and satisfaction in roles the client values.
Role Checklist v3 (RCv3) - free self-report on past, present, and future occupational roles. Useful in mental health settings.

Work and Vocational

Functional Capacity Evaluation (FCE) - umbrella term for multi-hour, multi-task performance batteries used to determine return-to-work capacity. Common protocols include WorkWell, Matheson, and ErgoScience PWPE.
Work Environment Impact Scale (WEIS) - semi-structured interview assessing how the work environment affects performance.
DASH and QuickDASH - patient-reported upper-extremity disability measures, free for clinical use. The original DASH was developed by Hudak, Amadio, and Bombardier (American Journal of Industrial Medicine, 1996; PMID 8773720); the QuickDASH was validated by Beaton and colleagues in 2005.

Telehealth-Friendly OT Assessments

Telehealth flexibilities that started during the COVID-19 public health emergency have largely been extended. Under the Consolidated Appropriations Act of 2026, OTs remain eligible as distant-site telehealth providers through December 31, 2027, with no geographic restrictions and audio-only permitted when clinically appropriate. The authoritative reference is the CMS List of Telehealth Services for the current calendar year.

Not every assessment translates cleanly to a video visit. The ones that do best are interview-based or rely on simple movements the client can perform in their own space with a camera and a household item or two. Strong telehealth options include:

COPM - interview only, works fine over video.
PHQ-9, GAD-7 - either administered verbally or scored from a secure intake form.
Berg Balance Scale - feasible with caregiver support and a webcam, with caveats about safety.
30-Second Chair Stand - high-yield with a stable chair and a clear view.
MoCA - administer in-person where possible; published video-administration studies exist but interrater reliability is lower than face-to-face. Document the modality and any modifications. Psychometrics summary at the RehabMeasures entry.
Sensory Profile 2 - caregiver-completed, no clinical observation required.
PEDI-CAT - parent-report, fully remote-friendly.

For upper-extremity motor assessments like the Fugl-Meyer, remote-administration studies have emerged but interrater reliability is lower than in-person. Document the modification and use clinical judgment.

Two practical tips for video evaluations. First, build a "telehealth eval kit" worksheet that lists the household objects the client should have within reach (chair, full water bottle for grip resistance, ruler for goniometry estimation, etc.) and email it before the visit. Second, when an assessment has both an in-person and a published telehealth version, name the version you administered in your note. "Berg Balance Scale (video-administered, caregiver supervision)" reads differently to a reviewer than a generic "Berg Balance Scale" entry, and it protects you if a fall happens later.

Billing and Documentation Notes

Three things to keep straight for U.S. Medicare and most commercial payers:

The OT evaluation is untimed. 97165/97166/97167 are billed once per episode regardless of how long the eval actually took. Pick the complexity level that matches the deficits you documented.
The 8-minute rule applies to treatment, not the evaluation. Timed treatment codes (97110, 97530, 97535, etc.) follow CMS's substantial portion rule. The eval codes do not.
The KX modifier kicks in at $2,480 in 2026. Once the beneficiary's OT services pass that threshold for the calendar year, append KX to additional claims. The CMS Therapy Services page is the canonical source.

Your eval note has to map back to the OTPF-4 framework. AOTA's Quality Toolkit includes documentation guidance and outcome-measure templates. Most denials we see are not about the test you chose; they are about a thin occupational profile or missing tie-back from the assessment finding to the functional goal.

Free vs Paid Assessments

A clinician with a tight budget can build a credible adult-rehab battery using almost entirely free instruments. Here is the lay of the land.

Free or public domain	Paid (manual or kit required)
Barthel Index (free for non-commercial clinical use with citation), Katz ADL, Lawton IADL, PHQ-9, GAD-7, Berg Balance, TUG, 30-Second Chair Stand, Tinetti POMA, Fugl-Meyer, ARAT, Box and Block, 9-Hole Peg, Role Checklist v3, Home FAST	COPM, AMPS (certification required), AM-PAC (licensed for clinical use via CREcare), MoCA (certification approx. $125), Sensory Profile 2, SIPT, Bayley-4, PDMS-2, BOT-3 (and predecessor BOT-2), MABC-2, Jebsen-Taylor (kit), MMSE (commercial license), most FCE protocols

Before you spend departmental money on a paid assessment, ask four questions: Does it answer a clinical question your free tools cannot? Are the norms current and relevant to your population? Will reimbursement actually change based on the score? Can the rest of the interdisciplinary team interpret the result?

If the answer is yes to two or more, buy it. If not, the free instrument is almost certainly fine.

One more consideration that does not show up on the spreadsheet: clinical credibility with the interdisciplinary team. In acute care, the rehab team is often using the AM-PAC "6-Clicks" to make discharge recommendations. If you walk in with a different ADL measure, your data will not feed into the team's huddle. Match the local norms when you can; deviate when the clinical question genuinely demands it.

Psychometrics in 5 Minutes

You do not need a stats PhD to read an assessment manual. You need four words.

Reliability - does the test give consistent results across raters and across time? Look for interrater ICC and test-retest ICC above 0.75 for clinical decision-making.
Validity - does the test measure what it claims to measure? Look for construct validity (does it correlate with other measures of the same construct?) and predictive validity (does it forecast outcomes that matter?).
Minimal Detectable Change (MDC) - the smallest score change that exceeds measurement error. If a client's Box and Block score improved by 4 blocks but the MDC is 6, that change is statistical noise.
Minimal Clinically Important Difference (MCID) - the smallest change a patient perceives as meaningful. MCID is usually larger than MDC and is what payers and patients actually care about.

The Shirley Ryan AbilityLab Rehabilitation Measures Database publishes MDC and MCID values for hundreds of instruments, organized by population. Bookmark it.

Pick assessments with published MDC and MCID values for your patient's diagnosis. If those numbers do not exist, you cannot reliably say whether your intervention helped.

A worked example: a chronic-stroke patient's Box and Block score moves from 22 to 31 blocks per minute over six weeks. The published MDC for chronic stroke is roughly 5.5 blocks per minute (Chen et al. 2009); the change of 9 blocks exceeds it, so it is real. MCID estimates for the BBT in stroke vary in the literature and appear to scale with baseline severity, but a 9-block change comfortably exceeds the commonly cited 6-block estimate, so it is also likely to feel meaningful to the patient. Now you have a paragraph for your progress note that maps a number to a story.

The flip side: a moderate-stroke patient's Fugl-Meyer UE score moves from 38 to 41 over four weeks. The MDC in chronic stroke is around 5 points. Your 3-point change has not cleared measurement error. The honest interpretation is "stable" and the honest action is to revisit dosage, modality, or the goal itself rather than report a win.

How to Choose the Right Assessment

Start from the clinical question, not the instrument. The decision aid below maps common questions to the tools most clinicians reach for first.

Clinical question	First-line assessment	When to add more
What does this client want from therapy?	COPM	Role Checklist v3 if occupational identity is unclear
How independent is this older adult in ADLs?	Barthel Index or Katz	Lawton IADL if the question extends to community living
Is this stroke survivor's UE recovering?	Fugl-Meyer UE	ARAT or Box and Block for activity-level performance
Is this older adult at risk of falling?	TUG plus 30-Second Chair Stand (CDC STEADI)	Berg Balance or Tinetti POMA for greater sensitivity
Does this client have cognitive impairment?	MoCA	ACLS or Cognistat for occupational-cognition profile
Does this child have a sensory processing difference?	Sensory Profile 2	SPM-2 if school context is the question
What is this preschooler's developmental motor level?	PDMS-2 or Bayley-4	BOT-2/BOT-3 once over age 4
Is this client safe to return to work?	FCE (WorkWell, Matheson, or ErgoScience)	DASH or QuickDASH for self-reported UE disability

One last principle: triangulate. A standardized motor score, a self-reported function score, and a direct ADL observation tell you more together than any one alone.

Frequently Asked Questions

What Type of Assessments Do Occupational Therapists Use?

OTs use four broad categories: occupation-based measures like the COPM and AMPS; performance-skill assessments like the Fugl-Meyer and Box and Block; client-factor assessments like the MoCA, Sensory Profile 2, PHQ-9, goniometry, and manual muscle testing; and environmental assessments like the Home FAST. A complete evaluation usually combines at least two categories, anchored by the OTPF-4 occupational profile.

What Are the 7 Occupational Needs?

This is a common search query, but it conflates two different concepts and is worth unpacking. OTPF-4 does not list "7 occupational needs." It lists 9 areas of occupation: activities of daily living (ADLs), instrumental activities of daily living (IADLs), health management, rest and sleep, education, work, play, leisure, and social participation. See the AOTA occupations and everyday activities reference.

"7 needs" sometimes appears in older textbooks or in informal lists that combine ADLs and IADLs into one category and drop health management. Use the current OTPF-4 nine-area list when you are documenting or teaching.

What Are the 5 Assessment Tools?

There is no official "5 assessment tools" list, but the five most frequently named in U.S. OT practice are:

Canadian Occupational Performance Measure (COPM)
Functional Independence Measure (FIM) or AM-PAC in acute care
Montreal Cognitive Assessment (MoCA)
Berg Balance Scale
Sensory Profile 2 (pediatric) or Fugl-Meyer (adult neuro)

What Is an Assessment in Occupational Therapy?

An assessment is any structured tool you use - interview, observation, standardized test, or questionnaire - to gather data during an OT evaluation. The OTPF-4 frames assessments as part of the "analysis of occupational performance" that follows the occupational profile. A specific assessment answers a specific clinical question; the full evaluation integrates assessments, observation, and clinical reasoning into a plan of care.

Are PHQ-9 and GAD-7 Really Free for Clinical Use?

Yes. Both were developed by Spitzer, Williams, and Kroenke with support from Pfizer, and Pfizer placed them in the public domain. No permission is required to reproduce, translate, or use them clinically. The University of Washington National HIV Curriculum hosts the canonical scoring guide.

Do I Need Certification to Administer the MoCA?

Yes. The MoCA developer required certification beginning September 2019, with full access enforcement starting September 2020. Certification is one online training (roughly one hour) and currently costs about $125 as a one-time fee for lifelong certification. It covers the standard MoCA, MoCA-Basic, MoCA-Blind, and MoCA-XpressO. See the Shirley Ryan AbilityLab MoCA entry for context.

Which OT Assessments Are Reimbursable Separately by Medicare?

The assessment itself is bundled into the evaluation CPT code (97165, 97166, or 97167). You do not bill the COPM or the MoCA on top. The complexity of the assessment battery is one factor that helps you select the right evaluation code. The AAPC reference for CPT 97165 lays out the documentation requirements.

Key Takeaways

Start every OT evaluation with the OTPF-4 occupational profile, then choose assessments that answer the specific clinical question you wrote down.
Cover at least two of the four categories: occupation-based, performance skills, client factors, and environment. A pure body-function battery misses the point.
For most adult-rehab questions, a free instrument exists that does the job. Spend on paid tools only when the clinical or reimbursement case is clear.
Use the Shirley Ryan AbilityLab Rehabilitation Measures Database as your psychometrics reference. It is free and updated.
Bill the right evaluation code based on documented deficits (97165, 97166, or 97167), watch the 2026 KX threshold of $2,480, and document the link from each assessment finding to a functional goal.
Telehealth-friendly options now have a permanent home in your toolkit. Interview-based and caregiver-report instruments translate best; document any modifications to in-person standardized administration.
The OTPF-4 lists 9 areas of occupation, not 7 needs. Use the current language.

Occupational Therapy Assessments