Methodology Document · Draft for project team review
AI Competency Assessment for Thai Hospital Personnel
Framework, test design, generation pipeline, and item-level maps — built by a hybrid human–AI method grounded in real Thai hospital AI tools.
Overview
The shape of the assessment
One framework and three audience-specific tests, expanded by the generation pipeline into 462 item variants in total so that different cohorts can sit different variants (test security).
Framework · Two axes
Construct × Depth
Knowledge → MCQ. Recognise what is going on / what to do. ~30–90s.
Skill → open task. Produce a response, scored on a rubric. ~3–5 min.
Foundational — grasp what's going on day-to-day.
Applied — bring it to bear on one concrete case.
Advanced — set the standing rule / policy / strategy.
Depth measures reasoning, not seniority — a junior can show Applied; a senior may show only Foundational.
Framework · The five competencies
C1–C5
Framework · Construct mapping
How each competency is tested
The format always follows the construct it measures — recognition is asked as MCQ, production as an open task.
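The construct-to-format rule can be sketched as a small lookup. This is an illustrative sketch only: the names `Construct`, `ItemFormat`, and `format_for` are hypothetical and not taken from the project's code, and "answer key" as the MCQ scoring method is an assumption (the framework only states that open tasks are rubric-scored).

```python
from dataclasses import dataclass
from enum import Enum


class Construct(Enum):
    KNOWLEDGE = "knowledge"  # recognition: what is going on / what to do
    SKILL = "skill"          # production: write a response


@dataclass(frozen=True)
class ItemFormat:
    name: str
    scored_by: str
    typical_time: str


# Hypothetical lookup mirroring the framework text above.
FORMAT_BY_CONSTRUCT = {
    # "answer key" is an assumption; the source does not say how MCQs are scored.
    Construct.KNOWLEDGE: ItemFormat("MCQ", "answer key", "~30-90 s"),
    Construct.SKILL: ItemFormat("open task", "rubric", "~3-5 min"),
}


def format_for(construct: Construct) -> ItemFormat:
    """Return the item format that the given construct is tested with."""
    return FORMAT_BY_CONSTRUCT[construct]
```

Keeping the mapping in one table means an item's format is derived from its construct rather than chosen per item.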
Framework · Operator vs governor
Two parallel tracks
Individual — the operator
Hands-on AI use by contributors. C2 / C5 progress by the complexity of one's own AI use.
- C2 — discrete tasks → multi-step workflows → ambitious orchestration
- C5 — seeks new tools → experiments → adapts as tech changes
Executive — the governor
Strategic AI deployment across the organisation. Never measured on operating a tool.
- C2 — efficiency → defensible advantage → transformation & sourcing
- C5 — keeps the team current → sets the org's staying-current strategy
Shared floor: C1 · C3 · C4 Foundational are word-for-word identical across both tracks.
Audiences
Three audiences, three tests
Non-clinical staff
Foundational depth, all multiple-choice. 5 role variants from one 9-item master.
Clinical professionals
Foundational + Applied, integrated MCQs across competencies. 7 roles, 3 tracks.
AI-policy leaders
Part 1 — Applied MCQ (own AI use). Part 2 — Advanced open tasks (org rules & strategy).
Audiences · Medical
Three clinical tracks
Gen-AI prompting
Conversational AI use on personal devices and scribes.
PresScribe
Imaging AI
Interpreting an AI flag against a heatmap before reporting.
Inspectra CXR
Real-time procedural AI
Alert-driven AI during live procedures.
gastroAI-model G
Items 1–3 differ by track; items 4–14 are shared.
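One way to picture the split between track-specific and shared items is the sketch below. The track keys, the ID scheme, and the function name are hypothetical; only the 1–3 / 4–14 split comes from the text above.

```python
# Hypothetical track keys for the three clinical tracks.
TRACKS = ("gen_ai_prompting", "imaging_ai", "realtime_procedural_ai")

TRACK_SPECIFIC = range(1, 4)   # items 1-3 differ by track
SHARED = range(4, 15)          # items 4-14 are shared


def item_ids_for_track(track: str) -> list[str]:
    """Return the 14 item IDs a test-taker on `track` would see, in order."""
    if track not in TRACKS:
        raise ValueError(f"unknown track: {track!r}")
    specific = [f"{track}/item-{n:02d}" for n in TRACK_SPECIFIC]
    shared = [f"shared/item-{n:02d}" for n in SHARED]
    return specific + shared
```

For example, `item_ids_for_track("imaging_ai")` starts with `imaging_ai/item-01` and ends with `shared/item-14`; the last eleven IDs are the same for every track.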
Pipeline
Six-phase build
Phases 1–2 human-led · 3–5 LLM-assisted · 6 assembly. Resumable, checkpointed per (role, item).
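A resumable run with one checkpoint per (role, item) pair could look roughly like the sketch below. It assumes one JSON file per pair on disk; the directory layout, the function names, and the `generate` callback are all hypothetical stand-ins for the project's actual pipeline.

```python
import json
from pathlib import Path
from typing import Callable, Iterable


def checkpoint_path(root: Path, role: str, item_id: str) -> Path:
    """Hypothetical layout: <root>/<role>/<item_id>.json."""
    return root / role / f"{item_id}.json"


def run_with_checkpoints(
    root: Path,
    roles: Iterable[str],
    item_ids: Iterable[str],
    generate: Callable[[str, str], dict],
) -> int:
    """Generate every missing (role, item) pair; return how many were generated."""
    item_ids = list(item_ids)
    generated = 0
    for role in roles:
        for item_id in item_ids:
            path = checkpoint_path(root, role, item_id)
            if path.exists():
                continue  # finished in an earlier run, so skip it
            result = generate(role, item_id)
            path.parent.mkdir(parents=True, exist_ok=True)
            # Write to a temp file first so an interrupted run never
            # leaves a half-written checkpoint behind.
            tmp = path.with_suffix(".tmp")
            tmp.write_text(json.dumps(result, ensure_ascii=False), encoding="utf-8")
            tmp.replace(path)
            generated += 1
    return generated
```

Re-running after an interruption only calls `generate` for pairs that have no checkpoint file yet.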
Pipeline · Grounding
Grounded in real Thai hospital AI
In use
- Inspectra CXR
- PresScribe (on HOSxP)
- gastroAI-model G
- HOSxP DDI alerts (rule-based)
In pilot
- RAMAAI CXR
- LiverSound
- Inspectra MMG
Excluded
- Autonomous diagnosis
- US / global scribes
- Med-recommending CDSS
Scenarios never reference tools that don't exist in Thai practice; updating the catalogue propagates on the next run.
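The grounding rule can be illustrated with a small catalogue check. This is a sketch under assumptions: it supposes each generated scenario carries an explicit list of the tools it mentions, and the names `CATALOGUE`, `catalogue_prompt_block`, and `unknown_tools` are hypothetical. The tool names and statuses are the ones listed above.

```python
# Tool names and statuses as listed in this document.
CATALOGUE = {
    "Inspectra CXR": "in_use",
    "PresScribe": "in_use",
    "gastroAI-model G": "in_use",
    "HOSxP DDI alerts": "in_use",
    "RAMAAI CXR": "in_pilot",
    "LiverSound": "in_pilot",
    "Inspectra MMG": "in_pilot",
}


def catalogue_prompt_block(catalogue: dict[str, str] = CATALOGUE) -> str:
    """Render the catalogue as text that a generation prompt could include."""
    lines = [f"- {name} ({status.replace('_', ' ')})" for name, status in catalogue.items()]
    return "Allowed AI tools:\n" + "\n".join(lines)


def unknown_tools(scenario_tools: list[str], catalogue: dict[str, str] = CATALOGUE) -> list[str]:
    """Return the tools a scenario mentions that are not in the catalogue."""
    return sorted(t for t in set(scenario_tools) if t not in catalogue)
```

Because the prompt text is rendered from the catalogue at run time, editing the catalogue changes what the next run is allowed to reference.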
Item maps
Item maps at a glance
Coverage
What it measures — and doesn't
Measures
- Recognition (knowledge) across C1–C4 in all MCQ tests
- Skill production across C1–C5 at Advanced depth (Executive Part 2)
Doesn't
- C2 / C5 skill production in the MCQ tests, which capture only the recognition layer
- C5 for back-office and medical staff, which is out of scope because it is longitudinal
A directional signal to inform pilots and capability conversations — not a high-stakes psychometric instrument.
Architecture & QA
Quality assurance
Built in
- Expert-reviewed master tests anchor each audience
- Validated Thai AI-tools list, consulted by every prompt
- Per-item checkpoints + tolerant JSON auto-repair
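What "tolerant JSON auto-repair" might involve is sketched below. The specific repairs shown (dropping text around the outermost object and removing trailing commas) are assumptions about common LLM output problems, not a description of the project's actual repair logic, and `parse_tolerant` is a hypothetical name.

```python
import json
import re
from typing import Any


def parse_tolerant(raw: str) -> Any:
    """Parse JSON from model output, applying two simple repairs if needed."""
    text = raw.strip()
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        pass  # fall through to the repair attempt

    # Repair 1 (assumed): keep only the outermost {...}, dropping any
    # surrounding prose or code-fence markers.
    start, end = text.find("{"), text.rfind("}")
    if start == -1 or end <= start:
        raise ValueError("no JSON object found in model output")
    candidate = text[start:end + 1]

    # Repair 2 (assumed): remove trailing commas before } or ].
    # Note: this naive regex would also alter such sequences inside strings.
    candidate = re.sub(r",\s*([}\]])", r"\1", candidate)

    return json.loads(candidate)  # still raises if the text is beyond repair
```

If the repaired text still fails to parse, the error propagates, so a run can leave that (role, item) without a checkpoint and retry it later.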
Audited automatically
- Every item has all N of its variants, each with 4 options
- Valid correct-answer letters; letter-distribution check
- Competency-tag consistency across the bank
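The automatic checks listed above could be expressed along these lines. The bank structure, the field names (`options`, `answer`, `competency`), and the reading of "tag consistency" as "all variants of an item share one competency tag" are assumptions made for illustration; `audit_bank` is a hypothetical name.

```python
from collections import Counter

LETTERS = ("A", "B", "C", "D")


def audit_bank(bank: dict[str, list[dict]], n_variants: int) -> tuple[list[str], Counter]:
    """Check a bank of {item_id: [variant, ...]}.

    Returns a list of problem messages and the count of correct-answer
    letters, which can be inspected for a skewed distribution.
    """
    problems: list[str] = []
    letters: Counter = Counter()
    for item_id, variants in bank.items():
        if len(variants) != n_variants:
            problems.append(f"{item_id}: expected {n_variants} variants, found {len(variants)}")
        # Assumed meaning of tag consistency: one tag per item across variants.
        tags = {str(v.get("competency")) for v in variants}
        if len(tags) > 1:
            problems.append(f"{item_id}: inconsistent competency tags {sorted(tags)}")
        for i, variant in enumerate(variants, start=1):
            if len(variant.get("options", [])) != 4:
                problems.append(f"{item_id} v{i}: expected 4 options")
            answer = variant.get("answer")
            if answer in LETTERS:
                letters[answer] += 1
            else:
                problems.append(f"{item_id} v{i}: invalid answer letter {answer!r}")
    return problems, letters
```

The function only reports letter counts; what counts as an unacceptable skew is left to the caller, since the document does not state a threshold.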
Planned: clinical reviewer sign-off per role, and pilots with 10–20 test-takers per audience.
Limitations
Limitations & pending
Extending measurement of skill production needs task-based components — flagged as future work.
Showcase
Explore the generated test bank
A read-only, bilingual (Thai + English) item browser — every role, item, and variant produced by the pipeline.
Methodology Document · Draft for project team review · Branded with Chulalongkorn University corporate identity.