Here’s a summary of the current page:
- **Study Findings**: Large language models (LLMs) such as GPT-4 and GPT-3.5 perform poorly at medical coding, with accuracy below 50%.
- **Best Performer**: GPT-4 had the highest exact match rates for medical codes but still produced a significant number of errors.
- **Potential Applications**: LLMs could automate medical code assignment for healthcare reimbursement and research, but require further refinement.
- **Future Research**: The team at Icahn School of Medicine plans to develop tailored LLM tools for accurate medical data extraction and billing code assignment.