DoctorGLM

基于ChatGLM-6B的中文问诊模型

Generate Convert Improve

Install / Use

/learn @xionghonglin/DoctorGLM

About this skill

Quality Score

0/100

README

<img src="imgs/logo.png" width=400px/> <img src="https://img.shields.io/badge/Version-0.0.3--alpha-brightgreen"> <a href="https://xionghonglin.github.io/DoctorGLM/">[Project Page]</a> <a href="https://arxiv.org/abs/2304.01097">[Arxiv]</a> <a href="https://zhuanlan.zhihu.com/p/622649076">[最新博客]</a> <a href="https://doctorglm.idealab-llms.com">[在线体验]</a>

DoctorGLM

基于 ChatGLM-6B的中文问诊模型

训练数据

| Dataset | Department | Language | Q&A | Chat | Number | Syn. | Size | Weight | |------------|--------------------------|----------|------|------|--------|------|-------|-------| | CMD. | Surgical | CN | ✔ | × | 116K | × | 52MB | | | | Obstetrics and Gynecology| CN | ✔ | × | 229K | × | 78MB | | | | Pediatrics | CN | ✔ | × | 117K | × | 47MB | | | | Internal Medicine | CN | ✔ | × | 307K | × | 102MB | | | | Andriatria | CN | ✔ | × | 113K | × | 44MB | | | | Merged | CN | ✔ | × | 1.9M | × | |Doctor_GLM/ckpt| | MedDialog | Multiple | CN&EN | ✔ | ✔ | 3.4M | × | 1.5GB |ptuning_weight | | ChatDoctor | Multiple | EN | ✔ | × | 5.4K | ✔ | 2.9MB |Coming soon | | HearlthcareMagic| Multiple | EN | ✔ | × | 200K | × | 216MB |Coming soon |

https://github.com/Toyhom/Chinese-medical-dialogue-data

使用

lora

显存 >= 13G （未量化版本）
pip install deep_training cpm_kernels icetk transformers>=4.26.1
torch >= 1.12.0 (icetk依赖cpu版torch, 建议先安装icetk后安装gpu版torch)
lora的finetune代码来自 https://github.com/ssbuild/chatglm_finetuning

对于fp16模型，直接使用Doctor_GLM/chat_lora.ipynb，由于官方更新了chatglm的权重，我们将老版权重放在了 old_pretrain_model 可以下载后解压到old_pretrain_model目录

量化的模型我们打了个包，使用方便，但是效果目前来看很成问题：INT4需要大约6G显存，INT8需要大约8G显存，在Doctor_GLM/chat_lora_quant.ipynb下使用

from load_quantization import load_int
tokenizer, model = load_int('DoctorGLM-6B-INT8-6merge-int8.pt',8)
response, history = model.chat(tokenizer,
                               "我爷爷高血压可以喝咖啡吗",
                               history=[],
                               max_length=2048)
print(response)

模型下载链接： INT4 INT8 量化方法均为分层的线性量化。目前量化模型的性能仍有较大问题，后期我们会对量化方法和模型进行更新

p-tuningv2

官方提供了p-tuningv2的实现，新版本权重可以在hugging face上下载，也可以从我们的链接下载 pretrain_model
p-tuningv2的权重在 ptuning_weight ，下载后解压到ckpt/ptuningv2目录下, 然后使用Doctor_GLM/chat_ptuning_v2.ipynb，根据需要调整quantization_bit为4或8

模型在线部署

为了方便部署并随时调整模型生成回答时的参数，我们提供了基于 Gradio 库的部署代码，路径为 Doctor_GLM/gradio.ipynb。运行之后，访问本机的7860或者代码声明的其他端口即可以运行Demo，模型在生成回答时的参数可以由用户自由调控。若想让部署的模型可以被局域网之外的其他用户访问，需要将sharing设置为 True（默认为False）。部署之后运行效果如下所示：

即将到来的更新

[ ] <img src="https://img.shields.io/badge/Version-0.0.4--alpha-brightgreen"> (2023.4.21) 对话中加入参考文献，模型上传到huggingface

第一次运行会下载chatGLM-6B权重, 如果已有chatGLM-6B权重可以将data_utils.py里的路径修改为自己的权重目录

结果示例

<img src="imgs/3_ret.png" width=1300px/> 我们随机跑了100个结果，在 ./results目录下，两份json文件分别为由ChatGLM, DoctorGLM得到的结果，目前存在大量复读机。

引用

@article{xiong2023doctorglm,
  title={Doctorglm: Fine-tuning your chinese doctor is not a herculean task},
  author={Xiong, Honglin and Wang, Sheng and Zhu, Yitao and Zhao, Zihao and Liu, Yuxiao and Wang, Qian and Shen, Dinggang},
  journal={arXiv preprint arXiv:2304.01097},
  year={2023}
}

Related Skills

node-connect

353.3k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

111.7k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

353.3k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

353.3k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。

xionghonglin

View profile

View on GitHub

GitHub Stars836

CategoryDevelopment

Updated3d ago

Forks88

xionghonglin/DoctorGLM

Languages

Python

Security Score

75/100

Audited on Apr 6, 2026

No findings

DoctorGLM

Install / Use

README

DoctorGLM

最近更新

训练数据

使用

lora

p-tuningv2

模型在线部署

最近更新

即将到来的更新

结果示例

引用

Related Skills