
Exploring the Role of Knowledge Graph-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs

Yingjian Chen

Feiyang Li

Xingyu Song

Tianxiao Li

Zixin Xu

Xiujie Chen

Issey Sukeda

Irene Li

0 Dokkyo Medical University, Tochigi, Japan
1 Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
2 NEC Laboratories America, Palo Alto, USA
3 The University of Tokyo, Tokyo, Japan


Large language models (LLMs) perform well in medical QA, but their effectiveness in Japanese contexts is limited due to privacy constraints that prevent the use of commercial models like GPT-4 in clinical settings. As a result, recent efforts focus on instruction-tuning open-source LLMs, though the potential of combining them with retrieval-augmented generation (RAG) remains underexplored. To bridge this gap, we are the first to explore a knowledge graph-based (KG) RAG framework for Japanese medical QA with small-scale open-source LLMs. Experimental results show that KG-based RAG has only a limited impact on Japanese medical QA using small-scale open-source LLMs. Further case studies reveal that the effectiveness of RAG is sensitive to the quality and relevance of the externally retrieved content. These findings offer valuable insights into the challenges and potential of applying RAG in Japanese medical QA, while also serving as a reference for other low-resource languages.

Keywords: Japanese Medical Question Answering, Retrieval-Augmented Generation, Small-Scale LLMs, Knowledge Graph

1. Introduction

Large language models (LLMs) have achieved remarkable performance in medical question answering (QA), even demonstrating the ability to pass medical licensing exams (e.g., the United States Medical Licensing Examination, USMLE) [1], which highlights their potential to understand complex medical knowledge. In particular, recent research [2, 3, 4] has explored the use of retrieval-augmented generation (RAG) [5] to incorporate external medical knowledge into LLMs, effectively mitigating "hallucination" issues [6, 7, 8, 9] and further enhancing their applicability in medical QA tasks.

While these advances are promising, most existing efforts [10, 11] focus on English-only scenarios, with limited exploration of multilingual medical QA, particularly in Japanese. Since mainstream LLMs [12, 13] are predominantly trained on English-centric medical data, the significant imbalance in resource distribution [14] limits their effectiveness in Japanese medical QA scenarios. While GPT-4 has demonstrated strong performance on the Japanese NMLE [15], strict privacy regulations prevent its use in clinical settings. As a result, research has shifted toward open-source LLMs, but progress is hindered by the scarcity of high-quality Japanese medical data, which highlights the broader challenges faced by low-resource languages.

To address this limitation, in this paper, we are the first to explore the use of knowledge graph-based RAG for Japanese medical question answering, specifically targeting small-scale LLMs. Given the limited accessibility of external Japanese medical resources, we use the easily accessible external medical knowledge base UMLS [16] to support knowledge retrieval, following the design of MKG-Rank [17]. By applying word-level translation, LLMs can integrate non-Japanese-centric medical knowledge, mitigating the limitations caused by the relative scarcity of Japanese medical data and its access restrictions. Empirical results show that the KG-based RAG has limited effect on Japanese medical QA with small open-source LLMs, largely depending on the quality and relevance of the retrieved KGs from external knowledge bases.

[Figure 1: Knowledge graph-based RAG. Step 1: Entity Extraction by an LLM (e.g., 胸部の締め付け感, 喘息, 嘔吐, 不安症状). Step 2: Knowledge Graph Retrieval from UMLS (e.g., chest tightness, asthma, vomiting, anxiety symptoms). Step 3: Ranking, which re-ranks the retrieved triples by question similarity (Top 1: 0.84, Top 2: 0.76, ..., Top k: 0.64). Step 4: Declarative Conversion (e.g., 胸部絞扼感は喘息の症状である).]

2. Method

2.1. Base Models

Our work focuses on evaluating the performance of small-scale LLMs in Japanese medical QA. Specifically, we select: (1) models with fewer than 5B parameters, including Borea-Phi-3.5-Mini-Instruct-Common [18] and LLaMA-3-ELYZA-JP-8B [19]; (2) models with 5B–10B parameters, including LLM-JP-3-7.2B-Instruct3 [20], Mistral-7B-Instruct-v0.3 [21], Qwen2.5-7B-Instruct [22], and LLaMA-3.1-8B-Instruct [23]; and (3) 10B–32B models, including Qwen2.5-14B-Instruct [24], Phi-4-14B [25], and Gemma-3-12B-it [26]. In addition, we evaluate the commercial LLM GPT-4o-mini [27] as a proprietary baseline.

2.2. Knowledge Graph-based RAG

Our pipeline, as illustrated in Fig. 1, consists of four main steps: (1) Given a medical question, we first use an LLM to extract relevant medical entities ℰ; (2) The extracted entities ℰ are translated into English via word-level translation and used to query the external medical knowledge base UMLS to obtain relevant KG triples; (3) A ranking mechanism is applied to select the most relevant triples based on their semantic relevance to the question; (4) The selected triples are converted into declarative sentences by an LLM and, together with the original question, provided as input to the LLM for answer generation.
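The four steps can be sketched end to end as follows. This is a minimal illustration under loud assumptions, not the implementation used in this work: the LLM calls are replaced by simple stand-ins, `JA_TO_EN` is a hypothetical word-level translation table, `TOY_KG` is a toy in-memory substitute for UMLS with made-up triples, and word overlap (Jaccard) stands in for the semantic relevance score used in ranking.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Triple:
    head: str
    relation: str
    tail: str


# Hypothetical word-level translation table (Japanese -> English).
JA_TO_EN = {"喘息": "asthma", "嘔吐": "vomiting"}

# Toy stand-in for UMLS: English entity -> illustrative triples (not real UMLS data).
TOY_KG = {
    "asthma": [
        Triple("asthma", "has symptom", "chest tightness"),
        Triple("asthma", "is a", "respiratory disease"),
    ],
    "vomiting": [Triple("vomiting", "is symptom of", "gastrointestinal disorder")],
}


def extract_entities(question: str, max_terms: int = 4) -> list[str]:
    """Step 1 stand-in: an LLM would extract entities; here we match known terms."""
    return [term for term in JA_TO_EN if term in question][:max_terms]


def retrieve_triples(entities: list[str]) -> list[Triple]:
    """Step 2: translate each entity word by word, then look it up in the KG."""
    triples: list[Triple] = []
    for entity in entities:
        english = JA_TO_EN.get(entity)
        triples.extend(TOY_KG.get(english, []))
    return triples


def rank_triples(query_terms: set[str], triples: list[Triple], top_k: int = 2) -> list[Triple]:
    """Step 3 stand-in: score by word overlap instead of embedding similarity."""
    def score(triple: Triple) -> float:
        words = set(f"{triple.head} {triple.relation} {triple.tail}".split())
        return len(words & query_terms) / len(words | query_terms)

    return sorted(triples, key=score, reverse=True)[:top_k]


def to_declarative(triple: Triple) -> str:
    """Step 4 stand-in: a template replaces the LLM-based conversion."""
    return f"{triple.head} {triple.relation} {triple.tail}."


def build_answer_prompt(question: str) -> str:
    """Run steps 1-4 and assemble the input for answer generation."""
    entities = extract_entities(question)
    triples = retrieve_triples(entities)
    query_terms = {JA_TO_EN[entity] for entity in entities}
    top = rank_triples(query_terms, triples)
    knowledge = "\n".join(
        f"{i}. {to_declarative(t)}" for i, t in enumerate(top, start=1)
    )
    return f"Question: {question}\nBackground knowledge:\n{knowledge}\nAnswer:"


prompt = build_answer_prompt("喘息があり、今夜2回嘔吐しました。")
print(prompt)
```

In a faithful setup, each stand-in would be swapped for the corresponding component (an LLM for steps 1 and 4, a UMLS query for step 2, a semantic ranker for step 3) while the data flow between the steps stays the same.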

Prompts. We provide the prompts used in our knowledge graph-based RAG mechanism, including Medical Entity Extraction, Declarative Conversion and Answer Generation.

3. Experiment

3.1. Datasets

We selected three long-form medical question answering datasets: ExpertQA-Bio, ExpertQA-Med, and LiveQA. ExpertQA is a high-quality QA dataset verified by domain experts, from which we used 96 biological questions (ExpertQA-Bio) and 504 medical questions (ExpertQA-Med) [28]. LiveQA consists of consumer health questions submitted to the National Library of Medicine, comprising 627 training QA pairs and 104 test pairs [29]. All datasets were originally in English and were translated into Japanese using a multi-stage agentic machine translation framework [30]. We provide detailed statistics of the evaluation datasets, including the number of samples and the average word count of questions and answers, as shown in Table 2.

Table 2: Statistics of the evaluation datasets.

Dataset          ExpertQA-Bio  ExpertQA-Med  LiveQA
Size             96            504           627
Question Length  56.7          56.0          118.9
Answer Length    410.7         378.1         438.3

Prompt: Medical Entity Extraction
text: {question}
Please extract at most 4 terms related to medical that you think are the most important from the provided text. Returns the result in the following json form. All the results are merged into one json.
-- Examples of results:
{"medical terminologies" : ["term1", "term2", ...]}
result:
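The Medical Entity Extraction prompt asks the model to return a JSON object keyed by "medical terminologies". A minimal sketch of filling that template and parsing a reply is given below. The function names `build_entity_prompt` and `parse_entities` are hypothetical, and the fallback to an empty list on malformed output is an assumption rather than documented behaviour of the pipeline.

```python
import json

# Prompt text as given in the paper; {question} is the only placeholder.
ENTITY_PROMPT = (
    "text: {question}\n"
    "Please extract at most 4 terms related to medical that you think are "
    "the most important from the provided text. Returns the result in the "
    "following json form. All the results are merged into one json.\n"
    "-- Examples of results:\n"
    '{"medical terminologies" : ["term1", "term2", ...]}\n'
    "result:"
)


def build_entity_prompt(question: str) -> str:
    # str.replace is used instead of str.format because the template
    # itself contains literal JSON braces.
    return ENTITY_PROMPT.replace("{question}", question)


def parse_entities(reply: str, max_terms: int = 4) -> list:
    """Pull the JSON object out of a model reply and return its term list."""
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end <= start:
        return []
    try:
        payload = json.loads(reply[start : end + 1])
    except json.JSONDecodeError:
        return []
    terms = payload.get("medical terminologies", [])
    if not isinstance(terms, list):
        return []
    return [term for term in terms if isinstance(term, str)][:max_terms]


reply = 'result: {"medical terminologies": ["喘息", "嘔吐", "胸部の締め付け感"]}'
print(parse_entities(reply))
```

Capping the list at `max_terms` mirrors the "at most 4 terms" instruction in case the model returns more.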

3.2. Metrics

We adopt ROUGE-L [31] and BERTScore [32] to evaluate the quality of answers generated by LLMs for Japanese medical QA tasks. These metrics provide a comprehensive assessment of both lexical and semantic accuracy.
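To make the lexical metric concrete, ROUGE-L is based on the longest common subsequence (LCS) between a generated answer and the reference. The sketch below computes an LCS-based F1 score. It assumes character-level tokens (a simplification for Japanese, which has no whitespace word boundaries) and an unweighted F1, so it is an illustration and not necessarily the tokenization or weighting used in the reported results.

```python
def lcs_length(a: list, b: list) -> int:
    """Length of the longest common subsequence, using a rolling DP row."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(candidate: str, reference: str) -> float:
    """LCS-based F1 over non-whitespace characters (assumed tokenization)."""
    cand = [ch for ch in candidate if not ch.isspace()]
    ref = [ch for ch in reference if not ch.isspace()]
    if not cand or not ref:
        return 0.0
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)
    recall = lcs / len(ref)
    return 2 * precision * recall / (precision + recall)


print(rouge_l_f1("抗生物質はウイルス感染に無効です", "抗生物質はウイルス感染症に対して効果がありません"))
```

Because the score rewards only in-order overlap, long reference answers paired with shorter generated answers yield low recall, which is one reason absolute ROUGE-L values on long-form QA tend to be small.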

3.3. Results and Analysis

ROUGE-L        BERTScore       ROUGE-L        BERTScore       ROUGE-L        BERTScore
4.33           61.20           4.92           51.54           4.05           59.98
4.77 (+0.44%)  61.32 (+0.12%)  4.82 (-0.10%)  61.39 (-0.15%)  3.97 (-0.08%)  59.71 (-0.28%)

3.4. Case Study

To further investigate the reasons behind the relatively limited impact of RAG, we analyze the intermediate reasoning process, as shown in Table 4. Specifically, in Case 1 (top), the retrieved medical knowledge includes information that is relevant and helpful for answering the question (e.g., "ワルファリンは生理的に凝固因子濃度を低下させる効果がある"). However, we observe that most LLMs already possess this knowledge internally, which explains why RAG offers only marginal improvements in such cases. On the other hand, for Case 2 (bottom), the retrieved medical knowledge does not help answer the question (e.g., "バイ菌感染は、肝膿瘍に分類されます"). This is primarily because the specialized biomedical concepts in UMLS may not align well with broader, more general medical questions. Consequently, the retrieved information provides limited assistance and may even introduce noise that negatively impacts the LLM's reasoning.

Table 4: Case 1 (top) and Case 2 (bottom).

Case 1
Question: ワルファリン（ワーファリン）を服用している人は避けるべき野菜は何ですか？ (Which vegetables should people taking warfarin (Coumadin) avoid?)
Retrieved Medical Knowledge:
1. ワルファリンは生理的に凝固因子濃度を低下させる効果がある。 (Warfarin physiologically reduces the concentration of coagulation factors.)
2. ワルファリンはクマリン系の抗凝固薬である。 (Warfarin is a coumarin-based anticoagulant.)
…
Ground Truth: ワーファリンを服用している人は、ビタミン K を多く含む野菜の摂取を避けるか制限する必要があります。なぜなら、それが薬の血液凝固抑制効果（血液をサラサラにする効果）を妨げる可能性があるからです。… (People taking warfarin should avoid or limit the intake of vegetables that are high in vitamin K, as it may interfere with the medication's blood clotting inhibition effect (the effect of making the blood thinner).)
Generated Answer: ワルファリンを服用している人が避けるべき野菜は、特にビタミン K を多く含むものです。ビタミン K は血液凝固に関与しており、ワルファリンの効果を減少させる可能性があります。… (People taking warfarin should avoid vegetables that are particularly high in vitamin K. Vitamin K is involved in blood clotting and may reduce the effectiveness of warfarin.)

Case 2
Question: 抗生薬は通常、細菌感染または真菌感染のある患者に投与されます。ウイルスが原因であることが多い疾患については、ではどうでしょうか？ (Antibiotics are typically administered to patients with bacterial or fungal infections. But what about diseases that are often caused by viruses?)
Retrieved Medical Knowledge:
1. バイ菌感染は、他のバイ菌性疾患に分類されます。 (Bacterial infections are classified under other bacterial diseases.)
2. バイ菌感染は、肝膿瘍に分類されます。 (Bacterial infections are classified as liver abscesses.)
…
Ground Truth: 抗生物質は、感冒、インフルエンザ、様々な咳などのウイルス感染症に対しては効果がありません。…これは、ウイルスが細胞ではなく、細菌細胞と比較して細胞壁を持たないためです。… (Antibiotics are ineffective against viral infections such as the common cold, influenza, and various types of cough. …This is because viruses do not have a cell wall, unlike bacterial cells. …)
Generated Answer: 抗生物質は主に細菌感染に対して効果がありますが、ウイルス感染に対しては効果がありません。ウイルスが原因の疾患、例えばインフルエンザや風邪などには、抗生物質は無効です。… (Antibiotics are mainly effective against bacterial infections but are ineffective against viral infections. For illnesses caused by viruses, such as influenza or the common cold, antibiotics do not work.)

4. Conclusion

This work presents the first exploration of a knowledge graph-based RAG framework for Japanese medical QA using small-scale open-source LLMs. Empirical findings show that its overall impact is limited, primarily constrained by the quality and relevance of the retrieved content. These insights highlight the challenges and potential of applying RAG to Japanese and other low-resource language medical QA tasks.

Acknowledgment

This work is supported by JST ACT-X (Grant JPMJAX24CU) and JSPS KAKENHI (Grant 24K20832). This work used supercomputers provided by the Research Institute for Information Technology, Kyushu University, through the HPCI System Research Project (Project ID: hp250092). This work is also supported by the NVIDIA Academic Grant Program, Google Cloud (Gemma 3 Academic Program), and the Henan Province International Science and Technology Cooperation Program (Grant 252102521047).

Declaration on Generative AI

In this work, the first author, Yingjian Chen, used ChatGPT and Grammarly for grammar and spelling checks and sentence polishing. After using these tools, Yingjian Chen reviewed and edited the content as needed and takes full responsibility for the publication's content.

[11] H. Yang, H. Chen, H. Guo, Y. Chen, C.-S. Lin, S. Hu, J. Hu, X. Wu, X. Wang, Llm-medqa: Enhancing medical question answering through case studies in large language models, arXiv preprint arXiv:2501.05464 (2024).
[12] A. Hurst, A. Lerer, A. P. Goucher, A. Perelman, A. Ramesh, A. Clark, A. Ostrow, A. Welihinda, A. Hayes, A. Radford, et al., Gpt-4o system card, arXiv preprint arXiv:2410.21276 (2024).
[13] H. Touvron, T. Lavril, G. Izacard, X. Martinet, M.-A. Lachaux, T. Lacroix, B. Rozière, N. Goyal, E. Hambro, F. Azhar, et al., Llama: Open and efficient foundation language models, arXiv preprint arXiv:2302.13971 (2023).
[14] C. Chataigner, A. Taïk, G. Farnadi, Multilingual hallucination gaps in large language models, arXiv preprint arXiv:2410.18270 (2024).
[15] J. Kasai, Y. Kasai, K. Sakaguchi, Y. Yamada, D. Radev, Evaluating gpt-4 and chatgpt on japanese medical licensing examinations, arXiv preprint arXiv:2303.18027 (2023).
[16] O. Bodenreider, The unified medical language system (umls): integrating biomedical terminology, Nucleic acids research 32 (2004) D267–D270.
[17] F. Li, Y. Chen, H. Liu, R. Yang, H. Yuan, Y. Jiang, T. Li, E. M. Taylor, H. Rouhizadeh, Y. Iwasawa, et al., Mkg-rank: Enhancing large language models with knowledge graph for multilingual medical question answering, arXiv preprint arXiv:2503.16131 (2025).
[18] AXCXEPT, Borea-phi-3.5-mini-instruct-common, https://huggingface.co/AXCXEPT/Borea-Phi-3.5-mini-Instruct-Common, 2024. Retrieved April 08, 2025.
[19] ELYZA, Llama-3-elyza-jp-8b, https://huggingface.co/elyza/Llama-3-ELYZA-JP-8B, 2024. Retrieved April 08, 2025.
[20] LLM-jp, llm-jp-3-7.2b-instruct3, https://huggingface.co/llm-jp/llm-jp-3-7.2b-instruct3, 2024. Retrieved April 08, 2025.
[21] M. AI, Mistral-7b-instruct-v0.3, https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3, 2024. Retrieved April 08, 2025.
[22] Qwen, Qwen2.5-7b-instruct, https://huggingface.co/Qwen/Qwen2.5-7B-Instruct, 2024. Retrieved April 08, 2025.
[23] M. Llama, Llama-3.1-8b-instruct, https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct, 2024. Retrieved April 08, 2025.
[24] Qwen, Qwen2.5-14b-instruct, https://huggingface.co/Qwen/Qwen2.5-14B-Instruct, 2024. Retrieved April 08, 2025.
[25] Microsoft, phi-4, https://huggingface.co/microsoft/phi-4, 2024. Retrieved April 08, 2025.
[26] Google, gemma-3-12b-it, https://huggingface.co/google/gemma-3-12b-it, 2025. Retrieved April 08, 2025.
[27] OpenAI, Gpt-4o-mini, https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/, 2024. Retrieved April 08, 2025.
[28] C. Malaviya, S. Lee, S. Chen, E. Sieber, M. Yatskar, D. Roth, Expertqa: Expert-curated questions and attributed answers, arXiv preprint arXiv:2309.07852 (2023).
[29] A. B. Abacha, E. Agichtein, Y. Pinter, D. Demner-Fushman, Overview of the medical question answering task at trec 2017 liveqa, in: TREC, 2017, pp. 1–12.
[30] W. Xuan, R. Yang, H. Qi, Q. Zeng, Y. Xiao, Y. Xing, J. Wang, H. Li, X. Li, K. Yu, et al., Mmlu-prox: A multilingual benchmark for advanced large language model evaluation, arXiv preprint arXiv:2503.10497 (2025).
[31] C.-Y. Lin, Rouge: A package for automatic evaluation of summaries, in: Text summarization branches out, 2004, pp. 74–81.
[32] T. Zhang, V. Kishore, F. Wu, K. Q. Weinberger, Y. Artzi, Bertscore: Evaluating text generation with bert, arXiv preprint arXiv:1904.09675 (2019).

A. English Translation of the Content in Fig. 1

For clearer presentation, we provide the English translation and the full content of the question and answer in Fig. 1, as illustrated in Fig. 2.

Figure 2: Full content of the question and answer in Fig. 1, with English translation.

Question: 今夜2回嘔吐し、胸部に非常に強い締め付け感がありますが、鼻や喉は通っています。禁煙を助けるためにパッチを使い始めたばかりで、私は喘息持ちですが、現在は酸素が体内に留まっていないような感覚があります。脈拍はあり、また不安症状も経験しています。このパッチが私の喘息や呼吸に影響しているのでしょうか？
(I vomited twice tonight and have a very strong tightness in my chest, although my nose and throat are clear. I just started using a patch to help me quit smoking. I have asthma, but right now it feels like oxygen isn't staying in my body. I have a pulse, and I'm also experiencing anxiety symptoms. Could this patch be affecting my asthma or breathing?)

Answer: あなたの症状について心配されるのは理解できます。胸部の強い締め付け感や嘔吐は、喘息の症状や他の健康問題の兆候である可能性があります。特に、喘息を持っている場合、ストレスや不安が発作を引き起こすことがあります。また、禁煙のためのパッチが体に与える影響も考慮する必要があります。ニコチンパッチは、禁煙を助けるために使用されることが多いですが、喘息の症状を悪化させる可能性があるため、注意が必要です。特に、胸部の締め付け感や呼吸困難を感じている場合は、パッチの使用を中止し、医師に相談することをお勧めします。また、嘔吐や不安症状も、身体のストレス反応の一部である可能性があります。これらの症状が続く場合や悪化する場合は、早急に医療機関を受診することが重要です。あなたの健康が最優先ですので、専門家の意見を仰ぐことを強くお勧めします。
(It is understandable that you are concerned about your symptoms. Severe chest tightness and vomiting could be signs of asthma or other health issues. Especially for individuals with asthma, stress and anxiety can sometimes trigger symptoms. It is also important to consider the potential effects of the nicotine patch on your body. Nicotine patches are commonly used to support smoking cessation, but they may worsen asthma symptoms in some cases, so caution is necessary. If you are experiencing chest tightness or difficulty breathing, it is advisable to stop using the patch and consult a doctor. Additionally, vomiting and anxiety symptoms may be part of your body's stress response. If these symptoms persist or worsen, it is crucial to seek medical attention promptly. Your health is the top priority, and we strongly recommend consulting a medical professional.)

Retrieved Medical Knowledge:
胸部絞扼感は喘息の症状である (Chest tightness is a symptom of asthma)
胸部絞扼感はアナフィラキシー反応の一部である (Chest tightness can be part of an anaphylactic reaction)
嘔吐は消化管の非特異的症状である (Vomiting is a nonspecific symptom of the gastrointestinal tract)
喘息は好酸球性肺炎の一部である (Asthma is a component of eosinophilic pneumonia)

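Prompt: Declarative Conversion
あなたは医学分野の知能助手です。すべての背景知識をそれぞれ日本語の平叙文に変換する。医学に関係ないと思うものは何でも削除できます。
- Background Knowledge: {triple}
Converted Background Knowledge:

(English translation of the instruction: You are an intelligent assistant in the medical field. Convert each piece of background knowledge into a Japanese declarative sentence. You may delete anything you consider unrelated to medicine.)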
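As an illustration of how retrieved triples might flow through the Declarative Conversion prompt, the sketch below fills the `{triple}` slot and then numbers the returned sentences for use as background knowledge. The serialization of triples as "(head, relation, tail)" and the helper names are assumptions for illustration; the paper does not specify either.

```python
# Prompt text as given in the paper; {triple} is the only placeholder.
DECLARATIVE_PROMPT = (
    "あなたは医学分野の知能助手です。すべての背景知識をそれぞれ日本語の平叙文に変換する。"
    "医学に関係ないと思うものは何でも削除できます。\n"
    "- Background Knowledge: {triple}\n"
    "Converted Background Knowledge:"
)


def serialize_triples(triples: list) -> str:
    """Assumed format: one '(head, relation, tail)' group per triple."""
    return "; ".join(f"({h}, {r}, {t})" for h, r, t in triples)


def build_declarative_prompt(triples: list) -> str:
    return DECLARATIVE_PROMPT.replace("{triple}", serialize_triples(triples))


def to_background_knowledge(llm_output: str) -> str:
    """Number each non-empty line of the LLM output as one knowledge sentence."""
    sentences = [line.strip() for line in llm_output.splitlines() if line.strip()]
    return "\n".join(f"{i}. {s}" for i, s in enumerate(sentences, start=1))


triples = [("asthma", "has symptom", "chest tightness")]
print(build_declarative_prompt(triples))
print(to_background_knowledge("胸部絞扼感は喘息の症状である\n\n嘔吐は消化管の非特異的症状である\n"))
```

The numbered output matches the "1. ... 2. ..." layout in which retrieved knowledge appears in the case studies.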

B. Case Study Details

For clarity and ease of understanding, we provide the full content and English translations of the cases from Table 4 here, as shown in Fig. 3 and Fig. 4.

Prompt: Answer Generation with Medical Knowledge
あなたは医学分野の知能助手です。質問をよく分析し、提供された背景知識とあなた自身の知識に基づいて以下の質問に答えてください。できるだけ 512 の token 内で完全に回答します。日本語で質問に答える。
- 問題: {question}
- 背景知識: {background_knowledge}
- 答える:

(English translation of the instruction: You are an intelligent assistant in the medical field. Analyze the question carefully and answer it based on the provided background knowledge and your own knowledge. Answer completely within 512 tokens where possible. Answer the question in Japanese. Fields: Question, Background knowledge, Answer.)

Figure 3: Full content of Case 1.

Question: ワルファリン（ワーファリン）を服用している人は避けるべき野菜は何ですか？
(Which vegetables should people taking warfarin (Coumadin) avoid?)

Retrieved Medical Knowledge:
1. ワルファリンは生理的に凝固因子濃度を低下させる効果がある。 (Warfarin physiologically reduces the concentration of coagulation factors.)
2. ワルファリンは生理的に凝固因子活性を低下させる効果がある。 (Warfarin has the effect of lowering the activity of coagulation factors physiologically.)
3. ワルファリンはクマリン系の抗凝固薬である。 (Warfarin is a coumarin-based anticoagulant.)
4. ワルファリンはWARFARIN SODIUM 3 mg経口錠剤の有効成分である。 (Warfarin is the active ingredient in WARFARIN SODIUM 3 mg oral tablets.)

Ground Truth: ワーファリンを服用している人は、ビタミンKを多く含む野菜の摂取を避けるか制限する必要があります。なぜなら、それが薬の血液凝固抑制効果（血液をサラサラにする効果）を妨げる可能性があるからです。避けるか適度に摂取すべき野菜には、ケール、ほうれん草、コラードグリーン（ケールに似た葉野菜）、フダンソウ（スイスチャード）、カラシナ（からし菜）、ビートの葉、かぶの葉、ブロッコリーなどの葉物野菜が含まれます。さらに、パセリやコリアンダー（パクチー）などのハーブもビタミンKを多く含むため、ワーファリン服用中は適度に摂取すべきです。
(People taking warfarin should avoid or limit the intake of vegetables rich in vitamin K, as they may interfere with the blood-thinning effect of the medication. Vegetables that should be avoided or consumed in moderation include kale, spinach, collard greens (leafy greens similar to kale), Swiss chard, mustard greens, beet greens, turnip greens, and broccoli. Additionally, herbs such as parsley and coriander (cilantro), which are also high in vitamin K, should be consumed in moderation while taking warfarin.)

Generated Answer: ワルファリンを服用している人が避けるべき野菜は、特にビタミンKを多く含むものです。ビタミンKは血液凝固に関与しており、ワルファリンの効果を減少させる可能性があります。具体的には、以下のような野菜が挙げられます。
- ケール - ほうれん草 - ブロッコリー - キャベツ - パセリ
これらの野菜はビタミンKの含有量が高いため、ワルファリンの効果を妨げる可能性があるため、摂取を控えるか、医師と相談の上で適切な量を守ることが重要です。
(People taking warfarin should avoid vegetables that are particularly high in vitamin K, as vitamin K is involved in blood clotting and may reduce the effectiveness of warfarin. Specifically, the following vegetables should be considered: - Kale - Spinach - Broccoli - Cabbage - Parsley. These vegetables contain high levels of vitamin K and may interfere with the effect of warfarin. It is important to either avoid them or consult with a doctor to ensure an appropriate amount is consumed.)

Figure 4: Full content of Case 2.

Question: 抗生薬は通常、細菌感染または真菌感染のある患者に投与されます。ウイルスが原因であることが多い疾患については、ではどうでしょうか？
(Antibiotics are typically administered to patients with bacterial or fungal infections. But what about diseases that are often caused by viruses?)

Retrieved Medical Knowledge:
1. バイ菌感染は、ニトロフラン抗菌薬によって予防されることがあります。 (Bacterial infections may be prevented by nitrofuran antibiotics.)
2. バイ菌感染は、他のバイ菌性疾患に分類されます。 (Bacterial infections are classified under other bacterial diseases.)
3. バイ菌感染は、肝膿瘍に分類されます。 (Bacterial infections are classified as liver abscesses.)
4. バイ菌感染は、他のバイ菌感染に分類されます。 (Bacterial infections are classified as other bacterial infections.)
5. バイ菌感染は、ニトロフラン抗菌薬によって治療されることがあります。 (Bacterial infections may be treated with nitrofuran antibiotics.)

Ground Truth: 抗生物質は、感冒、インフルエンザ、様々な咳などのウイルス感染症に対しては効果がありません。ウイルス感染症の患者に対して、医師は症状管理のための一般用医薬品を含む治療を推奨することがあり、抗生物質を使用する場合は必要な場合にのみ使用することが常に推奨されます。場合によっては、HIVに対する抗ウイルス療法、インフルエンザ、COVID-19、サル痘(mpox)、またはインフルエンザに対するオセルタミビルリン酸塩（タミフル）など、特定のウイルスにのみ効果のある特定の薬剤が処方されることがあります。抗生物質は細菌感染症に特化して処方されるものであり、ウイルス感染症には効果がないことに注意することが重要です。これは、ウイルスが細胞ではなく、細菌細胞と比較して細胞壁を持たないためです。したがって、抗生物質はウイルス感染症に対して効果を示しません。
(Antibiotics are not effective against viral infections such as the common cold, influenza, and various types of coughs. For patients with viral infections, doctors may recommend treatments that include over-the-counter medications for symptom management, and antibiotics should only be used when necessary. In some cases, specific antiviral medications may be prescribed for certain viruses, such as antiretroviral therapy for HIV, oseltamivir phosphate (Tamiflu) for influenza, or treatments for COVID-19 or monkeypox (mpox). It is important to note that antibiotics are prescribed specifically for bacterial infections and are not effective for viral infections. This is because viruses do not have a cell wall, unlike bacterial cells. Therefore, antibiotics are ineffective against viral infections.)

Generated Answer: 抗生物質は主に細菌感染に対して効果がありますが、ウイルス感染に対しては効果がありません。ウイルスが原因の疾患、例えばインフルエンザや風邪などには、抗生物質は無効です。ウイルス感染に対しては、抗ウイルス薬が使用されることがあります。したがって、ウイルスが原因であることが多い疾患に対しては、抗生物質を投与することは適切ではありません。ウイルス感染に対しては、症状を緩和するための対症療法や、必要に応じて抗ウイルス薬が用いられます。
(Antibiotics are primarily effective against bacterial infections but are not effective against viral infections. For diseases caused by viruses, such as influenza and the common cold, antibiotics are ineffective. Antiviral medications may be used for viral infections. Therefore, it is not appropriate to administer antibiotics for diseases that are often caused by viruses. For viral infections, symptomatic treatments are used to relieve symptoms, and antiviral medications may be used when necessary.)

[1] Nori, King, S. M. McKinney, Carignan, E. Horvitz, Capabilities of gpt-4 on medical challenge problems, arXiv preprint arXiv:2303.13375 (2023).
[2] Shi, Xu, Yang, Liu, T. Liu, Li, Liu, Mkrag: Medical knowledge retrieval augmented generation for medical question answering, arXiv preprint arXiv:2309.16035 (2023).
[3] Xiong, Jin, Lu, A. Zhang, Benchmarking retrieval-augmented generation for medicine, in: L.-W. Ku, A. Martins, V. Srikumar (Eds.), Findings of the Association for Computational Linguistics: ACL 2024, Association for Computational Linguistics, Bangkok, Thailand, 2024, pp. 6233–6251. URL: https://aclanthology.org/2024.findings-acl.372/. doi:10.18653/v1/2024.findings-acl.372.
[4] Yang, Liu, Marrese-Taylor, Zeng, Y. H. Ke, Li, L. Cheng, Chen, Caverlee, Matsuo, et al., Kg-rank: Enhancing large language models for medical qa with knowledge graphs and ranking techniques, arXiv preprint arXiv:2403.05881 (2024).
[5] Edge, Trinh, N. Cheng, J. Bradley, A. Chao, A. Mody, S. Truitt, D. Metropolitansky, R. O. Ness, J. Larson, From local to global: A graph rag approach to query-focused summarization, arXiv preprint arXiv:2404.16130 (2024).
[6] McKenna, Li, L. Cheng, M. J. Hosseini, M. Johnson, M. Steedman, Sources of hallucination by large language models on inference tasks, arXiv preprint arXiv:2305.14552 (2023).
[7] Yang, Ning, E. Keppo, M. Liu, Hong, D. S. Bitterman, J. C. L. Ong, D. S. W. Ting, N. Liu, Retrieval-augmented generation for generative artificial intelligence in health care, npj Health Systems 2 (2025) 2.
[8] Zhang, O. Press, Merrill, Liu, N. A. Smith, How language model hallucinations can snowball, arXiv preprint arXiv:2305.13534 (2023).
[9] Chen, H. Liu, Liu, Xie, Yang, Yuan, Fu, P. Y. Zhou, Chen, Caverlee, et al., Graphcheck: Breaking long-term text barriers with extracted knowledge graph-powered fact-checking, arXiv preprint arXiv:2502.16514 (2025).
[10] Ke, Yang, S. A. Lie, T. X. Y. Lim, Ning, Li, H. R. Abdullah, D. S. W. Ting, N. Liu, Mitigating cognitive biases in clinical decision-making through multi-agent conversations using large language models: simulation study, Journal of Medical Internet Research 26 (2024) e59439.