<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Understanding Tables in Financial Documents Shared Tasks for Table Retrieval and Table QA on Japanese Annual Securities Reports</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Yasutomo</forename><surname>Kimura</surname></persName>
							<email>kimura@res.otaru-uc.ac.jp</email>
							<affiliation key="aff0">
								<orgName type="institution">Otaru University of Commerce</orgName>
								<address>
									<settlement>Hokkaido</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Eisaku</forename><surname>Sato</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Otaru University of Commerce</orgName>
								<address>
									<settlement>Hokkaido</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Kazuma</forename><surname>Kadowaki</surname></persName>
							<email>kadowaki.kazuma@jri.co.jp</email>
							<affiliation key="aff1">
								<orgName type="institution">The Japan Research Institute Limited</orgName>
								<address>
									<settlement>Tokyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Hokuto</forename><surname>Ototake</surname></persName>
							<email>ototake@fukuoka-u.ac.jp</email>
							<affiliation key="aff2">
								<orgName type="institution">Fukuoka University</orgName>
								<address>
									<settlement>Fukuoka</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff3">
								<orgName type="department">Testbeds and Community for Information Access Research</orgName>
								<orgName type="laboratory">The First Workshop on Evaluation Methodologies</orgName>
								<address>
									<addrLine>December 12</addrLine>
									<postCode>2024</postCode>
									<settlement>Tokyo</settlement>
									<country key="JP">Japan</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Understanding Tables in Financial Documents Shared Tasks for Table Retrieval and Table QA on Japanese Annual Securities Reports</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">7D64228A3DA3121A8622AC31AB865FC8</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:46+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>annual securities report</term>
					<term>shared task</term>
					<term>table retrieval</term>
					<term>table question-answering</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper presents a framework for the "NTCIR-18 U4" and "SIG-FIN UFO-2024" shared tasks, which focus on tables within annual securities reports. Annual securities reports are critical documents that provide insights into a company's financial status and business performance. However, challenges remain in accurately and efficiently analyzing the data they contain. To address these issues, we propose two sub-tasks for the above shared tasks: Table Retrieval and Table QA tasks, which utilize datasets from TOPIX100 and TOPIX500 annual securities reports. Participants are tasked with developing systems (programs) that automatically process data for the two tasks and compete for top performance on a leaderboard. Accuracy scores and rankings are determined by submitting the task's output, in JSON format, to the leaderboard. Through these shared tasks, we aim to enhance the utility of annual securities reports and advance natural language processing technologies for financial data analysis.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>In recent years, financial disclosures have become essential for investors seeking to make informed decisions based on reliable corporate data. In Japan, listed companies are required to submit an annual securities report, a statutory disclosure document that provides comprehensive information on business operations, financial data, risk factors, corporate governance, and shareholder information. These reports, accessible via the Electronic Disclosure for Investors' NETwork (EDINET) 1 , serve as a critical information source for investors aiming to compare companies effectively.</p><p>These securities reports are structured in XBRL (eXtensible Business Reporting Language), an XML-based format designed to standardize and facilitate the production, distribution, and reuse of financial information. By incorporating "taxonomies" that define the structure and meaning of data, XBRL enables automated processing, potentially streamlining financial analysis.</p><p>However, practical challenges arise due to the presence of untagged data and the existence of unique taxonomies created by different report submitters, complicating the identification of comparable elements across reports.</p><p>To this end, we propose two tasks that aim to facilitate cross-company comparisons by focusing on the tables and text within annual securities reports. The first task is the NTCIR-18 U4 task, adopted by Japan's National Institute of Informatics (NII) as part of NTCIR-18 2 . The second is the SIG-FIN UFO-2024 task, organized by the Financial Informatics Study Group (SIG-FIN) under the Japanese Society for Artificial Intelligence (JSAI). The former focuses on TOPIX100 annual securities reports submitted between April 1, 2020, and March 31, 2021, while the latter focuses on TOPIX500 annual securities reports submitted between July 1, 2023, and June 30, 2024.
We organized these shared tasks in collaboration with the NII Testbeds and Community for Information Access Research (NTCIR) <ref type="bibr" target="#b0">[1]</ref>, which specializes in information retrieval, and the SIG-FIN group of the JSAI, which focuses on financial technology. Through these initiatives, we aim to attract researchers and practitioners interested in these fields and contribute to further advancing technologies at the intersection of finance and information retrieval.</p><p>In each shared task, we conducted two sub-tasks: the Table Retrieval task, which involves searching for tables, and the Table Question Answering (Table QA) task, which involves answering questions by identifying the target cells within the tables, as illustrated in Figure <ref type="figure" target="#fig_0">1</ref>. We designed each sub-task and constructed datasets for each task.</p><p>The contributions of this study are as follows:</p><p>• Design of two tasks, Table Retrieval and Table QA, targeting securities reports.</p><p>• Construction of datasets for Table Retrieval and Table QA, and their release on GitHub<ref type="foot" target="#foot_2">3</ref>.</p><p>• Organization of the NTCIR-18 U4 task and the SIG-FIN UFO-2024 task.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Work</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Research on Tables</head><p>A table is a data format with a two-dimensional structure used to organize and manage knowledge or information, and it is widely utilized in various contexts. However, not all tables have a highly structured database format; they are often represented as semi-structured data. Furthermore, the data contained in table cells is not limited to numerical values; it frequently includes strings and other non-numerical data. Numerous methods have been proposed to accommodate such diverse table data.</p><p>Zhang and Balog <ref type="bibr" target="#b1">[2]</ref> surveyed tables on the web and classified approaches to accessing table data into six main categories: table extraction, table interpretation, table search, table question answering, knowledge base augmentation, and table augmentation. In addition, table-related tasks include table fact verification <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref>, table detection (searching for tables within documents) <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b5">6]</ref>, spreadsheet manipulation <ref type="bibr" target="#b6">[7]</ref>, column type annotation <ref type="bibr" target="#b7">[8]</ref>, and entity linking (linking to knowledge bases) <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b9">10]</ref>. These tasks are critical in information retrieval and data analysis based on table data, and they are particularly anticipated in fields where handling large-scale data and automation are required. Recently, approaches utilizing large language models (LLMs) and visual language models (VLMs) have been increasing, and research on learning methods, prompt engineering, and agents is also gaining attention <ref type="bibr" target="#b10">[11]</ref>. Our shared tasks (NTCIR-18 U4 and SIG-FIN UFO-2024) are related to table search, table detection, and table question answering (Table QA).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Table Retrieval and Table QA</head><p>Table retrieval aims to identify appropriate tables from vast datasets <ref type="bibr" target="#b11">[12]</ref>. In this task, methods that assign relevance scores on the basis of the relationship between natural language queries and individual tables are commonly used.</p><p>Table QA refers to the technology that provides appropriate answers from tables in response to user questions. Approaches to Table QA include semantic parsing-based, generation-based, extraction-based, matching-based, and retrieval-based methods <ref type="bibr" target="#b12">[13]</ref>. The difficulty in Table QA lies in the need to handle semi-structured or unstructured data, as it also involves non-database tables.</p><p>Compared to existing Table QA datasets such as FinQA <ref type="bibr" target="#b13">[14]</ref> and TAT-QA <ref type="bibr" target="#b14">[15]</ref>, which are primarily English-language datasets designed to handle numerical reasoning in financial contexts, our proposed shared tasks specifically target the Japanese language. Japanese tabular and textual data often exhibit unique linguistic and structural features distinct from those in English datasets, including variations in numerical data formats, context-dependent expressions, and implicit relational cues.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Tables in the Financial Domain</head><p>Hybrid data, which includes both tables and text, such as in financial reports, is quite prevalent in the real world <ref type="bibr" target="#b15">[16]</ref>. Zhu et al. constructed a question-answering benchmark dataset focused on the hybrid content of tabular and textual data in the financial domain <ref type="bibr" target="#b14">[15]</ref>.</p><p>Pan et al. proposed CLTR, an architecture for end-to-end table retrieval at the cell level <ref type="bibr" target="#b16">[17]</ref>. While CLTR can be applied to open-domain datasets, including finance and healthcare ones, its performance specifically within the financial domain has not been clarified, nor does it target the Japanese language.</p><p>One of the tasks focused on Japanese financial table structure analysis is the UFO (Understanding of non-Financial Objects in Financial Reports) task <ref type="bibr" target="#b17">[18]</ref>. The UFO task aims to extract structured information from tables and text found in annual securities reports and consists of two sub-tasks: the Table Data Extraction (TDE) task and the Text-to-Table Relationship Extraction (TTRE) task. The TDE task classifies cells in tables into four categories with the goal of identifying the type of each cell: metadata, header, attribute, and data <ref type="bibr" target="#b18">[19]</ref>. The main focus of TDE was on cell classification, and additional processing to enable inter-company comparisons remained unexplored.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">NTCIR-18 U4 and SIG-FIN UFO-2024 Tasks</head><p>Both the NTCIR-18 U4 and SIG-FIN UFO-2024 tasks consist of two sub-tasks: the Table Retrieval task, which involves searching for tables, and the Table Question Answering (Table <ref type="table">QA</ref>) task, which involves answering questions by identifying the target cells within the tables <ref type="bibr" target="#b19">[20]</ref>. Figure <ref type="figure" target="#fig_0">1</ref> illustrates the concept of these two sub-tasks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Table Retrieval: Table Search Task</head><p>Table Retrieval is a task that involves searching for a "table" containing the values that answer a given question from the tables included in a company's annual securities report. On average, a company's annual securities report contains 221.9 tables <ref type="bibr" target="#b20">[21]</ref>, so the specific table that contains the answer to the question needs to be identified. The input, output, and evaluation criteria for this task are as follows.</p></div>
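As a minimal sketch, an embedding-based baseline for this task scores each candidate table by the cosine similarity between the question embedding and the table's serialized text embedding. Here `embed` is a deterministic toy stand-in, not the real encoder; the reported baselines used OpenAI text-embedding-3 models over Cell Text, HTML, or Markdown serializations (see Table 1).

```python
import hashlib
import numpy as np

def embed(text: str) -> np.ndarray:
    """Hypothetical stand-in for a sentence encoder such as
    text-embedding-3-small: derives a deterministic unit vector
    from an MD5 seed, for illustration only."""
    seed = int(hashlib.md5(text.encode("utf-8")).hexdigest()[:8], 16)
    v = np.random.default_rng(seed).standard_normal(64)
    return v / np.linalg.norm(v)

def retrieve_table(question: str, tables: dict) -> str:
    """Rank candidate tables (Table ID -> serialized table text) by cosine
    similarity with the question embedding; return the best Table ID."""
    q = embed(question)
    return max(tables, key=lambda tid: float(embed(tables[tid]) @ q))
```

In practice the `tables` dictionary would map each Table ID (e.g. "S100ISF1-0101010-tab2") to the text extracted from the corresponding &lt;table&gt; element.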
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Input</head><p>1. Question 2. HTML file of the annual securities report</p><p>For the output, the Table <ref type="table" target="#tab_5">ID</ref> of the table that answers the question is used. In the output example above, the Table ID "S100ISF1-0101010-tab2" refers to the second table in the "S100ISF1-0101010.html" file, with "-tab{table number}" appended to the file name.</p><p>The metric used for evaluation is accuracy, which is calculated by dividing the number of correct outputs by the total number of inputs in the test dataset.</p><p>We evaluated a few baseline methods using our validation datasets, which contain 3,131 and 1,533 questions for NTCIR-18 U4 and SIG-FIN UFO-2024, respectively. The results are shown in Table <ref type="table" target="#tab_6">1</ref>. For the NTCIR-18 U4 task, the highest accuracy of 0.2111 was achieved by using the text-embedding-3-small model to create embeddings based on Cell Text. Similarly, for the SIG-FIN UFO-2024 task, a top accuracy of 0.1937 was obtained using the text-embedding-3-large model for Cell Text embeddings.</p><p>For input data, HTML files and Table IDs are provided, allowing the system to extract the range enclosed by the &lt;table&gt; tag from the HTML file. Additionally, if necessary, the system can utilize the surrounding context of the table.</p><p>Similar to Table <ref type="table" target="#tab_5">IDs</ref>, each cell (&lt;th&gt; and &lt;td&gt; tags) within the table in the HTML file is assigned a unique Cell ID. When outputting the value corresponding to the answer to the given question, this Cell ID is used. 
In the output example above, the Cell ID is the Table ID of the table containing the cell, with "-r{row number}c{column number}" appended, so the Cell ID "S100ISF1-0101010-tab2-r8c1" refers to the cell in the 8th row and 1st column of the table "S100ISF1-0101010-tab2".</p><p>For evaluation, similar to the Table Retrieval task, accuracy is calculated by dividing the number of correct outputs by the total number of inputs in the test dataset. However, discrepancies between the value contained in the HTML cell and the expected answer are frequently observed. For example, if the expected answer is "4448000000", the corresponding cell in the HTML might contain the string "4,448", while another cell, such as one in the top-right corner or a column header, might indicate "(in millions of yen)". In this case, the system answering the task must reference both cells to generate the answer "4,448 million yen". While this is equivalent to the correct answer, to compare it accurately, the system must replace the string "million yen" with "000000" and remove the comma.</p><p>For this reason, in this task, both the response and the correct answer are normalized before calculating accuracy. The normalization specification was continually revised during the "dry run" period, considering feedback from participants.</p><p>We evaluated a few baseline methods using our validation datasets, which contain 3,132 and 1,534 questions for NTCIR-18 U4 and SIG-FIN UFO-2024, respectively. These baseline methods involved converting the target table into text format and inputting it, along with the question, into an LLM to generate the desired values. The results are shown in Table <ref type="table" target="#tab_8">2</ref>. For the NTCIR-18 U4 task, the highest accuracy of 0.7471 was achieved by using the Claude 3 Opus model. Similarly, for the SIG-FIN UFO-2024 task, a top accuracy of 0.5750 was obtained using the GPT-4o model.</p></div>
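A minimal normalizer along these lines might look as follows. This is a hypothetical sketch, not the official specification (which was revised during the dry run and covers more cases); the "thousand yen" rule is an assumed additional example.

```python
# Hypothetical unit-suffix table; the official normalization covers more cases.
UNIT_SUFFIXES = {
    "million yen": "000000",
    "thousand yen": "000",
}

def normalize_answer(value: str) -> str:
    """Normalize an answer string so surface variants compare equal:
    strip surrounding whitespace, remove thousands separators, then
    expand a known unit suffix into the corresponding trailing zeros."""
    s = value.strip().replace(",", "")
    for unit, zeros in UNIT_SUFFIXES.items():
        if s.endswith(unit):
            s = s[: -len(unit)].strip() + zeros
            break
    return s
```

With this, both "4,448 million yen" and "4448000000" normalize to the same string before the accuracy comparison.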
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Dataset</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Securities Reports Used in Our Dataset</head><p>The NTCIR-18 U4 task focuses on analyzing securities reports from companies in the TOPIX100 index. The dataset consists of securities reports from companies that are part of the TOPIX100, submitted between April 1, 2020, and March 31, 2021.</p><p>The SIG-FIN UFO-2024 task, on the other hand, focuses on analyzing securities reports from companies in the TOPIX500 index. The annual securities reports used in this task are drawn from the TOPIX500, which comprises publicly listed companies with high market capitalization and liquidity. For this task, we target the annual securities reports of 497 companies<ref type="foot" target="#foot_3">4</ref> constituting the TOPIX 500 as of April 30, 2024. The dataset includes 494 financial statements submitted to EDINET between July 1, 2023, and June 30, 2024.</p><p>To account for differences in the structure of annual securities reports across industries, we ensure that the dataset is balanced by industry. The annual securities reports are distributed across the training, validation, and test sets with minimal industry bias. Specifically, we use the ten major categories from the Tokyo Stock Exchange's 33 industry classifications (service industry, transportation and communications, finance and insurance, construction, mining, commerce, fisheries, agriculture and forestry, manufacturing, electricity and gas, and real estate). The data is divided such that the ratio of train:validation:test is approximately 6:1:3 within each industry category. This results in 289 companies' reports being used for training, 52 for validation, and 153 for testing.</p><p>We retrieve the financial data using the EDINET API v2, utilizing the XBRL, HTML, and CSV files available through the API. The XBRL files contain tabular data, such as taxonomies and instances, referred to as "XBRL information" below, which is also embedded in the corresponding HTML files. 
The CSV files, referred to below as "annual securities report CSVs," provide the XBRL data in a more accessible format for easier handling in this study.</p></div>
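The industry-balanced 6:1:3 split described above can be sketched as follows. This is a hypothetical re-implementation of the procedure, not the organizers' actual script; function and field names are illustrative.

```python
import random
from collections import defaultdict

def split_by_industry(companies, ratios=(6, 1, 3), seed=0):
    """Split (company_id, industry) pairs into train/validation/test so
    the 6:1:3 ratio holds approximately within each industry category."""
    rng = random.Random(seed)
    by_industry = defaultdict(list)
    for cid, industry in companies:
        by_industry[industry].append(cid)
    train, valid, test = [], [], []
    total = sum(ratios)
    for cids in by_industry.values():
        rng.shuffle(cids)
        n = len(cids)
        n_train = round(n * ratios[0] / total)
        n_valid = round(n * ratios[1] / total)
        train += cids[:n_train]
        valid += cids[n_train:n_train + n_valid]
        test += cids[n_train + n_valid:]
    return train, valid, test
```

Stratifying per industry before shuffling is what keeps each category close to the global 6:1:3 ratio, rather than splitting the pooled company list.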
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Question Creation</head><p>Questions are created using annual securities report CSVs and question templates. In the annual securities report CSV, each row represents data, and each column shows the corresponding XBRL information (element ID, item name, context ID, relevant year, consolidated or individual, period or point in time, unit ID, unit, value). Among this XBRL information, the element ID and context ID are crucial for data extraction. The element ID indicates what the data represents, but it is not unique within a single annual securities report. Therefore, combining the element ID with the context ID, which represents the period and dimension, enables data within a report to be uniquely identified and the desired information to be extracted. Thus, the question must include both element ID and context ID.</p><p>On the basis of this, the initial version of the question is created as follows: Question (Initial Version)</p><p>What is the value of "{Element ID}" for {Company Name} in {Context ID}?</p><p>However, if the element ID and context ID are used as they appear in the annual securities report CSV, the question will not be meaningful in Japanese as they are simply IDs. Therefore, the context ID is represented using the relative year, consolidated or individual, and period or point in time, while the element ID is expressed as the item name. The final version of the question is defined as follows:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Question (Detailed Version)</head><p>What is the value of "{Item Name}" in the {Year} {Period or Point in Time} {Consolidated or Individual (optional)} annual securities report of {Company Name} for {Member Element (optional)}?</p><p>The explanations for each part are as follows:</p><p>• Year: Calculated on the basis of the relevant year, and the string is included in the question.</p><p>• Period or Point in Time: If it is a point in time, the word "point" is added right after the year.</p><p>• Consolidated or Individual: If it is consolidated or individual, the corresponding string is included; otherwise, "annual securities report of" is omitted.</p><p>• Member Element: If the context ID contains a member element, the string is included. This element is not translated into Japanese to ensure uniqueness and is used as-is from the annual securities report CSV (ensuring uniqueness is a future challenge).</p><p>• Item Name: This is essentially the Japanese translation of the element ID, so the string is included.</p><p>An example of a question created using the template is as follows: Example Created with Question Template What is the value of "Building (net amount)" in the 2020 individual annual securities report of Daiwa House Industry Co., Ltd. for NonConsolidatedMember?</p><p>When creating questions for the SIG-FIN UFO-2024 dataset, we also performed data sampling to avoid generating too many similar questions. For data sampling, a unique list of item names is created for each company, and random sampling is performed so that 1/10th of the entire dataset is selected. Additionally, the number of samples per item name is adjusted on the basis of the number of data entries for each item name <ref type="foot" target="#foot_4">5</ref>.</p><p>As a result of these procedures, we constructed the NTCIR-18 U4 dataset consisting of 32,587 entries for the Table Retrieval task and 32,589 entries for the Table QA task, and the SIG-FIN UFO-2024 dataset consisting of 14,410 entries for the Table Retrieval task and 14,412 entries for the Table QA task, as shown in Table <ref type="table" target="#tab_10">3</ref><ref type="foot" target="#foot_5">6</ref>.</p></div>
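The template assembly described above can be sketched as follows. Field names such as `item_name`, `consolidated_or_individual`, and `member` are hypothetical stand-ins for the annual securities report CSV columns; the real questions are generated in Japanese.

```python
def build_question(row: dict) -> str:
    """Assemble the detailed question template from CSV-derived fields.
    Optional parts are included only when the corresponding field is set."""
    q = f'What is the value of "{row["item_name"]}" in the {row["year"]}'
    if row.get("is_point_in_time"):        # period vs. point in time
        q += " point"
    if row.get("consolidated_or_individual"):
        q += f' {row["consolidated_or_individual"]} annual securities report of'
    q += f' {row["company"]}'
    if row.get("member"):                  # member element, kept untranslated
        q += f' for {row["member"]}'
    return q + "?"
```

Applied to the Daiwa House example above, this reproduces the question shown in the text.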
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Schedule</head><p>To encourage broad participation from those interested in finance, the organizers have introduced two complementary tasks: the NTCIR-18 U4 and the SIG-FIN UFO-2024. The SIG-FIN community includes researchers and practitioners actively engaged in finance, while NTCIR attracts participants interested in shared tasks, especially those with a focus on information retrieval and natural language processing.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Conclusion</head><p>This paper proposed a framework for two shared tasks, NTCIR-18 U4 and SIG-FIN UFO-2024, which focus on tables within annual securities reports. In these shared tasks, two sub-tasks are conducted: Table Retrieval and Table Question Answering (Table QA), which target the annual securities reports of companies belonging to the TOPIX 100 or TOPIX 500 indexes.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Overview of the Table Retrieval Task and Table QA Task</figDesc><graphic coords="2,72.00,65.61,453.54,149.51" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>SIG-FIN UFO-2024: GPT-4o (gpt-4o-2024-05-13) 0.5750; GPT-4o-mini (gpt-4o-mini-2024-07-18) 0.3957</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>Table Retrieval Task and Table QA Task</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_5"><head>Table ID</head><label>ID</label><figDesc>For the input, HTML files downloaded from EDINET are used. Each table element (&lt;table&gt;) in the HTML files is assigned a unique Table ID, and when outputting the table that answers the question, this Table ID is used.</figDesc><table><row><cell>Evaluation Accuracy</cell></row><row><cell>An example of input and output is shown below.</cell></row><row><cell>Input 1. For Bandai Namco Holdings Inc.,</cell></row><row><cell>what were the "net assets and key management indicators" as of 2020?</cell></row><row><cell>2. S100ISF1-0000000.html, S100ISF1-0101010.html, ...</cell></row><row><cell>Output S100ISF1-0101010-tab2</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_6"><head>Table 1</head><label>1</label><figDesc>Validation Results for the Table Retrieval Task</figDesc><table><row><cell>Task</cell><cell>Baseline methods</cell><cell>Accuracy</cell></row><row><cell>NTCIR-18 U4</cell><cell>text-embedding-3-small + Cell Text</cell><cell>0.2111</cell></row><row><cell></cell><cell>text-embedding-3-large + Cell Text</cell><cell>0.1833</cell></row><row><cell></cell><cell>text-embedding-3-small + HTML Text</cell><cell>0.1843</cell></row><row><cell></cell><cell>text-embedding-3-large + HTML Text</cell><cell>0.1418</cell></row><row><cell></cell><cell>text-embedding-3-small + Markdown Text</cell><cell>0.1233</cell></row><row><cell></cell><cell>text-embedding-3-large + Markdown Text</cell><cell>0.1383</cell></row><row><cell>SIG-FIN UFO-2024</cell><cell>text-embedding-3-small + Cell Text</cell><cell>0.1657</cell></row><row><cell></cell><cell>text-embedding-3-large + Cell Text</cell><cell>0.1937</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_7"><head>Table QA: Table Question Answering Task</head><label></label><figDesc>The Table QA task, given a target table, identifies the "value" that answers the question. To accurately determine the answer, complex tables included in annual securities reports need to be handled<ref type="bibr" target="#b21">[22]</ref>.</figDesc><table><row><cell>Input</cell><cell>1. Question</cell></row><row><cell></cell><cell>2. Target table (Table ID)</cell></row><row><cell></cell><cell>3. HTML file of the annual securities report</cell></row><row><cell>Output</cell><cell>Value, Cell ID</cell></row><row><cell cols="2">Evaluation Accuracy (value), Accuracy (cell ID)</cell></row><row><cell cols="2">An example of input and output is shown below.</cell></row><row><cell cols="2">Input 1. For Bandai Namco Holdings Inc.,</cell></row><row><cell></cell><cell>what were the "net assets and key management indicators" as of 2020?</cell></row><row><cell></cell><cell>2. S100ISF1-0101010-tab2</cell></row><row><cell></cell><cell>3. S100ISF1-0000000.html, S100ISF1-0101010.html, ...</cell></row><row><cell cols="2">Output S100ISF1-0101010-tab2-r8c1</cell></row><row><cell cols="2">The input, output, and evaluation criteria for this task are as follows:</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_8"><head>Table 2</head><label>2</label><figDesc>Validation Results for the Table QA Task</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_10"><head>Table 3</head><label>3</label><figDesc>Breakdown of Each Dataset for Dry Run</figDesc><table><row><cell>Task</cell><cell></cell><cell cols="3">Train Validation Test Total</cell></row><row><cell>NTCIR-18 U4</cell><cell cols="2">Table Retrieval 22,982</cell><cell>3,131</cell><cell>6,474 32,587</cell></row><row><cell></cell><cell>Table QA</cell><cell>22,982</cell><cell>3,132</cell><cell>6,475 32,589</cell></row><row><cell cols="3">SIG-FIN UFO-2024 Table Retrieval 8,390</cell><cell>1,533</cell><cell>4,487 14,410</cell></row><row><cell></cell><cell>Table QA</cell><cell>8,390</cell><cell>1,534</cell><cell>4,488 14,412</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://disclosure2.edinet-fsa.go.jp/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">https://research.nii.ac.jp/ntcir/ntcir-18/index-en.html</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">https://github.com/nlp-for-japanese-securities-reports/ntcir18-u4, https://github.com/nlp-for-japanese-securities-reports/ufo-2024</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">Despite the name TOPIX500, as of 30 April 2024, it only includes 497 companies. To be more precise, the TOPIX500 is a stock price index composed of the TOPIX Core30, TOPIX Large70 and TOPIX Mid400, but of these, only 397 companies are included in the TOPIX Mid400.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">If there is only one data entry for an item name, one sample is taken; if there are two to five, two samples are taken; if there are six or more, three samples are randomly selected. Data with submitter-specific taxonomies that do not include an item name in the annual securities report CSV, or data not from tables (i.e., data not in HTML's &lt;td&gt; tags) are excluded.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">These are the breakdowns of the initial datasets used for the Dry Run. The datasets used in the Formal Run phase have been modified to fix a couple of issues, and as a result, they contain a slightly different number of entries.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_6">https://www.kaggle.com/</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This research was supported by JSPS KAKENHI Grant Number 21H03769. We would also like to express our gratitude to everyone at the National Institute of Informatics, Japan, the NTCIR Co-chairs, the members of the SIG-FIN Research Group, and our corporate sponsor, Preferred Networks, Inc., for their valuable cooperation in planning these shared tasks.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>By running similar tasks across these two distinct communities, we aim to foster interaction among participants with diverse expertise and perspectives on finance, creating an opportunity for knowledge exchange and collaboration. We look forward to welcoming a diverse group of participants to build a comprehensive and impactful competition.</p><p>The schedule for each task is outlined in Table <ref type="table">4</ref>, showing the parallel timelines and key phases for both NTCIR-18 U4 and SIG-FIN UFO-2024. As illustrated in the table, both tasks share similar phases such as a dry run, formal run, and evaluation period, which will allow participants to apply and refine their approaches across both tasks seamlessly. This alignment ensures that participants will have the opportunity to benefit from complementary insights across the tasks and fosters collaborative learning within the community. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.1.">NTCIR-18 U4 Task</head><p>The schedule for the NTCIR-18 U4 task is as follows: The dataset for the NTCIR-18 U4 shared task was released in July 2024, followed by an online briefing session on July 20, 2024, where participants received essential information about the task. The dry run phase ran from July 2024 to October 31, 2024, during which participants worked on the dataset and refined their methods. Any issues identified in the dataset during this phase were addressed and resolved to ensure a smooth formal run. The formal run phase is scheduled from November 1, 2024, to December 28, 2024. Throughout the NTCIR-18 U4 task, a leaderboard will be used to provide participants with real-time feedback on their performance. Similar to the SIG-FIN UFO-2024 task, the NTCIR-18 U4 leaderboard will display a Public score on the basis of a subset of the test data during the task period, allowing participants to gauge their progress. Evaluation results and final rankings are scheduled to be returned to participants on February 1, 2025, along with a partial publication of the task overview paper summarizing key outcomes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.2.">SIG-FIN UFO-2024 Task</head><p>The schedule for the SIG-FIN UFO-2024 task is as follows: The dataset for this shared task was released on August 15, 2024, with the dry run phase extending until October 31, 2024. During this phase, participants worked on developing their methods using the dataset, and any data issues identified during this period were addressed and corrected. The formal run phase is scheduled from November 1, 2024, to December 28, 2024. The shared task ranking will be determined on the basis of the evaluation method used in Kaggle<ref type="foot" target="#foot_6">7</ref>, incorporating both Public and Private scores. Throughout the shared task, the Public score (calculated from a subset of the test data) will be displayed on the leaderboard. After the shared task concludes, the Private score (evaluated on the remaining portion of the test data) will be calculated. The final results, based on the Private score, are scheduled to be announced at the 34th SIG-FIN in March 2025.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Evaluating Information Retrieval and Access Tasks: NTCIR&apos;s Legacy of Research Impact, The Information Retrieval Series</title>
		<author>
			<persName><forename type="first">T</forename><surname>Sakai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">W</forename><surname>Oard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Kando</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-981-15-5554-1</idno>
		<imprint>
			<date type="published" when="2021">2021</date>
			<publisher>Springer Nature</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Web table extraction, retrieval, and augmentation: A survey</title>
		<author>
			<persName><forename type="first">S</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Balog</surname></persName>
		</author>
		<idno type="DOI">10.1145/3372117</idno>
	</analytic>
	<monogr>
		<title level="j">ACM Transactions on Intelligent Systems and Technology (TIST)</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="1" to="35" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">TabFact: A large-scale dataset for table-based fact verification</title>
		<author>
			<persName><forename type="first">W</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">Y</forename><surname>Wang</surname></persName>
		</author>
		<ptr target="https://openreview.net/forum?id=rkeJRhNYDH" />
	</analytic>
	<monogr>
		<title level="m">8th International Conference on Learning Representations, ICLR 2020</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Table-based fact verification with salience-aware learning</title>
		<author>
			<persName><forename type="first">F</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pujara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Szekely</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Chen</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2021.findings-emnlp.338</idno>
	</analytic>
	<monogr>
		<title level="m">Findings of the Association for Computational Linguistics: EMNLP 2021</title>
				<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="4025" to="4036" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">TableVLM: Multi-modal pre-training for table structure recognition</title>
		<author>
			<persName><forename type="first">L</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Zheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Huang</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.acl-long.137</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics</title>
		<title level="s">Long Papers</title>
		<meeting>the 61st Annual Meeting of the Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="2437" to="2449" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents</title>
		<author>
			<persName><forename type="first">D</forename><surname>Prasad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gadpal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Kapadni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Visave</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Sultanpure</surname></persName>
		</author>
		<idno type="DOI">10.1109/CVPRW50498.2020.00294</idno>
	</analytic>
	<monogr>
		<title level="m">IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)</title>
				<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="2439" to="2447" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">Z</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Luo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Tang</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2406.14991</idno>
		<title level="m">SpreadsheetBench: Towards challenging real world spreadsheet manipulation</title>
				<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Table-GPT: Table fine-tuned GPT for diverse table tasks</title>
		<author>
			<persName><forename type="first">P</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>He</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Yashar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Cui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Rifinski Fainman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chaudhuri</surname></persName>
		</author>
		<idno type="DOI">10.1145/3654979</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ACM on Management of Data</title>
				<meeting>the ACM on Management of Data</meeting>
		<imprint>
			<date type="published" when="2024">2024</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="1" to="28" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">TURL: table understanding through representation learning</title>
		<author>
			<persName><forename type="first">X</forename><surname>Deng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lees</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Yu</surname></persName>
		</author>
		<idno type="DOI">10.14778/3430915.3430921</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the VLDB Endowment</title>
				<meeting>the VLDB Endowment</meeting>
		<imprint>
			<date type="published" when="2020">2020</date>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="page" from="307" to="319" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">TableLlama: Towards open large generalist models for tables</title>
		<author>
			<persName><forename type="first">T</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Yue</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Sun</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2024.naacl-long.335</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long Papers</title>
		<meeting>the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</meeting>
		<imprint>
			<date type="published" when="2024">2024</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="6024" to="6044" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<title level="m" type="main">Large language model for table processing: A survey</title>
		<author>
			<persName><forename type="first">W</forename><surname>Lu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Fan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Fu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Du</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2402.05121</idno>
		<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">cTBLS: Augmenting large language models with conversational tables</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>Sundar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Heck</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2023.nlp4convai-1.6</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th Workshop on NLP for Conversational AI</title>
				<meeting>the 5th Workshop on NLP for Conversational AI (NLP4ConvAI)</meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="59" to="70" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">A survey on table question answering: Recent advances</title>
		<author>
			<persName><forename type="first">N</forename><surname>Jin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Siebert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Chen</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-981-19-7596-7_14</idno>
	</analytic>
	<monogr>
		<title level="m">Knowledge Graph and Semantic Computing: Knowledge Graph Empowers the Digital Economy</title>
				<meeting><address><addrLine>Singapore</addrLine></address></meeting>
		<imprint>
			<publisher>Springer Nature</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="174" to="186" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">FinQA: A dataset of numerical reasoning over financial data</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Smiley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Shah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Borova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Langdon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Moussa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Beane</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T.-H</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Routledge</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">Y</forename><surname>Wang</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2021.emnlp-main.300</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing</title>
				<meeting>the 2021 Conference on Empirical Methods in Natural Language Processing</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="3697" to="3711" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">TAT-QA: A question answering benchmark on a hybrid of tabular and textual content in finance</title>
		<author>
			<persName><forename type="first">F</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Lei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Lv</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Feng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T.-S</forename><surname>Chua</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2021.acl-long.254</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing</title>
		<title level="s">Long Papers</title>
		<meeting>the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="3277" to="3287" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">Romanus</forename><surname>Myrberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Danielsson</surname></persName>
		</author>
		<ptr target="http://lup.lub.lu.se/student-papers/record/9126226" />
		<title level="m">Question-Answering in the Financial Domain</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
		<respStmt>
			<orgName>Department of Computer Science, Lund University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Master&apos;s thesis</note>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">CLTR: An end-to-end, transformer-based system for cell-level table retrieval and table question answering</title>
		<author>
			<persName><forename type="first">F</forename><surname>Pan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Canim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Glass</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gliozzo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Fox</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/2021.acl-demo.24</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations</title>
				<meeting>the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="202" to="209" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">UFO: Proposal for an information extraction task for tables in annual securities reports</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Kimura</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Kondo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Kadowaki</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kato</surname></persName>
		</author>
		<idno type="DOI">10.11517/jsaisigtwo.2022.FIN-029_32</idno>
	</analytic>
	<monogr>
		<title level="m">JSAI Technical Report, Type 2 SIG</title>
				<imprint>
			<date type="published" when="2022">2022</date>
			<biblScope unit="volume">FIN-029</biblScope>
			<biblScope unit="page" from="32" to="38" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Toward the construction of a dataset for table structure analysis for annual securities reports</title>
		<author>
			<persName><forename type="first">K</forename><surname>Kadowaki</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kimura</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Kondo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Ototake</surname></persName>
		</author>
		<idno type="DOI">10.11517/jsaisigtwo.2023.FIN-030_100</idno>
	</analytic>
	<monogr>
		<title level="m">JSAI Technical Report, Type 2 SIG</title>
				<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="volume">FIN-030</biblScope>
			<biblScope unit="page" from="100" to="105" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Creating a question-answering dataset for securities reports and evaluation of the method using LLM (in Japanese)</title>
		<author>
			<persName><forename type="first">E</forename><surname>Sato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kimura</surname></persName>
		</author>
		<ptr target="https://www.ieice.org/publications/search/summary.php?id=132450&amp;tbl=ken&amp;lang=jp" />
	</analytic>
	<monogr>
		<title level="j">IEICE Technical Report</title>
		<imprint>
			<biblScope unit="volume">124</biblScope>
			<biblScope unit="issue">173</biblScope>
			<biblScope unit="page" from="93" to="98" />
			<date type="published" when="2024">2024</date>
			<publisher>The Institute of Electronics, Information and Communication Engineers</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Analysis of tabular data contained in the TOPIX100 annual securities report</title>
		<author>
			<persName><forename type="first">E</forename><surname>Sato</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kaji</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kimura</surname></persName>
		</author>
		<idno>E-021</idno>
		<ptr target="https://www.ieice.org/publications/conferences/summary.php?id=FIT0000015362&amp;ConfCd=F&amp;conf_type=F&amp;year=2022" />
	</analytic>
	<monogr>
		<title level="m">The 21st Forum on Information Technology (FIT2022)</title>
				<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
	<note>in Japanese</note>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Analysis of machine-unreadable table structures in securities reports</title>
		<author>
			<persName><forename type="first">K</forename><surname>Okuyama</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kimura</surname></persName>
		</author>
		<ptr target="https://www.anlp.jp/proceedings/annual_meeting/2024/pdf_dir/P3-20.pdf" />
	</analytic>
	<monogr>
		<title level="m">The 30th Annual Meeting of the Association for Natural Language Processing (NLP2024)</title>
				<imprint>
			<date type="published" when="2024">2024</date>
			<biblScope unit="page">P3-20</biblScope>
		</imprint>
	</monogr>
	<note>in Japanese</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
