{"id":1968221863,"date":"2025-11-10T14:00:25","date_gmt":"2025-11-10T05:00:25","guid":{"rendered":"https:\/\/www.sakura.ad.jp\/corporate\/?post_type=en_newsreleases&#038;p=1968221863"},"modified":"2025-11-11T17:29:38","modified_gmt":"2025-11-11T08:29:38","slug":"paper-from-sakura-internet-research-center-accepted-for-ai4mat-workshop-at-world-leading-international-ai-conference-neurips-2025","status":"publish","type":"en_newsreleases","link":"https:\/\/www.sakura.ad.jp\/corporate\/en\/information\/2025\/11\/10\/1968221863\/","title":{"rendered":"Paper from SAKURA internet Research Center Accepted for AI4Mat Workshop at World-Leading International AI Conference NeurIPS 2025"},"content":{"rendered":"<p>\u3000A paper by researchers from SAKURA internet Research Center, the in-house research organization of SAKURA internet Inc. (hereinafter, &#8220;SAKURA internet,&#8221; Headquarters: Osaka City, Osaka; Founder, President &amp; CEO: Kunihiro Tanaka), has been accepted for the AI for Accelerated Materials Discovery (AI4Mat) Workshop, held in conjunction with Neural Information Processing Systems (NeurIPS) 2025, a world-renowned international conference in the field of AI and machine learning.<br \/>\nThe paper will be presented in San Diego, California, USA, on Saturday, December 6, 2025, local time.<\/p>\n<h3>Research Overview<\/h3>\n<p>\u3000SAKURA internet Research Center has released a new dataset called MatPROV, which extracts material synthesis procedures from scientific literature and enables the capture of causal relationships among materials, operations, and conditions using directed graphs. The novelty and utility of this dataset were recognized, which led to the acceptance of the paper.<\/p>\n<h3>Research Background and Objectives<\/h3>\n<p>\u3000SAKURA internet Research Center advances research and development in the field of laboratory informatics, making the management, analysis, and sharing of research data more efficient. Specifically, we are developing an electronic laboratory notebook platform to accelerate the process of scientific discovery. It is extremely important for electronic laboratory notebooks to record experiment procedures accurately and systematically.<\/p>\n<p>\u3000The paper accepted for the conference focuses on addressing these challenges in the following two ways:<br \/>\n1. Researching the optimal capture format for recording experiment procedures as structured data<br \/>\n2. Utilizing large language models (LLMs) to automatically convert experiment procedures described in natural language into structured data<\/p>\n<p>\u3000We selected materials science as our research subject because it is widely known that the procedures of material synthesis greatly affect the resulting material properties. This makes materials science a suitable subject for verifying the effectiveness of digitizing experiment procedures. However, existing studies have relied on limited schemas with predefined structures to capture synthesis procedures and have struggled to capture the complexity observed in real-world material synthesis procedures, such as branching and merging operations.<\/p>\n<h3>Research Features and Outcomes<\/h3>\n<p>\u3000To address these limitations, SAKURA internet Research Center adopted the Provenance Data Model (PROV-DM), an international standard for provenance information, which can flexibly model procedures and causal relationships. Furthermore, we used LLMs to extract synthesis procedures from scientific literature and constructed our own PROV-DM-compliant dataset, MatPROV.<\/p>\n<p>\u3000MatPROV captures the relationships among materials, operations, and conditions through visually intuitive directed graphs, serving as a foundation broadly applicable to future research, such as AI-driven automated synthesis planning and process optimization.<\/p>\n<p>\u3000The results of this research are expected to open new possibilities for AI-driven material exploration and to contribute to the acceleration of sustainable material development. Additionally, the knowledge gained from this research can serve as foundational technology for enhancing the recording and structuring of experiment procedures in electronic laboratory notebooks, leading to new developments in data management and utilization within research environments.<\/p>\n<p>\u3000SAKURA internet Research Center will continue its research and development endeavors to realize new and valuable internet infrastructures for society.<\/p>\n<h2>About the Accepted Paper<\/h2>\n<h3>Title<\/h3>\n<p>MatPROV: A Provenance Graph Dataset of Material Synthesis Extracted from Scientific Literature <br \/>\nHirofumi Tsuruta (SAKURA internet Inc.), Masaya Kumagai (SAKURA internet Inc., Kyoto University)<\/p>\n<h3>Paper<\/h3>\n<p><a href=\"https:\/\/arxiv.org\/abs\/2509.01042\" target=\"_blank\">https:\/\/arxiv.org\/abs\/2509.01042<\/a><\/p>\n<h3>Open Dataset<\/h3>\n<p><a href=\"https:\/\/huggingface.co\/datasets\/MatPROV-project\/MatPROV\" target=\"_blank\">https:\/\/huggingface.co\/datasets\/MatPROV-project\/MatPROV<\/a><\/p>\n<h3>Abstract<\/h3>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"image-outline aligncenter wp-image-1968221864 size-large\" src=\"https:\/\/www.sakura.ad.jp\/corporate\/wp-content\/uploads\/2025\/11\/matprov_overview-1024x394.png\" alt=\"\" width=\"1024\" height=\"394\" srcset=\"https:\/\/www.sakura.ad.jp\/corporate\/wp-content\/uploads\/2025\/11\/matprov_overview-1024x394.png 1024w, https:\/\/www.sakura.ad.jp\/corporate\/wp-content\/uploads\/2025\/11\/matprov_overview-300x116.png 300w, https:\/\/www.sakura.ad.jp\/corporate\/wp-content\/uploads\/2025\/11\/matprov_overview-768x296.png 768w, https:\/\/www.sakura.ad.jp\/corporate\/wp-content\/uploads\/2025\/11\/matprov_overview-1536x592.png 1536w, https:\/\/www.sakura.ad.jp\/corporate\/wp-content\/uploads\/2025\/11\/matprov_overview-2048x789.png 2048w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p style=\"text-align: center;\"><u>Study Overview Diagram<\/u><\/p>\n<p>\u3000In materials research, the synthesis procedure is a crucial factor that directly influences the properties of the material. With data-driven approaches increasingly accelerating materials discovery in recent years, there has been growing interest in extracting synthesis procedures from scientific literature as structured data.<br \/>\n\u3000However, many existing studies rely on schemas tailored to specific materials with predefined fields or assume that synthesis procedures are linear sequences of operations, inadequately capturing the structural complexity inherent in real-world synthesis procedures.<br \/>\n\u3000To address these limitations, SAKURA internet Research Center adopted PROV-DM, an international standard for provenance information, as a way of modeling synthesis procedures, enabling the flexible representation of procedures as directed graphs.<br \/>\n\u3000In this study, we present MatPROV, a dataset of PROV-DM-compliant synthesis procedures extracted from scientific literature using LLMs. MatPROV captures the structural complexity and causal relationships among materials, operations, and conditions through visually intuitive directed graphs. This representation enables the acquisition of synthesis-related knowledge in a machine-interpretable form, paving the way for future developments such as automated synthesis planning and synthesis process optimization.<\/p>\n<h2>About the Presentation at the NeurIPS 2025 AI4Mat Workshop<\/h2>\n<h3>About NeurIPS<\/h3>\n<p>NeurIPS is an international conference in the field of artificial intelligence (AI) and machine learning, established in 1987. It is regarded as one of the world&#8217;s most prestigious conferences due to the large number of submissions and its rigorous peer review process, which results in low acceptance rates. Speakers at the conference present innovative research across a broad range of AI and machine learning, including deep learning, reinforcement learning, computer vision, natural language processing, and applied research in various fields.<br \/>\nThe 39th NeurIPS conference will be held in the United States in December 2025.<\/p>\n<h3>About the AI4Mat Workshop<\/h3>\n<p>The AI4Mat Workshop first took place in 2022 as an opportunity for researchers in materials science and AI to discuss and share the challenges and achievements at the forefront of AI-driven materials discovery.<\/p>\n<h3>Date and Venue<\/h3>\n<p>Date: Saturday, December 6, 2025<br \/>\nVenue: San Diego Convention Center, San Diego, California, USA<\/p>\n<h3>Presenter<\/h3>\n<p>Hirofumi Tsuruta &#8211; SAKURA internet Research Center<\/p>\n<h3>Details<\/h3>\n<p>Please see the website below.<br \/>\n<a href=\"https:\/\/neurips.cc\/Conferences\/2025\" target=\"_blank\">https:\/\/neurips.cc\/Conferences\/2025<\/a><\/p>\n","protected":false},"featured_media":1968221916,"template":"","class_list":["post-1968221863","en_newsreleases","type-en_newsreleases","status-publish","has-post-thumbnail","hentry"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.sakura.ad.jp\/corporate\/wp-json\/wp\/v2\/en_newsreleases\/1968221863","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.sakura.ad.jp\/corporate\/wp-json\/wp\/v2\/en_newsreleases"}],"about":[{"href":"https:\/\/www.sakura.ad.jp\/corporate\/wp-json\/wp\/v2\/types\/en_newsreleases"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.sakura.ad.jp\/corporate\/wp-json\/wp\/v2\/media\/1968221916"}],"wp:attachment":[{"href":"https:\/\/www.sakura.ad.jp\/corporate\/wp-json\/wp\/v2\/media?parent=1968221863"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}