May 29, 2025
[Paper-club sessions] SciAgents: Automating Scientific Discovery Through Multi-Agent Intelligent Graph Reasoning
SciAgents uses multi-agent AI and graph reasoning to autonomously generate hypotheses, revealing hidden patterns in scientific data.
![[Paper-club sessions] SciAgents: Automating Scientific Discovery Through Multi-Agent Intelligent Graph Reasoning](https://cdn.prod.website-files.com/6654b3697a9d140e0ca14e14/683860fc00e58c11354f7c27_ChatGPT%20Image%2029%20de%20mai.%20de%202025.png)
The rapid advancement of artificial intelligence has opened new frontiers in scientific discovery, with researchers seeking ways to automate the complex process of hypothesis generation and validation. In the paper "SciAgents: Automating Scientific Discovery Through Multi-Agent Intelligent Graph Reasoning" [1] by Alireza Ghafarollahi and Markus J. Buehler from the Massachusetts Institute of Technology, a novel framework is introduced that leverages multiple AI agents to autonomously explore scientific domains, identify complex patterns, and uncover previously unseen connections in vast scientific data. Presented by Caio Damasceno during CloudWalk's paper-club, a weekly meeting where team members present and discuss cutting-edge research across various domains, SciAgents represents a significant step toward AI systems that can independently advance scientific understanding by combining large language models with sophisticated knowledge representation techniques, particularly focused on the discovery of biologically inspired materials.
At the current state, most AI models are very good at compiling knowledge. They are trained over enormous datasets and can be used basically as a source to assess those contents all in the same place. And that’s already pretty good! But what if we could go further? How could AI actually bring new knowledge to the table, based on what we know by now? With SciAgents, Ghafarollahi and Buehler propose a framework to autonomously generate and refine research hypotheses.

FIGURE 1: The scientific method is a systematic way to investigate phenomena and acquire knowledge. It involves observing, formulating a testable hypothesis, conducting experiments, analyzing data, and drawing conclusions. This iterative process may lead to revising or rejecting the hypothesis, or formulating new ones, until a satisfactory conclusion is reached.
The Scientific Method consists of following steps towards discovering how nature works. Those steps can be summarized as: observation, question, hypothesis, experiment, analysis and conclusion, as displayed in Figure 1. The paper focuses on the hypothesis step, by raising new possibilities for further experimentation. Traditionally hypothesis proposals rely heavily on human researchers’ creativity, background knowledge, inspiration and ingenuity. With the explosion of scientific literature and data, it has become increasingly difficult for scientists to identify cross-disciplinary connections that might lead to breakthrough discoveries. Meanwhile, recent advances in large language models (LLMs) like GPT-4 have demonstrated remarkable capabilities across diverse domains, but these models often struggle with the specialized demands of scientific discovery, including challenges related to accuracy, accountability, and transparency when operating outside their initial training scope.

FIGURE 2: Overview of the multi-agent graph-reasoning system developed by the authors. The visual shows the progression from scientific papers as data source to knowledge graph construction, followed by two distinct approaches for AI-driven scientific discovery: one using pre-programmed sequences of interactions between specialized agents, and another featuring fully automated, flexible agent frameworks that adapt dynamically to evolving research contexts.
SciAgents addresses these limitations through an innovative framework that combines three core components: large-scale ontological knowledge graphs, a suite of large language models, and a multi-agent system with in-situ learning capabilities. As we can see in Figure 2, the system first transforms scientific papers into comprehensive knowledge graphs, organizing diverse concepts into interconnected nodes and edges that reveal relationships across disciplines.
FIGURE 3: The diagram shows the agents involved in the SciAgents framework. The 3 agents highlighted in the red box are only included in the second approach, which uses automated interactions.

The project itself was performed with two different approaches, one with pre-programmed interactions between agents and predefined sequences of tasks; and another with fully automated agent interaction and the possibility for human integration in the loop. Both of them used a knowledge graph from a previous work [2], with over 33,000 nodes and almost 49,000 edges. Research hypotheses are proposed based on sub-graphs generated by connecting two concepts (random or pre-selected) through a random path. The SciAgents framework utilizes a multi-agent AI system to enhance the hypothesis-generation process, with each agent specializing in a specific role, as shown in Figure 3. The Ontologist defines key concepts and relationships, structuring knowledge into an ontological framework that facilitates research discovery. Scientist 1 crafts a detailed research proposal based on these structured relationships, which Scientist 2 then refines and expands, ensuring the hypothesis is well-formed and viable for further exploration. The Critic plays a crucial role in reviewing the proposal, identifying weaknesses, and suggesting improvements. In a more advanced, second approach, additional agents come into play: the Planner develops a structured research plan, managing task distribution and workflow. The Assistant evaluates the novelty of the generated hypotheses using external tools, ensuring originality and relevance. Finally, the Group Chat Manager coordinates communication by selecting the next speaker and disseminating information to all agents. Together, this structured system of agents mimics human collaborative research, enhancing the efficiency and creativity of scientific discovery. What makes SciAgents particularly powerful is its methodical approach to breaking down the scientific discovery process into manageable subtasks executed through a hierarchical expansion strategy.

FIGURE 4: Overview of the entire process from initial keyword selection to the final document, following a hierarchical expansion strategy where answers are successively refined and improved, enriched with retrieved data, critiqued and amended through critical modeling, simulation and experimental tasks.
As illustrated in Figure 4, the system begins with initial keyword identification or random exploration within a knowledge graph, followed by path sampling to create a subgraph of relevant concepts and relationships. This subgraph forms the basis for generating structured output covering multiple aspects like hypothesis, outcome, mechanisms, and design principles. The process then follows a systematic workflow where each component is expanded through individual prompting, undergoes critical review, and receives specific recommendations for modeling and experimental priorities. This comprehensive methodology ensures that the generated research ideas are both innovative and grounded in scientific rigor.
The experimental results demonstrate SciAgents' ability to generate novel and feasible research hypotheses in the domain of biologically inspired materials, paving the way for applications in other fields as well. In one compelling example, the system connected seemingly unrelated concepts "silk" and "energy-intensive" to propose a composite material integrating silk with dandelion-based pigments. As shown in Figure 5, the knowledge graph connecting these concepts reveals a rich network of relationships that the system leverages to generate its hypothesis. We can see in Figure 5 how the random path approach (panel a) incorporates significantly more concepts than the shortest path (panel b), providing a richer substrate for hypothesis generation. Based on this expanded knowledge representation, the system predicted enhanced mechanical strength (up to 1.5 GPa compared to traditional silk's 0.5-1.0 GPa) and reduced energy consumption by approximately 30%.

FIGURE 5: The knowledge graphs connecting the keywords "silk" and "energy-intensive" extracted from the global graph using (a) random path and (b) the shortest path between the concepts. The figure demonstrates how enhanced sampling invokes additional concepts that get incorporated into research development, yielding more sophisticated research concepts.
The authors conducted multiple experiments using both pre-programmed and fully automated multi-agent approaches, consistently producing hypotheses with high novelty and feasibility scores as assessed against existing literature. The system's ability to identify unexpected properties, such as self-healing capabilities and stimuli-responsive structural colors, further underscores its potential for scientific innovation. When using the automated approach with random sampling of concepts, SciAgents generated diverse research ideas ranging from biomimetic microfluidic chips with enhanced heat transfer performance to novel graphene-amyloid composites for bioelectronic devices.
This work opens exciting possibilities for the future of scientific discovery, particularly in materials science. By harnessing what the authors call a "swarm of intelligence" similar to biological systems, SciAgents demonstrates that AI systems can effectively navigate the complexities of scientific research, generating hypotheses that might otherwise remain undiscovered. While the current implementation focuses on biologically inspired materials, the framework's modular design suggests potential applications across numerous scientific domains. As the authors note, future enhancements could include incorporating agents capable of conducting experimentation or soliciting data from simulation studies, thereby closing the loop on scientific discovery. SciAgents represents a significant advancement in automation of scientific discovery, offering researchers a powerful tool that not only assists in hypothesis generation but potentially accelerates the entire scientific process from ideation to validation.
Final remarks
Although extremely robust and relevant, the work presented by Ghafarollahi and Buehler represents only the beginning of what could be a profound interaction between humans and AIs in the implementation of the Scientific Method. Hypothesis generation is a fundamental step for the emergence of new theories and the expansion of human knowledge, but including the participation of these agents in the remaining stages of scientific research would have a massive impact on scientific production. From the stages of observation and questioning to experimentation and analysis, there is room to incorporate AI participation to enhance human capacity. It is imperative that we continually seek ways and mechanisms to use AI and new technologies for the advancement and development of humanity, and scientific knowledge is a fundamental aspect of achieving this.
As a reflection, although Knowledge Graphs have proven to be a useful tool for research in the field of biomaterials, there is room to evaluate other strategies that may be better suited to other areas of knowledge—particularly those where the rationale between different nodes, or concepts, is more numerical and quantitative than lexical.
References:
[1] Ghafarollahi, A., & Buehler, M. J. (2024). Sciagents: Automating scientific discovery through multi-agent intelligent graph reasoning. arXiv preprint arXiv:2409.05556. Available at: https://arxiv.org/pdf/2409.05556
[2] Buehler, M. J. Accelerating scientific discovery with generative knowledge extraction, graph-based representation, and multimodal intelligent graph reasoning. Machine Learning: Science and Technology (2024). URL http//iopscience.iop.org/article/10.1088/2632-2153/ad7228.
Explore our content
Get to know and learn more about Cloudwalk below.