Food and Ecological Systems Modelling Journal :
Formal Model Article Format
|
Corresponding author: Christopher John Topping (cjt@ecos.au.dk)
Academic editor: Lyubomir Penev
Received: 29 Jul 2022 | Accepted: 20 Sep 2022 | Published: 04 Oct 2022
© 2022 Christopher John Topping, Luna Kondrup Marcussen, Peet Thomsen, Jordan Chetcuti
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Topping CJ, Kondrup Marcussen L, Thomsen P, Chetcuti J (2022) The Formal Model article format: justifying modelling intent and a critical review of data foundations through publication. Food and Ecological Systems Modelling Journal 3: e91024. https://doi.org/10.3897/fmj.3.91024
|
Documenting complex models has long been a problem. Models are currently developed, implemented, and applied before review. Combined this leads to details hidden in the appendices or too little detail in the methods section to be reproducible. Modellers involve reviewers too late in the process. This does not allow them to flag issues, suggesting redesigns and reruns only after the analysis is complete. We propose splitting the model documentation, before analysis, into three steps: the Formal Model, Implementation Documentation, and Evaluation and Testing. Researchers can then use the well-built model for analysis. We introduce the first of these, the Formal Model as a peer-reviewed paper format that lays out the intentions for the model. The Formal Model includes reviewed literature that identifies the components of the model. Lays out the theoretical framework, modelling approaches and externalities. Plans to implement each process, with equations, descriptions, state variables and scales. Finally, the Formal Model gives the model’s strengths, weaknesses, exclusions, and place in the literature. We provide a flexible template for a Formal Model to aid in establishing a new common format.
The Formal Model aims to improve transparency and provide a formal approach to documentation. Reviewers can help improve the model by identifying problems early. The Formal model contains the details needed to allow for reproducibility. It also encourages modellers to think about the consequences of what is and is not included within the model. And finally, it gives the credit that modellers deserve for the involved process of creating a model.
The Food and Ecological Systems Modelling Journal (FESMJ) aims to provide opportunities for modellers to showcase their work (
We see the Formal Model as a mechanism to achieve three main goals. The first is a formal approach to describe a modeller’s intention for a model. Reaching this point is a considerable task, requiring extensive literature review, careful design, and ingenuity. This work often goes uncredited, with potential impacts on rigour and scientific quality (
As such, we propose a three-step publication process:
The Formal Model is the first step toward model development of systems models following a standard modelling cycle (
Thus, the Formal Model occupies the first quarter of the first iteration of the modelling cycle (Fig.
Since FESMJ is the only journal that offers this particular set of paper formats for models, the journal will be in a position to publish all three, subject to quality control. However, publiation of these would not be mandatory. The situation where a formal model is proposed by an author but not followed up could provide the chance for others to develop the model, and this might even be indicated in the formal model paper if known in advance.
The Formal Model should document the breadth and depth of information used to create the planned model. This documentation may include information not used in the model but discarded for a particular reason. The overall scope of the document is to reveal the intentions of the modeller and the information used to realise those intentions. This process should then be opened to peer review before a lot of time and resources are used to progress the work further.
Currently, in modelling studies, the normal process is to construct a model, calibrate and test it, then apply it. Only after the last step a peer-reviewed publication is made. This often leads to reviewers questioning modelling decisions and even suggesting re-designs and re-runs of models after a great deal of work has already been done. In addition, within science there has been a criticism of the tendency to pass off post-diction as prediction and therefore invalidation of hypothesis testing, with analysis being redesigned on the fly or creating a hypothesis to fit results (
Preregistration has been adapted for modelling (
The problem of documenting model building is not new, and others have suggested various approaches to overcome this. Initially, well-commented code was a good starting point, and this has been developed into detailed documentation e.g., using Doxygen (e.g., JSBSim (
Different scientific and business domains define formal models variously. The Formal Methods Model in software engineering is a precisely defined description of components and relationships in a complex piece of software, giving an overview for planning development. This spread to business in the form of formal process modelling (
We propose the use of a Formal Model paper. The Formal Model will state the intent of the model, giving the aims and purpose. A review of the literature to identify the key components that will be needed for the model, including any theoretical framework, modelling approaches and externalities. The Formal Model gives an overview of the processes that will be in the model and describes them in terms of the current state of knowledge. It demonstrates how the modeller plans to represent each process, giving equations, descriptions, state variables and scales. Finally, the format includes a discussion on the model’s strengths and weaknesses and places the model in a scientific narrative. This includes explaining how things not included within the current scope could affect the model or be incorporated in a future version.
The modeller should create a Formal Model as part of the modelling process. When the purpose of the model is defined, it forms the basis for the intent of the model. Thus, the Formal Model becomes a living document, to be added to and refined throughout the process. The modeller should complete the Formal Model before creating a finalized and documented versioned model. The completion occurs before final calibration and testing to allow the modeller to incorporate changes to the model from the review process. This three-tier approach (Formal Model, Implementation, and Evaluation and Testing) is best suited to any model that cannot be explained succinctly within the normal methods section of a paper. In fact models where this is the case or where the step from formal model to implementation are not suitable for this format and should probably combine formal model and implementation into a single article. Such models may include many decisions and assumptions that could, and often are, disputed during the final review of a model application. These decisions could also have an impact on the state of knowledge, policy or practice informed by the model. The Formal Model can be used for a diverse range of models, for example, social, agent-based, sub-population, behavioural, or food model processes. The underlying reasons for the Formal Model in all are the same: to avoid bias and to communicate the model structure, processes and background knowledge used to construct the model.
For example, agent-based models are simulations that are designed from the perspective of an entity, i.e., an agent (
Having stated what a Formal Model is, it is also important to state what it is not. The Formal Model is not suitable for very simple models. The model should have a volume and complexity that would make descriptions inside model application articles difficult, or might be a substantial sub-model that otherwise may not be described in detail. To allow for the feedback of peers and reviewers the Formal Model is an early formulation of the concepts before implementation, model evaluation, and testing. The formal model, therefore, does not include these next stages of model development. The Formal Model is the springboard to developing the code implementation before moving to further stages of the modelling cycle. Thus the Formal Model cannot include the experimental outputs and eventual documentation and evaluation as it could for a simple model. However, the implementation, evaluation and testing of models also have specific focus within the scope of FESMJ.
What format should the Formal Model take? By defining a template, much in the way the standard scientific papers are structured including introduction, methods, results, and discussion (IMRAD), we will allow those examining the Formal Model to become familiar with what to expect (a strategy in common with other model description formats). This will make reading a Formal Model easier and aid in finding information when dipping in and out of a Formal Model. This prescription is necessary to give a common format, but there needs also to be a degree of flexibility to enable the Formal Model to cope with a variety of model types. We, therefore, propose the following structure and give a brief description of the content of each section (Fig.
An introduction like any standard paper, with the exception, that this does not lay out problems and hypotheses per se. Instead, this introduction would lay out the reasons for creating the model in question, the theory that supports the model and the modelling approach. It would of course lay out the salient literature defining the model overview and the theoretical framework.
This is in effect the problem formulation, explaining the aim of creating the model, why the modeller chose this model and approach, and in what ways could the model be used.
Theoretical frameworks are taken from the relevant scientific domain and describe the perspective of the research as defined by the theories that the work is based in (
This section includes an overview of external influences on the model and model results, that are not explicitly included by the modeller in the model, but which will potentially affect the model outcome. This is in the form of a narrative explaining the things that we left out of the model knowingly. This process of framing the model is considered a way to avoid false inclusions and false-exclusion errors (
This section will be needed for each of the processes described in the overview. Here, each component is described in detail, including all relevant knowledge, state variables, and scale information necessary to understand the implementation of the process. Note that it is not necessary to follow each heading precisely, nor include them as headings, it is important that the information is present though:
Review and describe the current state of knowledge for each component
This section can be quite long and lists all the important references leading to the planned implementation. Typically, this will include tables and diagrams from literature used to develop the concepts applied in the model. In most cases this will include suggested starting values from parameters and process descriptions based on literature, although in some cases the model may be a set of equations without the need to specify parameter values.
Planned implementation of each component with a formal representation
The method of implementing the process in the model is described here in the Formal Model. This may be in form of a short text and equations or flow chart. For example, in the agent-based model ApisRAM (
“Every bee consumes resources and generates heat according to its metabolic rate q, in units of kcal s−1. Each class of bee has a metabolic rate determined by its activity. The temperature increase is defined as
\(T(t+∆t)=T(t)+Qs\)
where Q is the heat produced by burning q∆t of nectar, and s is the heat capacity of the bee.”
State variables, spatial and temporal scales (with units)
The state variables associated with the process. State variables are variables describing the structure or quality of an entity or process. For example, age, size, growth rate, or parameter values. State variables like these should be described and any units included. Similarly, time and spatial scales over which the processes operate should be described as appropriate for the process. In the above example the metabolic rate q has units of kcal per second, giving its temporal scale. In larger or more complex examples it is often useful to tabulate the parameter values. Since these parameters will typically be referenced in equations using symbols, its symbol, units, any predetermined value, and a short description of its meaning could all be usefully included in a table. For example, an entry to the table might look like Table
Example of a table describing constants and variables in a Formal Model, taken from
Symbol |
Units |
Value |
Meaning |
THC |
°C |
60 |
The upper survival temperature for Nosema ceranae ( |
… |
In this case in Table
This section of the Formal Model would introduce the main components to be explained in detail under the subheadings that follow it. This overview describes the interconnections between the components from a high-level perspective. This will typically include an overall diagram of processes and connections and should thus serve as a roadmap to the details presented in the following sections. For larger models this section may be quite long and include multiple diagrams as necessary to provide the overview of the model from a structural and process point of view.
This section is optional but may be useful especially for the case when the model implementation would be or is simple or when the model forms a sub-component of a larger model. This would not be a full implementation with calibration and sensitivity analysis but would serve to explore the properties of the model behaviour to help elucidate the functioning of the model. This could also include trials of ideas in a simplified form or expected properties and behaviours. This section might include examples of output under controlled conditions used to demonstrate properties of the model or some of its components.
This section will discuss any aspects of the model or model development that might be of interest to the reader, including lessons learned from the modelling process. Here, it might be relevant to discuss the coverage of information needed for the model development, highlighting gaps or inconsistencies. This section must cover the strengths and weaknesses of the model.
An imperfect solution is better than no solution. The proposed Formal Model format is not a silver bullet and has drawbacks as well as advantages. The advantages are that it clearly defines the modelling intention and provides a relatively compact, but still substantial, article for evaluating this. It gives credit to the modeller for the amount of work necessary to craft the design and it provides a review of the existing knowledge for model creation. It also provides a chance to catch problems in the design earlier in the process than waiting until peer review of the model application, at least if review feedback is rapid. This is a major advantage compared to the current ‘all-in-one’ approach used in the majority of journal articles where overview, design and implementation are combined in a single step. This is of particular importance to the more complex simulations which otherwise drown in detail, resulting in voluminous and rarely read model descriptions.
The disadvantages include the fact that it is another burden for the modeller in documenting their work, in addition to normal documentation and user guides. Although the document is smaller than would be needed to combine all the description, implementation, and testing, the Formal Model can still be substantial (e.g., the ApisRAM formal model (
There is also the problem that many of the larger models that this approach is targeted at will undergo several model cycles as new information becomes available or new applications are needed. As such a reviewed and static document does not help and can even confuse the issue. This is where versioning of the model comes in. The Formal Model as published should be associated with a defined model version, typically 1.0. This reduces the chance of confusion as the Formal Model author defines that the description relates to version 1.0 of the model. For later versions, two options are possible. For smaller changes the model author could publish updates to the formal model, without peer-review, as minor versions (version 1.1, 1.2…). Published in the Research Ideas and Outcomes (RIO) or similar journals, or on a model website mentioned in the Formal Model. For a major upgrade or expansion there may be enough material for a second major version of the Formal Model. This would be a version change outside the usual model cycle (e.g., version 2.0). If authors accompany each revision of the Formal Model with a version number, the development of the model over time can be represented. This approach can also form the basis for a dialogue between the modellers and interested stakeholders if the Formal Model versions are published in a journal allowing comments. Another advantage of the proposed format is the opportunity for people/groups without the modelling skills to develop formal models and open up the possibility for collaboration of people with different skillsets. Resulting in more efficient and complex model development.
Another drawback of the documentation process, also shared with all other forms, is that often journals, and journal referees prefer not to rely on secondary literature in the model application articles. This leads to a need to repeat information. A brief synopsis of the documentation will probably be included in related papers for readability, whilst deferring details in reference to the Formal Model. Ideally, wider use of the Formal Model and the other two model documentation formats we suggest will overcome this issue, as referees learn to use them. Preregistration and registered reports are gradually being accepted (
A key advantage of the format suggested is that it embraces the ‘modest approach’ to model construction (
In conclusion, complex models can result in publications with poor transparency. Modellers can leave information out or lose it in the supplementary material. Even when modellers have used a structured approach, they must decide what to include and how to implement a model. Reviewers often question these decisions after the models’ use and analysis. The Formal Model approach we propose aims to address these issues. The proposed format will improve transparency, provide the opportunity to review and give the modeller credit for crafting models, all while improving the approach of the modeller. We hope that through use, this format can improve and be improved by the modelling community, ideally beginning a journey towards better models through improved documentation.