Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

An observational study using eye tracking to assess resident and senior anesthetists’ situation awareness and visual perception in postpartum hemorrhage high fidelity simulation

  • Arnaud Desvergez ,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing

    mederic.descoins@chu-reunion.fr (MD); desvergez.a@gmail.com (AD)

    Affiliations Centre d’Études Périnatales de l’Océan Indien (EA 7388), University Hospital of Reunion Island, Saint Pierre, La Réunion, France, Centre de Simulation en Santé de l’Océan Indien, University Hospital of Reunion Island, Saint Pierre, La Réunion, France, Critical Care Unit, University Hospital of Reunion Island Saint Pierre, La Réunion, France, University of Reunion Island, La Réunion, France

  • Arnaud Winer,

    Roles Conceptualization, Methodology, Resources, Supervision, Validation, Writing – original draft

    Affiliations Centre d’Études Périnatales de l’Océan Indien (EA 7388), University Hospital of Reunion Island, Saint Pierre, La Réunion, France, Centre de Simulation en Santé de l’Océan Indien, University Hospital of Reunion Island, Saint Pierre, La Réunion, France, Critical Care Unit, University Hospital of Reunion Island Saint Pierre, La Réunion, France, University of Reunion Island, La Réunion, France

  • Jean-Bernard Gouyon,

    Roles Funding acquisition, Methodology, Resources, Supervision, Validation

    Affiliations Centre d’Études Périnatales de l’Océan Indien (EA 7388), University Hospital of Reunion Island, Saint Pierre, La Réunion, France, University of Reunion Island, La Réunion, France

  • Médéric Descoins

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft

    mederic.descoins@chu-reunion.fr (MD); desvergez.a@gmail.com (AD)

    Affiliations Centre d’Études Périnatales de l’Océan Indien (EA 7388), University Hospital of Reunion Island, Saint Pierre, La Réunion, France, Centre de Simulation en Santé de l’Océan Indien, University Hospital of Reunion Island, Saint Pierre, La Réunion, France

Abstract

Background

The postpartum hemorrhage (PPH) is the leading cause of maternal mortality in the world. Human factors and especially situation awareness has primarily responsibility to explain suboptimal cares. Based on eye tracking and behavior analysis in high fidelity simulation of PPH management, the goal of this study is to identify perceptual and cognitive key parameters of the expertise.

Methods

Two groups of fifteen anesthetists (residents and experienced anesthetists) watched the beginning of a severe simulated PPH management. During this first experimental phase, situation awareness was assessed using SAGAT (Situation Awareness Global Assessment Technique) questionnaire and visual behavior was analyzed with eye tracking. In the continuity of the video sequence, they have to step in the PPH situation and to provide care to the simulated patient. Performance of cares was evaluated and self-assessed as well as cognitive load.

Results

No statistical difference between the residents and experienced anesthetists was observed on performance of simulated PPH management. The mean expected practice score was 76.9 ± 13.9%). Assessment of situation awareness (65 ± 7%), cognitive load (74.4 ± 11.3%) and theoretical knowledge of PPH (52.4 ± 3.5%) were also not statistically different between the two groups. Only results of self-assessed performance (respectively 66.1 ± 16.6 and 47.0 ± 20.8 for experts and residents) and eye-tracking data revealed that experts tended to get accurate evaluation of their performance and to monitor more the blood loss of the patient. Experts have in average 8.28% more fixating points than Novices and gazed the blood loss region longer (865 ms ± 439 vs. 717 ms ± 362).

Conclusions

This study pointed out the limits of classical assessment of performance, and human factors based on questionnaires to identify expertise in simulated PPH care. A neuroscientific approach with new technology like eye tracking could provide new objective and more sensitive insights on human factors in simulated medical emergency situations.

Introduction

The postpartum hemorrhage (PPH) is the leading cause of maternal mortality in the world [13]. In France, 20% of maternal deaths are attributable to PPH with a large regional disparity [4]. The French overseas regions (as the French Antilles, French Guiana or the Reunion Island) are particularly impacted with prevalence up to 10 times higher. In 2017, a survey estimated that 100% of maternal care was non-optimal with preventable deaths in 100% of cases [5]. Improving the management of this pathology remains a worldwide public health issue.

Retrospective real life case studies indicate a lack of recognition of the initial severity of patients and ineffective treatment choices, particularly concerning the anesthetic management of PPH (recognition and management of hemorrhagic shock, use of transfusion, anesthetic strategy) [68]. The analysis of the human factor in anesthesia resuscitation brings a new grid of reading within the framework of the complex and inter-professional management of this serious pathology.

The export of non-technical skills concepts from aeronautics to anesthesia has made possible to identify significant cognitive impairments during incidents in our practices [9]. Situational awareness (SA) is one of them [10,11]. It can be schematized by a three-level linear cognitive process starting from the integrated relevant information gathering in a dynamic environment (Level 1: Perception), an understanding of the situation (Level 2: Comprehension) and an anticipation of the clinical evolution of this situation which allows the elaboration of a management’s strategy (Level 3: Projection) and leads to the performance.

The dedicated analysis of this cognitive model in anesthesia accidents revealed its involvement in more than 80% of cases with a preponderance of the perception level [12,13]. A lack of information, particularly visual, leads to increase morbi-mortality of our practices. Our eye movements in dynamic situations are closer to cognition. Since the Yarbus’s study in 1956 [14], we know that it is a process controlled by the tasks in progress. The repetition of a correct oculomotor sequence is rewarded by dopaminergic circuits connected to visual attentional centers and is therefore selected by experience [15].

Expertise is a determinant of visual prospecting and performance management. We are convinced of the primary role of prospecting behavior and visual information gathering in the model of situation awareness and its involvement in the generation of incidents of anesthetic management [12,16]. We hypothesized that expert anesthetists develop specific visual behavior related to a higher level of situation awareness and performance than novices while maintaining superior cognitive reserve represented by a lower level of cognitive load.

The objective of this study is to characterize and compare the visual behavior of expert and novice anesthetists during a simulated severe PPH scenario and to assess levels of situation awareness, particularly perceptual, cognitive load and associated performance of simulated PPH management.

Materials and methods

Study design

This prospective observational study utilized a combination of quantitative and qualitative experimental research methodologies extracted from data recorded during experimental eye tracking and PPH sessions of high-fidelity simulation in healthcare. It was carried out from March to May 2016. This study was performed at the Healthcare Simulation Center of the Indian Ocean located in the University Hospital of La Réunion, France.

Participants

30 anesthetists were divided into two equal groups. Novices group included residents with experience of less than 5 years and Experts group included anesthetists with more than 5 years of professional activity. All participants had normal or corrected to normal vision. After a general information session, the participants were required to review and sign a consent form that described the purpose and nature of the study as approved by The Ethic Committee of the CHU de La Reunion. Participants were free to participate and could withdraw at any time. No incentive was given.

Conduct of the experiment

The experimental procedure is presented in Fig 1. The pretest questionnaires assessed the level of knowledge on PPH, the characterization of the anesthesia experience and previous exposure to high-fidelity simulation.

thumbnail
Fig 1. Description of the experimental protocol.

First a pretest questionnaire is performed, it characterizes the physicians’ professional experience and the medical knowledge on PPH. After instruction slides, a video clip is presented to the anesthetist for oculometry. The story began at 6:30 p.m., he received a phone call from the nurse for a PPH in progress. The PPH situation presented on the screen (upper banner) showed Noelle®, the nurse and the obstetrician during the first 15 minutes of management without the anesthetist. The standardized environment is equipped as a conventional delivery room. The severity of PPH and the actors’ contemplative attitude associated with the inefficiency of the first treatments contributed to the clinical degradation of the patient. A SAGAT is performed at the end of the video. The questions concerned the hemodynamics profile, therapies already undertaken, and the complications envisaged for the patient. Then he had 40 seconds to consult the anesthesia file and 20 seconds to consult the blood test. Another SAGAT is then performed. The questions concern its HPP risk factors and its anesthetic management conditions (allergic risk, risk of difficult intubation, transfusion management). The anesthetist was then asked to manage the patient in the same simulated environment (lower band). Management must be intensive, but the situation was getting worse. The decision of hemostasis surgery marked the end of the scenario. A post-test self-assessment questionnaire and a NASA TLX Index conclude the session.

https://doi.org/10.1371/journal.pone.0221515.g001

The eye-tracking procedure then proceeded for 8 minutes in a black room. They were to see the start of simulated PPH management that they will have to manage later. The rationale was to allow the participants to be fully aware of the PPH case they would have to manage during the high-fidelity simulation. The video featured a high-fidelity mannequin, a nurse and an obstetrician from delivery to the deterioration of the patient's clinical condition. Situation awareness was assessed by SAGAT questionnaire during this stage. The anesthetist started the high-fidelity simulation where the video sequence was interrupted matching with the call to the anesthetist. Cognitive coherence is guaranteed by the permanence of actors, time, places and material. The surgical decision ended the scenario. Final questionnaires provided anesthetist self-assessment and measurement of cognitive load by the NASA TLX index. An interview concluded experimentation.

Eye tracking set up and analysis

We used a fixed binocular eye-tracking device (The EyeTribe APS, Copenhagen, http://www.theeyetribe.com/) to measure the visuo-motor behavior of the participants and extract fixation points during viewing of a simulated PPH video. Participants' heads were placed 60 cm from a 22-inch LCD monitor with a resolution of 1920 by 1080 pixels (Philips 221B Brillance). An 8-minute video sequence was filmed in a fixed subjective view. It showed the 15 minutes before the anesthetists’ intervention thanks to the acceleration of non-determinant parts with six sequences related to the obstetrical gestures practiced and the patients clinical worsening (delivery, PPH diagnosis, call for an anesthetist, bleeding worsening, uterine revision and valve examination). As shown in Fig 2, the screenshot of the video provided clear access to the evolution of the time, patient’s bleeding and vital parameters.

thumbnail
Fig 2. Screenshot presenting the video clip and the repartition of the region of interest.

The two actors managed a simulated patient: the nurse on the left, the obstetrician on the right and the high-fidelity manikin in the center. Time, monitoring and quantification of bleeding were inserted on the right of the screen to improve the readability of the information and to better discriminate eye movements. The 17 yellow boxes correspond to the previously defined ROI. “iv”: corresponds to vascular filling fluids, “anestnurse”: nurse anesthesiologist, “anesthequip”: equipment of the anesthetist, “patient”: Noelle® face, “inject”: injection site in the perfusion, “perineal”: perineal area, “obst”: obstetrician, “obsequip”: obstetrical material, “time”: hour of the action. “Ecgcurve”: heart rate curve, “freq”: heart rate, “plethcurve”: plethysmography curve, “sat”: oxygen saturation of the blood, “respcurve”: respiratory rate curve, “resp”: respiratory rate, bBp”: blood pressure, “bloodloss”: blood collection bag.

https://doi.org/10.1371/journal.pone.0221515.g002

With a sampling rate of 60 Hz, 2 points spaced 100 ms apart around a dispersion area of 50 pixels defined our fixation points. This experimental parameterization allowed good discrimination of objects larger than 1.20 cm, respecting the precision margins of the tracker [17]. Presentation, recording, data acquisition and definition of 17 regions of interest (ROI) were managed by the OGAMA v5.0.0 software (Open Gaze and Mouse Analyzer, Freie Universität, Berlin) [18]. Fig 2 shows the distributions of the 17 ROI. The variables extracted were the number of fixations (NBF) and the fixation duration mean (FDM) for each region of interest. Missing data were computed using MCMC method [19,20]. The attentional heat maps were extracted during key steps of care.

Simulation of PPH

After watching the video clip, the anesthesiologists were asked to manage the simulated patient (high-fidelity mannequin Noëlle® 57X.100, Gaumard) with the nurse and the obstetrician. The patient suffered from a hemorrhagic shock on severe PPH with delayed management. Their acting roles were normalized. The nurse followed the instructions of the anesthetist in the order of enunciation. The obstetrician made no therapeutic decisions. He had to continue the uterine massage and tell the anesthetist the persistence of the bleeding. No more help was available. Initially the care provided during the video part was composed of moderate fluid challenge and 30 UI oxytocin injection. Each resuscitation measure undertaken by the anesthetist induced a transient corrective response on the vital parameters of the patient. These responses were standardized and triggered by the simulation pilot. Whatever the actions provided, the trend continued to worsen. The surgery decision marked the end of the scenario. The scene was filmed for performance analysis.

Data analysis

All data were registered with an anonymous inclusion number, and all questionnaires were filed by participants using a PhyMyAdmin database.

PPH performance was scored by a 16-item checklist (S1 Appendix) developed by a medical committee using professional recommendations. A percentage of bad practice was calculated when treatment was not administered at the recommended dosage or with incorrect prescribing rules.

Situation awareness was measured twice with the Situation Awareness Global Assessment Technique (SAGAT) questionnaire during the eye-tracking sequence (S2 Appendix). The 38-question SAGAT was achieved by an expert medical committee following the Goal Directed Task Analysis [21]. The numerical responses were accurate within a 10% range around the true value. We asked to justify the answers to the yes/no questions using keywords. The score was expressed as a percentage.

NASA TLX Index allowed the anesthesiologist to self-evaluate the level of cognitive workload after having been exposed to the situation [22]. Anesthetists’ self-evaluation of their knowledge, simulation performance, clinical understanding and their stress were done with Likert scales.

Statistical analysis

Qualitative data are presented as counts and percentages whereas numerical ones as mean and standard deviation. To test the hypothesis that Experts are the most efficient and less sensitive to the cognitive workload than novices, simulation performance, NASA TLX Index and posttest self-assessments were subjected to parametric tests between the groups. Chi-square test was used for the qualitative variables and the student T-test for the quantitative. To test the hypothesis that the Experts have a higher SA level (particularly perceptual), a two-factor ANOVA (Group: Expert vs. Novices and SA Level: 1, 2 or 3) was made on SAGAT scores. In order to test the hypothesis that an “expert gaze” was applied to the video sequences of the PPH, the oculometric variables (respectively, number of fixation (NBF) and fixation duration mean (FDM)) were tested by ANOVAs on the two factors Groups and ROI (17 regions of interest). Post-hoc tests of Newman-Keuls have been applied to analyze significant effects. A result with P <0.05 was considered significant. All statistical analysis was computed using R v 3.3.1 software.

Results

Descriptive data

The 30 anesthetists included in this study were divided into two groups (Experts and Novices). Table 1 presents the demographic and professional characteristics of the participants. Fig 3 shows the study flow chart. One novice left the experimental protocol before ending.

thumbnail
Fig 3. Experimental flow chart.

30 anesthetists participated in this experimental study. 15 experts with more than 5 years of experience. 15 novices with less than 5 years of experience. 1 novice left the experimental protocol before oculometry. The cumulative experience in maternity ward is significantly higher among Experts 12.1 ± 8.1 years versus 1.0 ± 1.7 years for Novices. Anesthetists’ level of theoretical knowledge on PPH is not statistically different between the groups (52.4% ± 3.5 of correct answers). The experts were confronted with more real PPH situation. They are 67% to have managed more than 50 cases and 27% more than 100 cases. Only 57% of the novices participated in the management of 10 to 50 cases for the most exposed (p <0.05). Participating anesthetists are familiar with high fidelity simulation since 67% of Experts (n = 10/15) and 87% of Novices (n = 13/15) have already practiced it. However, all the novices have already practiced PPH scenario while only 2 experts already did it (p <0.001). None of them practiced eye-tracking experiment before.

https://doi.org/10.1371/journal.pone.0221515.g003

PPH management performance

The Experts did not show a significantly better performance than the Novices, nor in terms of the percentage of expected therapeutic actions (80 ± 8.90% with 10.86 ± 6.25% of bad practices), nor in terms of simulated care duration (19:09 ± 04:45 min). Whereas novices did 73 ± 17.54% of expected therapeutic actions including 16.13 ± 13.34% of bad practices in 18:15 ± 02:44 minutes.

Oculometric data

Overall recording, the missing data rate is 5.17%. 2958 fixation points were computed for analysis. The observation of attentional heat map showed different gaze dispersion between the experts and the novices. Fig 4 shows an example of heat map extracted during the uterine revision phase. Experts paid more attention to monitoring information. As shown in Fig 5, Experts look more often at the ROI “perineal” and look at the ROI “bloodloss” longer than novices (p <0.05). In addition, the most frequently watched ROIs are not those gazed the longest on average.

thumbnail
Fig 4. Heat map during the uterine revision step.

Heat map calculated from anesthetists’ fixation points on video during the uterine revision step. The hotter the point (from blue to red), the greater the cumulative fixation time in the region of interest. Unlike the Novices (bottom), the Experts (above) showed a greater distribution of the gaze on the different elements of the scope. The interest in time and the blood collection bag appeared more important among the Experts. They paid a lot of attention to the perineal area whereas the Novices looked at Noelle’s face instead.

https://doi.org/10.1371/journal.pone.0221515.g004

thumbnail
Fig 5. Graphical representation of the fixation number and fixation duration means of Experts and Novices according to the ROI.

Presentation of the eye-tracking data by ROI according to the group (Experts or Novices). The graph on the left shows the fixation duration mean (FDM). Experts gazed on average more the ROI “bloodloss” 865 ms compared to 717 ms in Novices. The graph on the right shows the number of fixation (NBF). The post-hoc analysis revealed only difference between Novices and Experts on the ROI “perineal”. The experts will more often gaze this region than the novices (99 versus 89). The overlap of these curves makes it appear that the ROI most often viewed are not those which are viewed the longest.

https://doi.org/10.1371/journal.pone.0221515.g005

Number of fixation analysis

Analysis of the eye-tracking data revealed that Experts have in average 8.28% more fixating points than Novices (in average per participants respectively n = 791.4 vs. 725.9, F (1, 2754) = 14.5; p <0.001). Specific elements of the environment (effect of the factor ROI) are more often fixated, revealing a higher interest for anesthetists (F (16, 2754) = 155.4; p <0.001). The 5 most frequently watched ROIs were the perineal region (15.8 ± 0.6), the nurse (10.8 ± 0.5), the patient (10.6 ± 0.4), the blood loss (9.3 ± 0.4) and heart rate (7.2 ± 0.3). The post-hoc analysis of the significant interaction effect between the factors Group and ROI (F (16, 2754) = 2.22; p = 0.003) revealed that experts showed a superior interest for the perineal region. The post-hoc analysis of the interaction effect between the Group and ROI factors (F (16, 2754) = 1.82, p <0.05) only revealed that Experts fixed the perineal region significantly more often than the Novices (16.7 ± 0.44 vs. 14.9 ± 0.45, p <0.05), no other ROI were statistically more observed by Experts compared to Novices.

Mean duration of the fixation

Anesthetists had an average time of 414 ± 182 ms (ranging in average from 780 to 153ms) by areas of interest and there is no main effect of the factor Groups was observed (F (1, 2744) = 0.43 p = ns). All the ROI were not fixed with the same interest (F (16.2601) = 70.7 p <0.0001). Post-hoc analysis showed that the 5 ROIs with the highest average fixation duration are blood loss (790 ± 410 ms), significantly observed longer than the perineal region (663 ± 222 ms). Without any difference between Blood pressure (506 ± 258 ms), heart rate (494 ± 260 ms) and the patient (461 ± 215 ms), these regions are significantly gazed for a longer time. The interaction between the factors Group and ROI (F (16.2601) = 2.18, p <0.05) and post-hoc analysis only revealed that Experts did not monitor blood loss like novices. Experts gazed significantly longer blood loss ROI than Novices (865 ms ± 439 vs. 717 ms ± 362, p <0.001).

Situation awareness assessment

The global level of situation awareness has been evaluated with the SAGAT score at 64.8 ± 6.4% among anesthetists. Surprisingly, Experts did not statistically reach a significant higher level of situation awareness (66.3 ± 7.2% vs. 63.1 ± 5.1% respectively, p = ns). The level of situation awareness is burdened by the item comprehension (SA Level 2 = 51.1 ± 8.3%) which is significantly lower than perception and projection levels (SA Level 1 = 70.6 ± 10.6% and level 3 = 72.3 ± 9.7%) (F (2.81) = 43.32, p <0.001).

Post-test anesthetist self-assessment

Workload and realism of the situation.

Anesthetists found the simulated situation realistic (80.1 ± 18.4%) and immersive (79.5 ± 15.1%). Because of the lack of help and the severe condition of the patient, the simulated scenario has a very high level of cognitive load. In this experimental simulated PPH scenario, Experts and Novices have experienced a high level of cognitive load with no significant difference between groups (74.3 ± 11.4% vs. 76.6 ± 11.7%, respectively).

Feeling of personal competence.

As shown in Table 2, post-test self-assessment revealed a significant discrepancy between objective evaluation and subjective feelings in groups. Experts reported a greater feeling of performance and knowledge than Novices (self-assessment of performance 66.1 ± 16.6 vs. 47.0 ± 20.8, p <0.05, self-assessment of medical knowledge 73.8 ± 9 vs. 52.1 ± 19.9, p <0.05).

thumbnail
Table 2. PPH knowledge, PPH simulation performance and anesthetist post-test self-assessment.

https://doi.org/10.1371/journal.pone.0221515.t002

Discussion

This is the first study linking a visual behavior analysis using a standardized fixed eye tracking protocol, performance and human factors in high-fidelity simulation of PPH. Our main result is the identification of an expert visual behavior characterized by an increased number of total fixation points in comparison to residents. The study also shows the interest of eye tracking in assessing Situation Awareness at the perceptive level.

The eye-tracking protocol allowed us to analyze the visual perception objectively and to add a cognitive dimension since eye movement is a cognition-dependent process [15]. Experts have developed 3.5% more fixation points than novices. This increased number of fixations seems to be a marker of expertise in dynamic situations and appears to correspond to a more frequent updating of critical data related to the clinical need. This observation is also found in aeronautic with a 10% increase in the fixation points for expert pilots on landings [23]. However, it has been reported in medical radiology that during a chest X-ray analysis, the expertise is characterized by a reduction in the number and times of fixation [24]. But in this case—a static diagnostic task—the cognitive demand is not the same as for a changing dynamic environment requiring continuous reassessment.

Fixation time is a neurophysiological driven variable. Difference as the additional 150 ms (30% of the average fixation mean) spent by experts compared to novices for evaluating the bleeding loss mean clearly different perception and cognition processes. This is a major observation because the lack of recognition and anticipation of the bleeding problem contributes to degrading the level of maternal care and creates morbi-mortality [6,8]. Evaluation of bleeding is the center of the cognitive strategy put in place by experts to monitor the dynamic of the PPH. It results in a modification of the visual behavior and a specific cognitive treatment at different levels of integration (perception, comprehension, projection and probably the performances).

Their apparent interest for the perineal region raises questions. We make several assumptions. First, it can be a center bias, described in experimental human and animal studies [25]. This may reflect a physiological bias to return the eyes to the center of the orbits and in our eye tracking protocol to the center of the screen where the perineal region is located. Second, current research tends to show that experts use their peripheral vision more. Experienced endoscopists showed greater ability to detect adenomas when they looked at the central part of the screen, it is thus possible to extract information by maintaining a central position of the gaze and triggers a saccade when an unexpected event appears [26]. On sports fields, researchers proved that expert basket players retained a better awareness of the game than novices with a hidden foveal vision attesting that the peripheral vision of the expert can support complex integrated cognitive processes [27]. In addition, experts could have a more efficient information extraction capacity; this cognitive economy would leave more time to control gynecological procedures and can explain this center fixation tendency to the perineal region. Further studies would be needed to investigate the attractiveness of the different ROI. For example, using a different location of the ROI in the video sequences or a different point of view could add a clear ranking of the ROI in PPH management.

In a clinical simulated dynamic situation, it is interesting to see what are the cognitive determinants of clinical sense during the assessment of the hemorrhagic shock. Indeed, the analysis of visual behavior reveals processes of inhibition of some distractors (Perineal, Nurse, Patient) which are often fixed but not long observed for lack of useful information. While the anesthetists spend more time in extracting information on clinically relevant regions of their environment (Blood Loss, Heart Rate, Blood Pressure).

At first reading, the present study of human factors was disappointing. No difference was observed between anesthetist students and experienced professionals while the SAGAT, we have developed, was an arduous 38-question SAGAT (including 19 perceptive, 9 understanding, 10 anticipatory questions with justifications requested for yes/no questions) which was supposed to be more discriminant. The overall SAGAT score was 65 ± 6%. Compared to Stratton’s study on residents with sleep deprivation it was lower [28]. Indeed, in a traumatic resuscitation and respiratory distress simulated exercise, with a 10-question SAGAT administered at 3 minutes of simulation, they describe an average situation awareness value of 80%. In other nursing studies in simulated emergency settings, low SA scores were identified, averaging around 53% across scenarios and group [29]. The apparent weakness of comprehension (SA level 2 = 51%) in our SAGAT analysis is surprising where we expected a lack of perception (SA level 1 = 71%) according to the literature (11,12). A nursing study on a hypovolemic scenario also found a low level of comprehension at 29% with 2 yes / no questions asked at random times [30]. Other experimental studies tend to find perceptual failures. This weakness probably comes from the way the SAGAT questionnaire was administered. We have favored a low level of intrusion during the 8-minute video at the expense of a probable memory bias [31].

In addition, the expertise should have protected the experts from elevated cognitive load [32,33] and should have allowed better situation awareness [11,34]. Once again, the tools described in the literature (SAGAT and NASA TLX Index) were not able to discriminate our groups whereas self-assessment questionnaires allowed it. In the simulation scenario, we suppose that, for both groups, the delay of care generated a cognitive overload limiting the discriminating capacity of the NASA TLX. Current findings suggest that physiological indices are the most sensitive means for detecting variations in cognitive load while self-reporting assessment have poor validity score (1.40 ± 0.58 in medical education vs. 1.71 ± 0.70 for physiological measurements) but may not be sufficient to guarantee a better clinical patient’s outcome and physician’s performance in the real clinical practice [35]. Complementary eye tracking data analysis as measurement of the pupillary diameter could have added to our questionnaire assessment of cognitive load a physiological dimension. Indeed changes in pupillary diameter reflects adrenergic activity and cognitive load [36]. Other methods such as real-time EEG show promising results with very strong discrimination power [37,38].

This study clearly showed the effect of expertise in PPH management at the perceptual level with the eye-tracking data. However, there is currently a critical lack of efficient tools to provide insight of the behavioral level and of the cognitive mechanism at work (situation awareness and cognitive load). Understanding this discrepancy is a major issue.

Evidence of the interest of simulation-based training in improving physicians’ performances are clear both for initial training and for maintaining skills [3942]. A meta-analysis found an improvement in professional practices in the field of obstetric emergencies [43]. Simulation based training has become a national recommendation in France. Our pre-graduate residents are trained since the first year to manage obstetric incidents through simulation. Moreover, since 2013, all the centers in our region have adopted a unique protocol gathering the PPH management’s key point in the form of a checklist with cognitive aid (see S3 Appendix). It’s probably for these reasons that we did not show any statistical difference in performance between Experts and Novices despite the higher variability of the Novices performance; a marker of learning processes [44]. For example, during our experimental phase, we observed the case of a novice (second year resident) with the same level of theoretical knowledge as the participants who had never participated in a real PPH management. He was just exposed to 1 PPH scenario in high fidelity simulation the month before. In our protocol, these novices performed the same level of performance as practitioners with clinical experience of more than 100 PPH cases. Can we imagine this undergraduate resident alone on commands for a maternity night shift? Even though the evidence of the ecological validity of the simulation is growing [45,46], the issue of performance extrapolation still remains. Based on the clinical expertise we do not think that novices are competent enough but on the move to expertize. Our assessment of performance with checklist is not sufficiently discriminating. A review published in 2001 reported that the efficacy of methodologies for assessment of performance during the simulation was largely undetermined [47]. Behavioral and NTS analysis suffers from the same limitations. It would even seem that adding NTS training appears not to substantively enhance simulation-based learning [39]. Observational scales are used and there is still a lack of research examining performance transfers, sustainability, and direct patient outcomes [48]. Even though we know that they contribute to poor performance, we cannot conclude that NTS simulation training improves the safety of the patient.

In the present study, self-assessment of performances could be closer to a good assessment of the care capacity of our participants. It was much lower among novices (sometimes up to 20% lower). We assume that this difference comes from a higher self-confidence associated with better self-efficacy of experts. Indeed, considering the determinants of self-efficacy [49]: clinical successes, vicarious experience, social persuasion and physiological states, experts tended to self-assess more accurately. Eventually, the accuracy of self-assessment would be a good target marker of the expertise.

There is a need to update simulation assessment with behavioral tools. It is difficult to think that a complex performance score based on behavioral data would be less efficient than self-assessments. We should be able to create such a score soon.

Our study has some limitations. The number of anesthesiologists included in each experimental group may seem low in the context of simulation study. No a priori calculation of the sample size was sufficiently reliable due to the mixed experimental design using a first PPH video watching session and the second one involving the participants in the management of the watched PPH. However, in regard of the Tien et al. [50] review of 7360 studies using eye tracking for skills assessment and training, we included more participants than the average 22.3+/-6.3 reported from the literature. Second, our anesthesiologists originated all from the same center and the regional PPH management protocol and training program could have a homogenization effect over practitioners. The additions of these two limitations would have resulted in the non-significant statistical analysis on PPH management score and SAGAT. Despite large size effects calculated for each variable consolidating our interpretations, complementary multicenter study included more participants are needed.

Conclusion

Simulation assessment tools as performance score, SAGAT and NASA TLX have shown their limits in this study in terms of sensibility and discriminating power. This study questioned the development of healthcare high-fidelity simulation as a certification tool for medical skills and competencies. The contributions of new technologies (as portable Eye Tracking and EEG) to better characterization and recognition of expertise will have to be addressed.

Beyond the special case of PPH management, this study suggests the need for an innovative neuroscientific approach to better understand the cognitive processes that support medical expertise in lifesaving emergencies.

Supporting information

S1 Appendix. Checklist for PPH management performance.

Left column: the actions expected for the PPH management in this case. Right column: the action has been done but with a technical error (drug, dosage or procedure).

https://doi.org/10.1371/journal.pone.0221515.s001

(PDF)

S2 Appendix. Situation Awareness Global Assessment Technique (SAGAT) used in this protocol.

This 38-questions SAGAT (including 19 perceptive, 9 understanding, 10 anticipatory questions with justifications requested for yes/no questions) was designed to be hard and discriminant. It was administered during the video sequence just before anesthetist’s intervention.

https://doi.org/10.1371/journal.pone.0221515.s002

(PDF)

S3 Appendix. Regional protocol for PPH management.

This algorithm is a checklist and serves as support for PPH management in Reunion Island. It helps to remember the steps of management and each expected action is checked and dated when performed.

https://doi.org/10.1371/journal.pone.0221515.s003

(PDF)

S3 File. General characteristics of the participants dataset.

https://doi.org/10.1371/journal.pone.0221515.s006

(XLSX)

S4 File. Self-assessment questionnaires dataset.

https://doi.org/10.1371/journal.pone.0221515.s007

(XLSX)

Acknowledgments

Glenn Le Gallo, Clément Tabard, Camille Peschet, Anthony Pasquier and Yohann Dudouit: Resources for PPH Simulations.

Elodie Garnier and Nastasia Fouret: Statistical support and formal analysis.

Anaelle Pignolet: Software.

We thank Pr Silvia Iacobelli for helpful comments on the manuscript.

References

  1. 1. Khan KS, Wojdyla D, Say L, Gülmezoglu AM, Van Look PFA. WHO analysis of causes of maternal death: a systematic review. Lancet. 2006;367: 1066–1074. pmid:16581405
  2. 2. Alkema L, Chou D, Hogan D, Zhang S, Moller A-B, Gemmill A, et al. Global, regional, and national levels and trends in maternal mortality between 1990 and 2015, with scenario-based projections to 2030: a systematic analysis by the UN Maternal Mortality Estimation Inter-Agency Group. Lancet. 2016;387: 462–474. pmid:26584737
  3. 3. Hogan MC, Foreman KJ, Naghavi M, Ahn SY, Wang M, Makela SM, et al. Maternal mortality for 181 countries, 1980–2008: a systematic analysis of progress towards Millennium Development Goal 5. Lancet. 2010;375: 1609–1623. pmid:20382417
  4. 4. Saucedo M, Deneux-Tharaux C, Bouvier-Colle M-H, Le Comité national d’experts sur la mortalité maternelle. [Maternal mortality in France, 2007–2009]. J Gynecol Obstet Biol Reprod (Paris). 2013;42: 613–627. pmid:24035736
  5. 5. Deneux-Tharaux C, Saucedo M. [Epidemiology of maternal mortality in France, 2010–2012]. Gynecol Obstet Fertil Senol. 2017;45: S8–S21. pmid:29113879
  6. 6. Bonnet M-P, Deneux-Tharaux C, Bouvier-Colle M-H. Critical care and transfusion management in maternal deaths from postpartum haemorrhage. Eur J Obstet Gynecol Reprod Biol. 2011;158: 183–188. pmid:21632172
  7. 7. Bouvet C, Lehousse T, Sentilhes L, Branger B, Granry JC. [Declarative survey about postpartum haemorrhage management by anaesthesiologists in Pays de la Loire area (France): what’s the adherence to clinical practice guidelines?]. Ann Fr Anesth Reanim. 2014;33: 310–317. pmid:24836116
  8. 8. Dupont C, Deneux-Tharaux C, Cortet M, Colin C, Touzet S, Rabilloud M, et al. [Practices for management of grave postpartum haemorrhage after vaginal delivery: a population-based study in 106 French maternity units]. J Gynecol Obstet Biol Reprod (Paris). 2012;41: 279–289. pmid:22464273
  9. 9. Flin R, Patey R, Glavin R, Maran N. Anaesthetists’ non-technical skills. Br J Anaesth. 2010;105: 38–44. pmid:20522911
  10. 10. Gaba DM, Howard SK, Small SD. Situation awareness in anesthesiology. Hum Factors. 1995;37: 20–31. pmid:7790008
  11. 11. Schulz CM, Endsley MR, Kochs EF, Gelb AW, Wagner KJ. Situation awareness in anesthesia: concept and research. Anesthesiology. 2013;118: 729–742. pmid:23291626
  12. 12. Schulz CM, Krautheim V, Hackemann A, Kreuzer M, Kochs EF, Wagner KJ. Situation awareness errors in anesthesia and critical care in 200 cases of a critical incident reporting system. BMC Anesthesiol. 2016;16: 4. pmid:26772179
  13. 13. Schulz CM, Burden A, Posner KL, Mincer SL, Steadman R, Wagner KJ, et al. Frequency and Type of Situational Awareness Errors Contributing to Death and Brain Damage: A Closed Claims Analysis. Anesthesiology. 2017;127: 326–337. pmid:28459735
  14. 14. Tatler BW, Wade NJ, Kwan H, Findlay JM, Velichkovsky BM. Yarbus, Eye Movements, and Vision. i-Perception. 2010;1: 7–27. pmid:23396904
  15. 15. Hayhoe M, Ballard D. Eye movements in natural behavior. Trends Cogn Sci (Regul Ed). 2005;9: 188–194. pmid:15808501
  16. 16. Jones DG, Endsley MR. Sources of situation awareness errors in aviation. Aviat Space Environ Med. 1996;67: 507–512. pmid:8827130
  17. 17. Ooms K, Dupont L, Lapon L, Popelka S. Accuracy and precision of fixation locations recorded with the low-cost Eye Tribe tracker in different experimental set-ups. Journal of Eye Movement Research. 2015;8.
  18. 18. Vosskühler A, Nordmeier V, Kuchinke L, Jacobs AM. OGAMA (Open Gaze and Mouse Analyzer): open-source software designed to analyze eye and mouse movements in slideshow study designs. Behav Res Methods. 2008;40: 1150–1162. pmid:19001407
  19. 19. Takahashi M. Statistical Inference in Missing Data by MCMC and Non-MCMC Multiple Imputation Algorithms: Assessing the Effects of Between-Imputation Iterations. Data Science Journal. 2017;16: 37.
  20. 20. Hollenbach FM, Bojinov I, Minhas S, Metternich NW, Ward MD, Volfovsky A. Multiple Imputation Using Gaussian Copulas. Sociological Methods & Research. 2018; 0049124118799381.
  21. 21. Jones DG, Endsley MR, Bolstad M, Estes G. The Designer’s Situation Awareness Toolkit: Support for User-Centered Design. Proceedings of the Human Factors and Ergonomics Society Annual Meeting. 2004;48: 653–657. https://doi.org/10.1177/154193120404800381
  22. 22. Hart SG, Staveland LE. Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research. In: Hancock PA, Meshkati N, editors. Advances in Psychology. North-Holland; 1988. pp. 139–183. https://doi.org/10.1016/S0166-4115(08)62386-9
  23. 23. Kasarskis P, Stehwien J, Hickox J, Aretz A, States U, Academy AF, et al. Comparison of expert and novice scan behaviors during vfr flight. In Proceedings of the 11th International Symposium on Aviation Psychology. 2001.
  24. 24. Kelly BS, Rainford LA, Darcy SP, Kavanagh EC, Toomey RJ. The Development of Expertise in Radiology: In Chest Radiograph Interpretation, “Expert” Search Pattern May Predate “Expert” Levels of Diagnostic Accuracy for Pneumothorax Identification. Radiology. 2016;280: 252–260. pmid:27322975
  25. 25. Berg DJ, Boehnke SE, Marino RA, Munoz DP, Itti L. Free viewing of dynamic stimuli by humans and monkeys. J Vis. 2009;9: 19.1–15. pmid:19757897
  26. 26. Almansa C, Shahid MW, Heckman MG, Preissler S, Wallace MB. Association between visual gaze patterns and adenoma detection rate during colonoscopy: a preliminary investigation. Am J Gastroenterol. 2011;106: 1070–1074. pmid:21326224
  27. 27. Ryu D, Abernethy B, Mann DL, Poolton JM, Gorman AD. The role of central and peripheral vision in expert decision making. Perception. 2013;42: 591–607. pmid:24422243
  28. 28. Stratton A, Furey A, Hogan M. The effect of sleep deprivation on a resident’s situational awareness in a trauma scenario. J Orthop Trauma. 2014;28: e160–163. pmid:24296596
  29. 29. Cooper S, Porter J, Peach L. Measuring situation awareness in emergency settings: a systematic review of tools and outcomes. Open Access Emerg Med. 2013;6: 1–7. pmid:27147872
  30. 30. Cooper S, Kinsman L, Buykx P, McConnell-Henry T, Endacott R, Scholes J. Managing the deteriorating patient in a simulated environment: nursing students’ knowledge, skill and situation awareness. J Clin Nurs. 2010;19: 2309–2318. pmid:20659205
  31. 31. Endsley MR, Garland DJ. Situation Awareness Analysis and Measurement. CRC Press; 2000.
  32. 32. Byrne AJ, Murphy A, McIntyre O, Tweed N. The relationship between experience and mental workload in anaesthetic practice: an observational study. Anaesthesia. 2013;68: 1266–1272. pmid:24164559
  33. 33. Leedal JM, Smith AF. Methodological approaches to anaesthetists’ workload in the operating theatre. Br J Anaesth. 2005;94: 702–709. pmid:15817711
  34. 34. Randel JM, Pugh HL, Reed SK. Differences in expert and novice situation awareness in naturalistic decision making. International Journal of Human-Computer Studies. 1996;45: 579–597.
  35. 35. Naismith LM, Cavalcanti RB. Validity of Cognitive Load Measures in Simulation-Based Training: A Systematic Review. Acad Med. 2015;90: S24–35. pmid:26505098
  36. 36. Klingner J, Kumar R, Hanrahan P. Measuring the Task-evoked Pupillary Response with a Remote Eye Tracker. Proceedings of the 2008 Symposium on Eye Tracking Research & Applications. New York, NY, USA: ACM; 2008. pp. 69–72. https://doi.org/10.1145/1344471.1344489
  37. 37. Causse M, Fabre E, Giraudet L, Gonzalez M, Peysakhovich V. EEG/ERP as a Measure of Mental Workload in a Simple Piloting Task. Procedia Manufacturing. 2015;3: 5230–5236.
  38. 38. Yu K, Prasad I, Mir H, Thakor N, Al-Nashash H. Cognitive workload modulation through degraded visual stimuli: a single-trial EEG study. J Neural Eng. 2015;12: 046020. pmid:26065874
  39. 39. Lorello GR, Cook DA, Johnson RL, Brydges R. Simulation-based training in anaesthesiology: a systematic review and meta-analysis. Br J Anaesth. 2014;112: 231–245. pmid:24368556
  40. 40. Cook DA, Hatala R, Brydges R, Zendejas B, Szostek JH, Wang AT, et al. Technology-enhanced simulation for health professions education: a systematic review and meta-analysis. JAMA. 2011;306: 978–988. pmid:21900138
  41. 41. Bruppacher HR, Alam SK, LeBlanc VR, Latter D, Naik VN, Savoldelli GL, et al. Simulation-based training improves physicians’ performance in patient care in high-stakes clinical setting of cardiac surgery. Anesthesiology. 2010;112: 985–992. pmid:20234305
  42. 42. Cortegiani A, Russotto V, Montalto F, Iozzo P, Palmeri C, Raineri SM, et al. Effect of High-Fidelity Simulation on Medical Students’ Knowledge about Advanced Life Support: A Randomized Study. PLoS One. 2015;10. pmid:25955760
  43. 43. Calvert KL, McGurgan PM, Debenham EM, Gratwick FJ, Maouris P. Emergency obstetric simulation training: how do we know where we are going, if we don’t know where we have been? Aust N Z J Obstet Gynaecol. 2013;53: 509–516. pmid:24033002
  44. 44. Ericsson KA. Deliberate practice and acquisition of expert performance: a general overview. Acad Emerg Med. 2008;15: 988–994. pmid:18778378
  45. 45. Weller J, Henderson R, Webster CS, Shulruf B, Torrie J, Davies E, et al. Building the evidence on simulation validity: comparison of anesthesiologists’ communication patterns in real and simulated cases. Anesthesiology. 2014;120: 142–148. pmid:23903023
  46. 46. Manser T, Dieckmann P, Wehner T, Rallf M. Comparison of anaesthetists’ activity patterns in the operating room and during simulation. Ergonomics. 2007;50: 246–260. pmid:17419157
  47. 47. Byrne AJ, Greaves JD. Assessment instruments used during anaesthetic simulation: review of published studies. Br J Anaesth. 2001;86: 445–450. pmid:11573541
  48. 48. Ross AJ, Kodate N, Anderson JE, Thomas L, Jaye P. Review of simulation studies in anaesthesia journals, 2001–2010: mapping and content analysis. Br J Anaesth. 2012;109: 99–109. pmid:22696559
  49. 49. Bandura A. Self-efficacy: toward a unifying theory of behavioral change. Psychol Rev. 1977;84: 191–215. pmid:847061
  50. 50. Tien T, Pucher PH, Sodergren MH, Sriskandarajah K, Yang G-Z, Darzi A. Eye tracking for skills assessment and training: a systematic review. The Journal of Surgical Research 2014;191 (1):169–78. pmid:24881471