S2:E6 - Handling Unplanned Work and Interruptions in Engineering Teams: The Firefighter Role
S02:E06

S2:E6 - Handling Unplanned Work and Interruptions in Engineering Teams: The Firefighter Role

Episode description

In this episode, we explore a challenge that plagues every engineering team: the constant stream of interruptions that disrupt our workflows. Drawing from our combined experiences working with dozens of engineering teams, we introduce the concept of the “firefighter” role - a structured approach to managing unplanned work that’s transformed how teams handle interruptions.

We dive into the science behind context switching, examine why solutions often fail, and explore a practical framework that turns interruptions from productivity killers into opportunities for systematic improvement.

Key topics we cover:

  • The true cost of context switching in engineering teams
  • Why common interrupt-handling approaches break down
  • How to implement and rotate the firefighter role
  • Turning interruptions into systematic improvements
  • Measuring success and avoiding common pitfalls

Show Notes:

How engineering teams handle unplanned work - this article captured my thoughts exactly and includes some really good visualisations of the different approaches.

The Cost of Interrupted Work: More Speed and Stress - UC Irvine study

And a FastCompany article about the study.

Download transcript (.srt)
0:01

Péter: Hey everyone.

0:02

Welcome to the Retrospective, the engineering leadership podcast, where

0:06

we discuss topics about engineering management, technical leadership, and

0:10

similar areas today, as usual, with me is Jeremy and I'm Peter Sasz.

0:15

Welcome.

0:15

Jeremy: Welcome everybody.

0:16

Péter: What is today's topic Jeremy?

0:18

Jeremy: Yeah, so today I wanted to talk about a challenge that, I think

0:21

every engineering team faces, which is constantly being interrupted, context

0:25

switching, and I wanted to talk about some of the approaches that people

0:29

take and suggest my preferred approach based on some of the experience

0:34

I've had, which is this concept of a dedicated firefighter role in the team

0:39

that rotates regularly and handles all the interruptions for the team.

0:43

So we're going to try and to cover a little bit about how to implement a

0:46

firefighter role some of the pitfalls that people experience when they try and

0:50

do it and the best ways to be successful.

0:53

Péter: Awesome.

0:53

This is really a good topic.

0:54

I saw this working very well and I saw the concrete impact it had

0:58

on the focus performance and even external perception of the team.

1:02

So I'm really curious.

1:04

Let's dig into this.

1:05

Tell me a bit about unplanned work and interruptions.

1:07

Why are there so bad?

1:08

We are a team we need to be able to handle this.

1:10

Right.

1:11

Jeremy: Yeah, Yeah exactly.

1:13

It's a reality, right?

1:14

So you teams get bugs you have support contacting you you have other teams

1:19

in the organization who need something from you, or they have a request for,

1:23

for you to do something And people like me generate lots of slack messages.

1:29

I do I joke because like obviously slack is there.

1:32

It's an ever present.

1:33

Maybe some people still have email and use email and work too.

1:36

I would also say, you know So it's just there's just constant buzz at work and

1:41

some of this is actually really good.

1:42

Ultimately interruptions are you know, when you have a support

1:45

request you're helping customers, You need to unblock other teams.

1:48

Someone in your team needs a PR reviewed, or an external team

1:53

needs someone to review something that those are all good things.

1:56

When you have all these messages sharing info and stuff.

1:58

It's about getting everybody up to the same level of knowledge so

2:02

we can all make better decisions.

2:04

There is a good aspect to all of it.

2:07

I think we always jump to the negative of interruptions and blocking me or

2:11

unplanned stuff that's coming in.

2:13

There's a benefit to it, but there's also a cost, right?

2:16

And I think that's what we focused on.

2:18

We always want more deep work, and we want to be in, flow and all the rest.

2:22

And, but I think it's really hard and I think it's getting increasingly harder.

2:27

I remember early in my career it was mainly email and emails

2:30

weren't that frequent, now Slack and instant messages are non-stop!

2:34

I remember when hip chat became really big that was like the first moment in

2:37

work where this whole thing kicked off.

2:40

I think there's one study that I've often heard quoted, which is that every switch

2:43

you make has an impact of 23 minutes.

2:46

There was another one which I think is a really interesting

2:48

study by UC Irvine that over half of all tasks get interrupted.

2:54

And I think they also say that Any task that gets interrupted isn't very

2:58

likely to be finished in the same day.

3:00

And then there's the mental impact, stuff not getting finished cognitive

3:04

overhead, interruptions are very bad.

3:06

Péter: Yeah.

3:08

Tell me more about that, that study because it's, it's super interesting.

3:11

I think intuitively we all know the negative effects of interruptions.

3:15

We feel it on ourselves especially if interruptions get reinterrupted again,

3:20

and then you just don't even know where you started, but it's great to hear

3:24

that there are studies about that.

3:26

Tell me more about this.

3:27

Jeremy: Yeah.

3:28

I don't know.

3:28

Just on a personal level, when, we work in manager mode, you have the maker mode

3:32

and manager mode and part of manager mode is, dealing with interrupts and having

3:37

a schedule that's like really fragmented.

3:40

But yeah, even then we still have to produce stuff.

3:42

And what this study I found really interesting was that it, it said that

3:46

they, they did a study of people working in knowledge work and they

3:49

found that people were switching tasks every three minutes, which is scary.

3:55

Yeah.

3:56

When I look at myself, I'm wondering if I'm doing the same things.

4:00

And I know this bit is definitely true.

4:02

We interrupt ourselves more than others interrupt us.

4:05

So

4:05

Péter: Oh,

4:06

Jeremy: we handle interrupts as a team, we're still our own worst enemy.

4:12

Péter: I think it's like I don't want to be very gloomy and everything, but

4:16

I think today's age with all these micro contents and micro interruptions from

4:21

my phones are, are contributing to this in a, in a negative way, like Yeah.

4:25

Yeah.

4:26

I can see on people that not just the addiction aspect,

4:29

but the attention span aspect.

4:31

Like if a video is longer, and by long I mean 40 seconds instead of 10 seconds,

4:36

they don't watch it till the end.

4:38

I see the effects on myself also.

4:40

So all these interruptions are super, super draining.

4:44

Jeremy: Yeah.

4:44

I wonder if we ran this, if they ran this study again,

4:48

would the results be even worse?

4:50

So yeah, I have to say that I'm, I find I've definitely noticed my

4:56

ability to focus on long form is getting harder because of the amount

5:00

of short form stuff that we're getting

5:02

hit with.

5:03

Péter: We interrupted ourselves.

5:04

Let's, let's get back to this.

5:06

Jeremy: No, it's okay.

5:07

But what's, well, we both like feeling guilty about whatever

5:10

stuff we look at, on our phones!

5:12

What was interesting was and it's fairly obvious in a way, but the

5:15

more complex the task, the more the bigger it costs to recover from it.

5:19

They said the afternoon interruptions were hit harder.

5:22

The more people you work with in a team, obviously, which is obvious,

5:26

the more interruptions you experience.

5:27

So that's definitely another case for the small team approach and

5:33

another obvious one, but the more projects you have in flight, so

5:37

the more work in progress you have.

5:38

The bigger the impact.

5:40

And then it was interesting also was if you're interrupting yourself,

5:43

like we said, if I'm switching and then going back, there's actually

5:46

a faster recovery time in terms of being able to be back on task.

5:50

Whereas if you have an external interruption, it was a longer recovery.

5:54

And that if you're interrupted if you're in flow in a deep task, you're less

5:59

likely to get back or maybe it says, you might never get back to peak focus.

6:04

And yeah, so this is the thing I mentioned earlier, but only

6:07

40 percent of interrupted tasks get completed the same day.

6:11

Péter: That's a problem because I guess if you add the lost time and different

6:16

focus for the night then it's just so much harder to get back into the focus

6:20

the next day and finish the task.

6:22

Yeah.

6:22

Jeremy: Yeah.

6:23

Even if you're interrupted in the day just the ability to get back to that

6:27

deep focus to finish out the task.

6:29

And here's, this is the final interesting bit.

6:32

This is how we're all coping, right?

6:34

This is what I find interesting.

6:35

We all work faster.

6:37

And we're getting more stressed because of it and I can feel it, like I feel

6:41

more stressed today with everything that the way that the world feels

6:45

like it's spinning faster because you, and you have to step up your pace.

6:51

and Yeah, the most productive hours are early morning.

6:53

And that's true.

6:54

I find I'm up early and I can get stuff done in a focused way that

6:58

I struggle later in the day to do.

7:00

Péter: I'm curious to know if, if, if this is because you have

7:04

less external interruptions because others are still sleeping.

7:07

Jeremy: Yeah, exactly but even then I see people who get into work early, that's

7:10

their productive moments before then there's a buzz in the office or whatever.

7:14

The biggest way that we interrupt ourselves is checking email or messaging

7:19

where I guess in this case, Slack.

7:21

And I know that's the case for myself.

7:22

What I find interesting about all of this is there's two parts.

7:26

There's like a personal productivity and interrupting yourself, but

7:29

there's also the bigger cost

7:31

on a team that's trying to get projects done.

7:33

And it's trying to get big tasks done and they have interruptions

7:37

coming into that team.

7:38

And it's a huge cost to actually the productivity of the team.

7:42

Péter: Yeah.

7:42

That's, that's what I wanted to ask you about.

7:44

Like, like, I think we talked a bit about the personal aspect.

7:47

Let's elevate it to the team aspect and see what, what, how does the

7:51

worst case scenario look like?

7:53

Describe a team that's constantly interrupted.

7:56

Jeremy: They have a lot of projects in flight.

7:58

They're constantly being interrupted.

8:00

They're in a, what I would call like a reactive mode and they're not in charge

8:03

of their timeline and delivery anymore.

8:06

Quality of what they do is decreasing because they have to ship something

8:10

because there's, more pressure but they never had their deadlines struggling

8:14

with tech debt because it's a spiral.

8:16

You get worse and worse out of control.

8:19

And so you make things worse and worse over time that I think the energy in

8:23

the team and the morale decreases.

8:26

And obviously your external stakeholders and customers of the team start

8:30

to lose trust with you because of, The way the team is functioning.

8:34

It's, it can be really dire.

8:36

Things can turn out really bad if teams don't find a way to deal with this.

8:41

Péter: It's, it sounds like really a lose, lose situation.

8:44

And it's, it's, it's very sad and ironic because it's coming out of

8:48

a need to be better and respond to all the interruptions on time.

8:52

But in reality, the result is negative.

8:55

So how, how did you see team solve this problem?

8:58

Cause as you say it drains team energy, morality it errodes, customer trust.

9:02

So, so they know that this is not a good.

9:05

Place to be at.

9:06

What, what are the common approaches you've seen?

9:08

Mm

9:17

Jeremy: And in, it can work in the early stage, small team.

9:20

It can work in an early stage startup as well.

9:23

But it's essentially, that you don't really have a process team

9:27

just decides, you triage stuff and you prioritize as it comes in.

9:31

And it's if you think about your time, you have many slices every time an interrupt

9:35

comes and you have a, you try and have a fast, flexible way to handle stuff.

9:41

And it can work.

9:42

And this is, it can work at a certain level.

9:44

Especially if a team is taking on board some of the common advice that you are

9:48

given for handling interruptions, which generally work at this stage, which is

9:53

having some kind of focus time in the team where and where the, you try and

9:58

avoid having too many topics in flight for members and for the team and having

10:03

some kind of Way that some kind of framework for communicating with the

10:06

outside world, maybe a specific channel.

10:09

Maybe some kind of okay this is a bug.

10:12

It's not too serious.

10:13

I'm going to put it in the backlog and we'll do it in the

10:15

next sprint kind of approach.

10:16

Like all of this, having some kind of lightweight framework.

10:19

This is where teams.

10:21

Most teams sit and are most, like that's like the, almost the default mode.

10:26

And I think that works, right?

10:28

I think it works for a certain stage.

10:30

But it, I think the wheels come off quite, can come off quite

10:34

quickly as teams become bigger.

10:36

So you have we talked about the connections between the team

10:38

Great.

10:39

More interrupts.

10:40

If a business becomes more successful or your product is growing, you have

10:43

more users, more customers, more support requests more things going on

10:48

or you're part of a busy organization where there's just a lot going on, I

10:52

think then what you end up happening is that your backlog explodes.

10:56

Your team is constantly context switching your sprint start to,

11:01

if you're using sprints, you might not be meeting your sprint goals.

11:04

Or you'll see in if you're in Kanban mode, your stuff will get your tasks

11:07

will get stuck not be advancing and then you'll be bunching up.

11:10

You'll fall into all sorts of traps of trying to still push more and

11:14

ultimately you're not predictable.

11:15

Your delivery is not predictable anymore because of all those interruptions

11:19

that are sidetracking you all the time.

11:21

Péter: Yeah, yeah, yeah, this is a good summary.

11:23

The way I see this ad hoc approaches or early stage approach is that

11:28

It focuses a bit more on the individual than on the team.

11:31

Like it has some, some good ideas and prescriptions for individuals the focus

11:35

time, the limiting task shifts, but it doesn't have a holistic approach

11:40

to the whole problem of interruption.

11:42

So, but,

11:43

Jeremy: love that framing actually, I think that makes a lot of sense,

11:46

Péter: and I, I, I think.

11:48

I mean, I don't know what you're going to come up with, but I

11:50

think later approaches, I hope that they are a bit more holistic

11:54

and acknowledging approaching the interruption problem better.

11:58

What else did you see working in bigger teams or more successful

12:02

teams that were handling this?

12:03

Jeremy: yeah so then I think there's this kind of approach it's been

12:08

more popularized by Basecamp and shape up which is the cool down or a

12:12

dedicated sprint approach, essentially.

12:15

You try and ignore all of the majority of your interrupts and then

12:20

you handle them in a batch mode.

12:22

I see different variations of this one day a week or, every sprint, we have a

12:27

cool down after it or, whatever the kind of like in, for example, in shape up,

12:32

they have a six week, Sprints and then a cool down and then another six week.

12:36

Obviously I think there's an aspect of this that starts to

12:40

go to a really good principle, which is timeboxing and batching

12:43

And so it's a great idea.

12:45

Teams get desperate and they say we've got loads of bugs, so

12:48

let's stick it into a cool down.

12:50

And I've seen that, several times now.

12:52

Oh, that's a cool down task.

12:53

We'll do that.

12:54

And then the cool down starts to, it works, but I think there's

12:59

some things that break down.

13:00

First of all, the cool down can get over filled.

13:05

And, I think the other part is this does not work.

13:09

Where you need to respond quickly to something in a timely way.

13:14

Péter: Yeah, you cannot say to all requests that thanks

13:16

for your, your request.

13:17

I will handle it next week.

13:19

Jeremy: Yeah, exactly.

13:20

Bugs become outdated, right?

13:21

If, I have a bug and then I push it out by even two weeks, how can I reproduce it?

13:26

And just that context switch also the thing about bugs is the longer you

13:31

leave it to fix you, the longer you are from when you originally wrote the

13:34

code and the harder it is to switch

13:36

back into the context of that.

13:37

So at least for me, I feel like bugs should be, we want to shift, we want to

13:42

triage and fix bugs as soon as possible and get and fix them as early, shift left,

13:47

but fix them as early in the process.

13:48

I think those cooldowns can overflow and obviously the user

13:52

experience of this is not great.

13:54

We talk a little bit about it.

13:56

Péter: I mean, it's if you talk about user experience like the customer

14:00

perception of the team and the service the team provides, arguably there

14:05

there is an improvement because at least the team is more predictable

14:08

than than the ad hoc handling.

14:10

So the stakeholder might be upset that that his problem is only going

14:15

to be handled in two weeks, but that's that's something they can plan with.

14:19

As opposed to the ad hoc when one request gets handled immediately

14:23

and the other one in three months.

14:24

Jeremy: That's exactly it.

14:25

So you are trading off on being more predictable on delivery for

14:30

the responsiveness and so on.

14:32

But I think, and so this can work, and it does work.

14:36

And where you'll see it's not working is you have these crises that interrupt.

14:41

And I think, that your team with this mode is also going to struggle on that

14:45

user serve, the user responsiveness

14:47

So I think it's very hard to run in this mode and avoid interruption still,

14:52

Péter: Yeah.

14:53

Jeremy: because you end up, what you end up with is a combination

14:55

of cool down and ad hoc

14:58

to operate properly.

14:59

Péter: Guess it, it depends on, on, on the discipline and how well you can keep it.

15:03

But to your point, like if, if production is down, then, then you cannot just

15:08

push it to the cooldown period.

15:11

So, so yeah, whatever discipline you keep and however strong you

15:14

push back, there are always things, interruptions that are creeping through.

15:19

Okay.

15:20

So thanks for these approaches.

15:24

Tell us a bit more about the firefighter role because it seems

15:26

like that's a good compromise.

15:28

Jeremy: Honestly, this is the pattern that I found works really well, especially

15:34

when you have a bit more of an established organization regular, support, regular

15:40

things flowing into the team, or you're

15:42

part of a bigger organization.

15:44

And no matter how much we try to remove dependencies between teams

15:47

we still have to communicate.

15:49

And or especially and this is also.

15:51

An important role, I would say, for platform teams that provide a service

15:57

internally to other teams in organization, where they have both a build and a run

16:02

and I, most teams have some run element.

16:04

So the firefighter is a, it's taking, instead of having team interrupts, it's

16:10

having a rotating role in the team who is dedicated To handling all of those

16:16

interrupts it allows everybody else to stay focused on their work and, The

16:21

firefighter is doing all the run tasks.

16:24

They're handling any kind of production incidents, customer support requests

16:29

requests from other teams and so on.

16:31

And they're like the first line support triage before.

16:36

We do decide, and sometimes you still have to interrupt the team,

16:39

but they're definitely, they handle the majority of those things.

16:43

And then this is a really important part of this role, which I think teams get

16:47

wrong and where they fail is they don't do this on top of their normal work.

16:51

you have to plan for that person to not be in, for the period of

16:55

their rotation, you need to plan for them to be not doing any work.

16:59

And the work that they should be doing beyond all of those support requests is.

17:04

learning what causes the team to have those requests and

17:08

preventing them in the future.

17:10

So it might be fixing bugs, but it's more identifying patterns and then figuring

17:16

out, Oh, we need some better monitoring to avoid this, or I need to create some

17:19

quick tooling or something self service.

17:22

I'm going to improve the docs because we getting these requests.

17:25

This piece of tech debt is really.

17:27

Impacting us every time I'm going to work on this and ideally those

17:31

are more interruptible tasks because you're doing you know, you're the one

17:35

that's supposed to be interrupted.

17:37

But I think, this is a person that can work in that more ad hoc mode, but just

17:40

that one individual with a prioritization.

17:44

So anytime that teams are struggling to deal with interruptions, this model

17:48

has worked well consistently for me.

17:50

Péter: Hmm.

17:51

That's that's interesting.

17:52

Let's talk a bit about the practical aspects.

17:55

Like how, how do you implement something like this in a team?

17:57

What are the things to pay attention to?

17:59

And then maybe you will answer before me posing some of the

18:03

questions about the risks.

18:04

Hmm.

18:05

Mm

18:07

Jeremy: so I think the rotation is something, so when you're implementing, I

18:10

think there's some patterns for success.

18:14

I think the minimum.

18:16

of a rotation for the role is one week, though often two weeks is what people

18:20

do because you have enough time to actually tackle some of the debt stuff.

18:24

And essentially you need long enough time to build some context.

18:28

But it can't be so long that you get frustrated or burnt

18:30

out from all of the interrupts.

18:32

If you working in sprints, it makes sense to wrap it, to have it aligned

18:36

with the sprints that your team is doing, having the same cadence as the sprints.

18:41

And I think it's important to have some kind of handover

18:43

between the people who are, who's finishing and doing the next one.

18:47

Providing, maybe some kind of status on, on ongoing stuff if they're going beyond.

18:52

There's also having some kind of clear criteria for that person to, that you

18:57

develop over time, to escalate into the team if you can't handle things yourself.

19:02

Also if you're being overwhelmed by a lot of Issues, then that's also an

19:06

important part, by the way, if teams are, if that one person is being overwhelmed

19:12

all the time and every time, then I think the team has an issue that goes

19:16

beyond needing a firefighter role.

19:19

I think at that point you need to, maybe reconsider what is triggering a needing

19:24

so much that more than that, it takes more than one person to constantly handle those

19:29

Péter: Yeah, I, I think in this case is the person who is, who

19:32

cannot handle it because there are just too many interrupts.

19:35

I think that person is going to have a very good idea of how to make an

19:38

impactful improvement in the systems and, and just, just get into a better stage.

19:43

Maybe it's documentation, maybe it's setting up some self serve system.

19:47

Maybe it's just tech debt.

19:48

Yeah.

19:49

Yeah.

19:49

Thank you.

19:49

Yeah, that's

19:52

Jeremy: if the team can't handle that in the course of that firefighter time

19:56

slot, then they need to prioritize those tasks as part of the actual sprint or

20:00

put it in their, prioritize it in their backlog to make it a whole team focus.

20:06

And I think just the last thing is having some kind of tracking of the

20:10

kind of requests coming in is helpful to be able to spot the patterns and to

20:15

look back and be a bit more data driven.

20:17

I'm trying to say this in a way that I'm not just saying create tickets because

20:23

Péter: what I wanted to ask

20:24

Jeremy: that can be really.

20:25

Can be really evil and over heavyweight and so on, but yeah, some kind of

20:30

way to, to just understand what, all the requests that you have.

20:35

Péter: Well, what I like working is that the teams keep open.

20:39

They're usually Slack or some internal chat tool.

20:41

They're open to requests.

20:43

They have a dedicated room and the firefighter is immediately responding

20:47

that, okay, I will start to take a look at it and says, while I'm looking at

20:53

it or starting to address, can you just make sure to create a ticket for this?

20:56

You can link this chat, whatever, just because communicates that the issue is

21:02

not held up until the person creates all the administration, but still puts the

21:07

responsibility on the original requester to create the ticket and enforces this

21:12

culture that that's the culture of documentation being, being database.

21:16

Jeremy: Yeah.

21:17

Maybe I would say, I think that there are even better ways to

21:22

handle that now than say, I've got this, but please create a ticket.

21:26

I still think create a ticket is a horrible thing to

21:28

Péter: Yeah, yeah, I hear it.

21:29

Jeremy: And.

21:30

Now you can, if you're using Slack and I think most people are or, most,

21:35

I hope most people are hope you're not condemned to teams, but I think

21:38

you can do this with teams too, but

21:39

Péter: I think some of our listeners are so

21:41

Jeremy: yeah, sorry for you.

21:43

But what you can do is you can have emojis that you can tag on an issue on a Slack

21:49

message that triggers a workflow that creates a ticket in whatever in Jira or

21:54

linear or whatever tool you're using.

21:57

And And it just keeps the thread of that message in sync with your ticketing.

22:02

And then afterwards, you can do whatever triage you want.

22:04

And maybe some of your emojis that you can add can help like triage that.

22:09

But saying create a ticket for me is it's just not something

22:13

that I'm really a fan of,

22:15

you know.

22:15

Péter: really a service attitude.

22:18

Jeremy: Exactly.

22:18

It's not user friendly.

22:19

It's not user focused.

22:20

It's not customer focused.

22:22

So many rant over, but that, that's, I wish that people would

22:26

focus more on making it easier.

22:28

And especially if you're dealing with customer support or sales, it's such

22:32

a barrier for them, especially sales.

22:35

When they have requests they're running in the field.

22:39

Trying to chase customers usually not very technical and we need to meet them where

22:43

they are and not put straight jackets and processes around them, but that's maybe,

22:48

Péter: And I often saw the frustration of someone reaching out with something,

22:53

Hey, I noticed the bug in your systems.

22:56

It doesn't hold me back.

22:57

It's not a problem for me.

22:58

I'm just helping you out by letting you know and receiving a feedback saying

23:03

that, okay, create a ticket for that.

23:05

It's really frustrating and teaches the person not to reach out next time.

23:08

Okay, let's, let's, let's get out of this rabbit hole of communication

23:13

culture and just continue.

23:14

I really like this approach of, of the firefighter because externally.

23:20

It shows up like a 24 seven support always available approach.

23:24

And that can be really good for, for, for everyone interacting with the team.

23:28

And it also creates focus time for the people who are not on on, on the

23:33

firefighting role and they can do some deep work, what are the common pitfalls?

23:38

Why, why, how do you see this failing or what are the things that we are

23:42

trading for this better service?

23:45

Jeremy: Yeah.

23:45

So some things that I've seen so first of all, the trap of making the firefighter

23:49

a permanent part of somebody's job,

23:52

either, you know, like the best engineer because they know everything,

23:56

which is awful because you need to push like junior members of the team.

24:01

They might need some support.

24:02

Yeah.

24:02

but this is how you learn.

24:04

And, we need to take the training wheels off, but maybe it's a pairing,

24:07

but there needs to be a point where the junior people in the team need

24:11

to be able to handle this role.

24:13

The other trap that I see, which I really dislike also is the manager of the team.

24:18

Fulfills the role.

24:20

I think if you're a hands on manager, it's perfectly acceptable for you

24:24

to have a rotation and you should be having a rotation as a firefighter,

24:27

just like everybody else in the team.

24:29

But what I don't think it's good is if you Permanently do it.

24:33

I think I just think it's a real anti pattern.

24:36

The other one that this is where this implementation always fails

24:39

down when people think of it, and they make the firefighters still

24:43

have work in the sprint or whatever.

24:45

And this is like the, this is why all, a lot of this fails, because

24:49

you just overwhelm someone, and their work is still late because of all

24:54

the reasons we talked about earlier.

24:55

And then the other ones we briefly talked about not having a plan when

24:58

you have multiple things kicking off that are overwhelming the

25:01

individual and not having a handover.

25:04

And so you stuff falls in the cracks when you rotate and you don't

25:08

have a handover of the in flight.

25:11

Topics.

25:12

I think then customer service suffers as well, or what's worse is you take

25:16

it with you into your next sprint.

25:18

So some, you have to, if you're going to take it in, you need to plan it as work in

25:23

the next sprint that you're going to do.

25:24

But it needs to be really explicit and not like a side thing because,

25:28

oh, I was helping that person during my firefighting rotation and now

25:32

I'm just going to continue somehow.

25:36

Those are all the danger.

25:38

Danger.

25:38

Yeah.

25:45

Péter: juniors on on firefighter.

25:47

I saw this working amazingly in one of the places I work that there was this

25:51

this junior engineer who was Not very experienced, and she was afraid of this

25:57

role, but she really shown in this, this, this place because she had very

26:01

good soft skills and do help people who are reaching out with requests.

26:06

And it was very motivating for her to, to, to see the impact of, of her help can do.

26:12

And she balanced very well when to ask for help.

26:15

Actually, I think this is one of the risks.

26:16

If you put junior people in this role that they don't recognize.

26:21

The point where they should ask for help and not try to figure themselves out.

26:24

So if you teach them to be comfortable with not knowing things and asking for

26:30

help and how to escalate efficiently, then this can be an amazing, motivating

26:36

and learning experience for them.

26:38

Yeah,

26:42

Jeremy: podcast episode we did about what makes a senior engineer, because

26:46

this is exactly the skills that this is part of that growing of a junior

26:50

person into a senior and the kind of the skillset that they need to learn

26:54

in order to be to improve and grow.

26:57

So, yeah.

26:58

Péter: quick question before we summarize and say goodbye.

27:01

You mentioned the importance of knowledge transfer and to make sure

27:05

that handover is happening efficiently.

27:07

And I agree because I think one of the values in this firefighter situations

27:12

is that the internal knowledge is increasing in the team if it's done well.

27:16

So I'm curious, what are some practical ways you found working about

27:20

this handover to make it efficient?

27:23

Jeremy: I think the first is to discuss this in the team retrospective.

27:29

And ideally if you're doing this around some kind of sprint cadence of two weeks,

27:33

then you're having a retrospective.

27:35

And so it's an important part of building it into the retrospective of the team.

27:40

And I think then it's maybe obviously sharing what you're learning

27:43

continuously in the daily and so on, that's an important part of it.

27:46

And finally, When you're handing over it's probably not just a one to one,

27:50

but maybe like with the manager, you have a three way, the outgoing and

27:53

the incoming and maybe the manager.

27:55

I'm not sure I would have the whole team involved in

27:58

Péter: that's what I'm thinking about.

27:59

It, I guess it

28:00

Jeremy: you, that's, you cover that in the retrospective.

28:02

Yeah.

28:02

Sorry.

28:03

I would just cover that in the retro.

28:05

Yeah.

28:05

Péter: yeah, because I'm thinking if, if you're talking about a nine person

28:08

team, then, then I think you're right.

28:10

It's a waste of time or, or not the best way to spend the time, but if

28:14

it's a very small team, like three people, four people, then, then,

28:17

then maybe you can combine this and just talk about briefly what happened

28:21

during the firefighter session.

28:23

So everyone learns.

28:25

Jeremy: Yeah.

28:25

And usually the, you need the firefighter, the bigger the team goes.

28:28

So maybe in a three or four person team, you're less likely to need it and you're

28:32

on top of stuff, but and you may not need to dedicate a full time firefighter also.

28:38

I still think that even in small teams, this pattern works way better

28:42

because you have one person who's the point of contact for that week.

28:47

Péter: Yeah.

28:47

Yeah.

28:48

And to those who say that, Oh, but in a four person team having a dedicated

28:52

firefighter, it means that we are running at 75 percent capacity.

28:56

That's not true.

28:57

That's really not true because that 25 percent is super valuable work and

29:02

it creates the stability, resilience and scalability for the team by not

29:08

just answering requests, but also the little things you were talking about.

29:12

We didn't go deep into it, but.

29:13

This person, when it's not interrupted and just waiting to help this person

29:18

can do a lot of very impactful tech that work, tidying up a documentation,

29:23

refactoring something, looking into an obscure bug that the team never had time.

29:28

So yeah, push back.

29:30

If somebody tries to oppose this,

29:32

Jeremy: Yeah, plus you're just making the invisible visible by

29:35

having that dedicated capacity.

29:37

And a lot of our work is just about making stuff visible and visualized.

29:42

And if the true capacity of the team is that, then that it's just much better.

29:47

Péter: all right, we are nearing our end.

29:49

Let's wrap it up.

29:50

What are the key takeaways, how to handle interruptions in a team?

29:54

Jeremy: Interruptions are good.

29:55

Interruptions are bad.

29:57

And we, what we were trying to achieve is not to eliminate all of them, but

30:01

to handle them more efficiently and learn from the interruptions by

30:06

constantly and regularly improving how the team works and a great pattern for

30:10

that when ad hoc handling of incoming requests starts to break down is to

30:15

have a dedicated person in the team.

30:17

In this case, I call it the firefighter.

30:19

And they shield the whole team.

30:21

You regularly rotate that role and you have some kind of handover . And what

30:25

makes this work well is if during that rotation, you actually are doing

30:30

improvements For how the team works or how the team interfaces with others.

30:35

Things like documentation, automation, self service.

30:38

And obviously the team is just learning from what that person is doing.

30:43

To share knowledge inside the team of the interruptions to be

30:45

able to make bigger improvements.

30:48

Péter: Awesome.

30:49

Yeah.

30:49

Nice wrap up.

30:50

Very practical.

30:51

Thank you, Jeremy.

30:52

And to our listeners, if you have someone in your personal or

30:56

professional circle whose team is struggling with interruptions,

30:59

send them over this episode.

31:00

And see you in two weeks when we're going to tackle another

31:04

engineering leadership problem.

31:05

Jeremy: Yeah.

31:06

Brilliant.

31:07

And we'll put some links to some great resources in the show notes as well.