Services · Training and Development

Training and Development That Changes Behaviour

Sidestream behaviour change training and development in action

Most corporate training and development is measured by whether people enjoyed it. The feedback form goes round at the end of the day, the scores come back high, and the programme is recorded as a success. Some things cannot be taught, they have to be rehearsed, and a satisfaction score tells you nothing about whether anyone is doing the job differently a month later. This page is the working reference for Heads of Learning and Development, L&D Business Partners and CHROs who want training and development that reaches behaviour and can prove it.

The distinction matters more in 2026 than it did five years ago, because the money is moving. Learning and development budgets are under harder scrutiny than at any point in recent memory, and the programmes that cannot demonstrate behaviour change are the first to be cut.

What this guide covers. Why much UK L&D spend is being cut. The satisfaction-score trap. The Kirkpatrick Level 3 commitment. The four conditions training and development has to meet to change behaviour. An illustrative development programme. How Sidestream works as an L&D partner. How this differs from HR-function training. FAQs.

Why Learning and Development Spend Is Being Cut

The headline number is hard to ignore. According to the Skills England Annual Skills Report 2026, UK employer training spend fell from £55.4bn in 2011 to £44.8bn in 2024, a £10.6bn fall over thirteen years. That is not a market reinvesting in its people. It is a market quietly withdrawing.

The spend that remains is concentrated and watched. CIPD's Learning at Work research puts average UK L&D spend at roughly £1,068 per employee per year. For a finance director, that is a real line item, and the question they increasingly ask is the uncomfortable one: what did it change? When the honest answer is a folder of high satisfaction scores and a completion rate, the line item looks like a cost rather than an investment, and costs get cut.

The cut is rarely a judgement that development does not matter. It is a judgement that this development could not prove it reached behaviour. The decline in spend is, in part, the predictable consequence of a generation of programmes that never measured past the feedback form.

The Satisfaction-Score Trap

The satisfaction score is seductive because it is easy to collect and almost always flattering. A well-delivered day with a confident facilitator, a comfortable room and a few memorable exercises will score well regardless of whether it changes anything. The score measures the quality of the experience in the room, not the quality of the behaviour back at the desk.

The evidence on this is clear, and it is uncomfortable for the industry. Reaction scores correlate weakly, sometimes not at all, with whether behaviour actually changes. People can enjoy a programme, rate it highly, learn the model, pass the quiz, and return to work behaving exactly as they did before. Knowing about a behaviour and being able to perform it under real pressure are different things, and only one of them is built by listening.

Real behaviour change happens through lived experience that makes the new behaviour stick: rehearsing the difficult conversation until it is in the body, not just understanding the four-step model on a slide. Training and development that relies on passive transmission, the keynote, the e-learning module, the questionnaire, is optimising for the wrong measure.

The Kirkpatrick Level 3 Commitment

The clearest way to talk about this gap is through the Kirkpatrick model, the standard framework for training evaluation. It sets out four levels: reaction (whether people liked it), learning (whether they acquired the knowledge), behaviour (whether they do the job differently in real work afterwards) and results (the downstream business or operational outcome).

The trap lives in the gap between Level 2 and Level 3. Almost any competently delivered programme can reach Levels 1 and 2: people enjoy it and they learn the content. Very few reach Level 3, because behaviour in real work only changes when the design includes rehearsal, bespoke scenarios and structured embedding. Most training and development stops measuring at Level 2 precisely because that is the last level it can claim a result on.

Sidestream's commitment is to design every programme backwards from Level 3. We start with the observable behaviour the organisation needs to see more of, define how it will be recognised in real work, and build the programme to produce it. Level 3 is our minimum measurement standard; Level 4, the downstream metric, is added where the brief allows. If a programme cannot articulate the Level 3 behaviour it is targeting, in our view it should not be commissioned. For the full framework, see our explainer on what the Kirkpatrick model is.

Scope a development programme

Book a free 30-minute consultation. Bring the behaviour you need to see more of. We will tell you honestly whether Sidestream is the right fit.

Book a Free Consultation

The Four Conditions Training and Development Has to Meet

Four conditions separate training and development that changes behaviour from training and development that produces satisfaction. They are not exotic. They are simply absent from most of the market.

Condition 1: a named behavioural target. The programme has to be built around a specific, observable behaviour, not a topic. "Better feedback" is a topic. "Line managers hold a structured underperformance conversation within two weeks of the issue arising" is a behavioural target. The first cannot be measured at Level 3. The second can.

Condition 2: rehearsal, not transmission. Behaviour change requires people to practise the behaviour, repeatedly, with feedback, at the edge of their current capability. Watching a video about a difficult conversation does not build the capability to have one. Rehearsing it does.

Condition 3: bespoke, realistic scenarios. The situations people rehearse have to feel like their actual work. Scenarios written for the specific cohort, drawn from the contexts they really operate in and played by professional actors rather than fellow delegates, produce a realism that ordinary role-play cannot match.

Condition 4: embedding after the event. The programme day does not produce sustained change; the weeks after it do. Structured follow-through, manager accountability and behavioural-observation reviews are what convert a memorable day into a changed way of working. Without an embedding architecture, even an excellent programme decays to a fond memory within months.

The most common reason a programme reaches only Level 2 is that it meets one or two of these conditions and not all four. That is also the difference between immersive rehearsal and passive content, which we set out in our guide on immersive training versus e-learning.

An Example Development Programme

The following is an illustrative, generic example of how a Sidestream training and development programme is structured. It is not a description of a specific client engagement.

The brief. A 500-person professional-services firm has rolled out a new performance-management framework through e-learning. Completion is at 95 per cent and module satisfaction is high. Yet six months on, line managers are still avoiding the difficult underperformance conversations the framework was designed to enable. The L&D team has reached Kirkpatrick Level 2 and stalled.

The behavioural target. We define it precisely with the L&D team: line managers initiate a structured, fair underperformance conversation within two weeks of an issue becoming clear, and direct reports experience that conversation as direct and respectful. This is observable, and it can be measured.

The design. A one-day immersive workshop for cohorts of fifteen managers. Professional actors play the direct reports, against three bespoke scenarios written from the firm's real (anonymised) situations: the long-tenured underperformer, the disruptive high-flyer, and the recently promoted colleague who is struggling. Each manager rehearses, receives a structured debrief, and rehearses again.

The embedding. Six weeks of structured follow-through: a manager peer-practice session at week three, behavioural-observation check-ins, and a short reflective prompt tied to each manager's real caseload. The embedding is deliberately longer than the workshop, because that is where sustained change is won or lost.

The measurement. Level 3 at three and six months: the proportion of underperformance issues addressed within two weeks, and direct-report feedback on the quality of the conversation. Level 4 where the firm wishes to track it: time-to-resolution on performance cases. This time, the L&D team can answer the finance director's question with behaviour, not a satisfaction score.

The point is the shape: a named behaviour, rehearsal under realistic conditions, embedding, and measurement that reaches Level 3. The specifics change with every brief. The structure does not.

How Sidestream Works as a Learning and Development Partner

Sidestream does not replace an in-house L&D function. We work alongside it. The L&D team owns the strategy, the competency framework, the learning platform and the broad portfolio. We take the specific part of that portfolio where behaviour change is the genuine goal and conventional content delivery has not reached it.

In practice we integrate with the existing framework rather than imposing our own, mapping the behavioural target onto the competencies the organisation already uses. We design the bespoke immersive intervention, write the scenarios, cast and direct the professional actors, and build the embedding architecture. And we hand the measurement back in a form the L&D team can present upwards: Level 3 behaviour change, defensible to a sceptical finance team.

The method is anchored in organisational-psychology and behaviour-change research from UCL, the University of Cambridge and Bocconi University. Our own academic work in this area found immersive role-play to be roughly 20 per cent more effective than passive modalities, such as slide-shows and video e-learning, at teaching communication skills. Our verified client base includes the Metropolitan Police, UCL, the University of Cambridge, Bocconi University, Goldsmiths University of London, Imperial College London, TCS and the Innocence Project.

You can read more about how we approach the work in our six-step approach, the range of workshops and training we deliver, and our case studies.

Training and Development Versus HR-Function Training

One distinction worth drawing, because the two are often confused. This page is about training and development as an L&D discipline: the effectiveness of the learning the organisation delivers to its people, and how to design it so it changes behaviour. It is about the portfolio, the measurement standard and the design.

That is different from developing the HR and L&D function itself, the behavioural and consulting capability of HR practitioners and business partners. If your question is about building the HR team's own skills rather than the effectiveness of the training they commission, our companion guide on HR training consultancy and HR team training is the better starting point. The two are designed to be read together: one on the effectiveness of the learning, one on the capability of the function that owns it.

Frequently Asked Questions

What is the difference between training and development that changes behaviour and training that does not?

Training that changes behaviour is built around a specific observable behavioural target and uses rehearsal, bespoke scenarios and structured embedding. Training that does not is built around content delivery and measured by satisfaction. The Kirkpatrick framework names the gap: most programmes reach Level 2 (learning) but never reach Level 3 (behaviour in real work). Sidestream designs every programme from Level 3 backwards.

Why is UK learning and development spend being cut?

UK employer training spend fell from £55.4bn in 2011 to £44.8bn in 2024, a £10.6bn fall, according to the Skills England Annual Skills Report 2026. A large part of that decline reflects finance teams losing confidence that training reaches behaviour. A programme that can only show satisfaction scores cannot defend itself against a budget cut; training and development that measures behaviour change can.

What is Kirkpatrick Level 3 and why does it matter?

Level 3 is observed behaviour in real work after the programme: whether people actually do the job differently. It matters because Levels 1 and 2 are achievable by almost any well-delivered programme and predict very little about real change. Sidestream commits to Level 3 as the minimum standard, with Level 4 (downstream metric) where the brief allows. See our full explainer on the Kirkpatrick model.

Does Sidestream replace our in-house L&D team?

No. We work alongside it as a behaviour-change partner. The L&D team keeps the strategy, framework and platform; we take the part of the portfolio where behaviour change is the goal and conventional content has not reached it, and design a bespoke immersive programme for that target.

How quickly can a programme be designed and delivered?

A typical engagement runs around thirteen weeks: a diagnostic phase, a design phase, the delivery itself, then a six-week embedding phase and a measurement phase. The embedding phase is what determines whether the programme produces sustained behaviour change, so it is rarely compressed even when the earlier phases are.

Start a Training and Development Conversation

Book a free 30-minute consultation at calendly.com/info-sidestream. Bring the behaviour you need to see more of, and the programme that has not yet produced it. We will tell you honestly whether Sidestream is the right fit. Get in touch today. We are Sidestream.

More on this topic: Interactive Training That Changes Behaviour ยท Teamwork Training: From Activities to Behaviour