• Coding frequency and tools
    • Coding frequency
    • What people are using code for
    • Access to and knowledge of programming languages
    • Access to coding tools
    • Knowledge of coding tools
    • Access and knowledge gaps
  • Coding capability
    • Change in coding ability during current role
    • Where respondents first learned to code
  • Coding practices
    • General coding practices
    • Documentation
    • Dependency management
    • Reproducible workflow packages
  • RAP knowledge and opinions
  • Knowledge of RAP
    • Opinions on RAP
  • RAP scores
    • RAP components
    • Basic RAP scores
    • Advanced RAP scores

This is a living document. We want our users to get as much value as possible from our analysis. Because of that, we will be updating these pages with additional outputs and insights when possible. To make sure you always access the latest insights, we recommend linking to these pages rather than sharing this analysis any other way at this stage.

Coding frequency and tools

Coding frequency

We asked respondents “In your current role, how often do you write code to complete your work objectives?”.

Show chart Show table
Coding frequency Count
Never 1
Rarely 7
Sometimes 11
Regularly 19
All the time 11
a Sample size = 49


What people are using code for

We asked respondents what data operations they carry out in their work, and whether they use code to do them. Please note we did not ask how much of each data operation is done with code, or how often.

Show chart Show table
Data operation I do some or all of this by coding I do this without coding
Analysis 43 5
Data cleaning 38 1
Data linking 40 0
Data transfer 22 5
Data visualisation 21 22
Machine learning 3 0
Modelling 14 6
a Sample size = 49


Access to and knowledge of programming languages

Given a list of programming tools, we asked respondents to answer “Yes”, “No” or “Don’t know” for the following statements;

  • This tool is available to use for my work.
  • I know how to program with this tool to a level suitable for my work.

Access to tools does not necessarily refer to official policy. Some analysts may have access to tools others cannot access within the same organisation.

Please note that capability in programming languages is self-reported here and was not objectively defined or tested.


Access to coding tools

Show chart Show table
Programming language Yes Don't Know No
C++ / C# 2 18 29
Java / Scala 4 20 25
Javascript / Typescript 7 16 26
Matlab 1 17 31
Python 17 7 25
R 22 8 19
SAS 48 0 1
SPSS 13 15 21
SQL 29 9 11
Stata 7 22 20
VBA 32 9 8
a Sample size = 49


Knowledge of coding tools

Show chart Show table
Programming language Yes Don't Know No
C++ / C# 4 3 42
Java / Scala 3 2 44
Javascript / Typescript 6 2 41
Matlab 12 3 34
Python 9 3 37
R 14 5 30
SAS 42 0 7
SPSS 10 1 38
SQL 30 4 15
Stata 6 3 40
VBA 19 2 28
a Sample size = 49


Access and knowledge gaps

Using the data presented above we calculated the number of respondents with:

  • Access to tools they do not have the capability to use (access only),
  • Access to tools they are able to use (access and knowledge)
  • Or capability to use tools they cannot access at work (knowledge only)
Show chart Show table
Programming language Access only Access and knowledge Knowledge only
C++ / C# 1 1 3
Java / Scala 3 1 2
Javascript / Typescript 2 5 1
Matlab 1 0 12
Python 14 3 6
R 14 8 6
SAS 6 42 0
SPSS 9 4 6
SQL 5 24 6
Stata 6 1 5
VBA 16 16 3
a Sample size = 49

Coding capability


Change in coding ability during current role

This question was only asked of respondents with coding experience outside of their current role. This means analysts who first learned to code in their current role are not included in the data.

Show chart Show table
Ability Change Count
Significantly worse 3
Slightly worse 6
No change 13
Slightly better 15
Significantly better 5
a Sample size = 42


Where respondents first learned to code

Respondents with coding experience outside their current role were asked when they first learned to code. This output also includes analysts who code in their current role but reported no other coding experience.

These data only show where people first learned to code. They do not show all the settings in which they had learned to code, to what extent, or how long ago.

Show chart Show table
First coding experience Count
In current role 8
In education 22
In private sector employment 2
In public sector employment 8
Self-taught 8
Other 1
a Sample size = 49


Coding practices

We asked respondents who said they currently use code in their work how often they carry out various coding practices. For more information on the practices presented below, please read our guidance on Quality Assurance of Code for Analysis and Research.

General coding practices

Show chart Show table
Percent
Question I don't understand this question Never Rarely Sometimes Regularly All the time
I use open source software when programming 4.2 56.2 14.6 10.4 6.2 8.3
My team open sources its code 6.2 81.2 6.2 2.1 4.2 0.0
I use a source code version control system e.g. Git 2.1 58.3 18.8 4.2 12.5 4.2
Code my team writes is reviewed by a colleague 2.1 0.0 12.5 31.2 31.2 22.9
I write repetitive elements in my code as functions 8.3 8.3 14.6 33.3 27.1 8.3
I unit test my code 35.4 2.1 12.5 20.8 16.7 12.5
I collect my code and supporting material into packages 16.7 35.4 16.7 14.6 14.6 2.1
I follow a standard directory structure when programming 20.8 8.3 8.3 16.7 29.2 16.7
I follow coding guidelines or style guides when programming 2.1 2.1 8.3 37.5 37.5 12.5
I write code to automatically quality assure data 4.2 12.5 16.7 37.5 14.6 14.6
My team applies the principles set out in the Aqua book when carrying out analysis as code 25.0 0.0 4.2 37.5 27.1 6.2
a Sample size = 48


Documentation

Embedded documentation is one of the components which make up a RAP minimum viable product. Documentation is important to help others be clear on how to use the product and what the code is intended to do.

Show chart Show table
Percent
Question I don't understand this question Never Rarely Sometimes Regularly All the time
Code comments 2.1 0.0 0.0 8.3 43.8 45.8
Documentation for each function or class 10.4 18.8 18.8 33.3 12.5 6.2
README files 8.3 20.8 25.0 27.1 16.7 2.1
Desk notes 14.6 33.3 6.2 16.7 20.8 8.3
Analytical Quality Assurance (AQA) logs 18.8 18.8 20.8 20.8 16.7 4.2
Data or assumptions registers 12.5 35.4 16.7 16.7 14.6 4.2
Flow charts 4.2 33.3 27.1 29.2 4.2 2.1
a Sample size = 48


Dependency management

Respondents who currently use code in their work were asked whether they use any tools for dependency management. We provided examples of tools that may be used for dependency management:

  • Requirements files, e.g. python requirements.txt or R DESCRIPTION files
  • Virtual environments (e.g. venv or renv) or virtual machines
  • Containers e.g. Docker
Show chart Show table
Use dependency management software Count
Yes 3
No 25
I don't know what dependency management is 20
a Sample size = 48


Reproducible workflow packages

Respondents were asked whether they use continuous integration technologies. As above, respondents were provided with examples of what those might be:

  • GitHub actions
  • Jenkins
  • Travis
Show chart Show table
Use reproducible workflow packages Count
Yes 1
No 23
I don't know what reproducible workflows are 24
a Sample size = 48


RAP knowledge and opinions

We asked respondents about their knowledge of and opinions on reproducible analytical pipelines (RAP). RAP refers to the use of practices from software engineering to make analysis more reproducible. These practices build on the advantages of writing analysis as code by ensuring increased quality, trust, efficiency, business continuity and knowledge management. The RAP champions are a network of analysts across government who promote and support RAP development in their departments.


Knowledge of RAP

We asked our respondents whether they had heard of RAP and what their knowledge is of their own department RAP champion.

Show chart Show table
RAP champion knowledge Count
Have not heard of RAP 12
Heard of RAP, have not heard of RAP champions 11
Heard of RAP, does not know department champion 15
Heard of RAP champions, no champion in department 1
Knows department RAP champion 8
a Sample size = 49


Show chart Show table
Line manage anyone who writes codes count
Yes 19
No 10
I don't line manage anyone 20
a Sample size = 49


Opinions on RAP

We asked respondents who had heard of RAP whether they agreed with a series of statements.

Show chart Show table
Percent
Question Strongly disagree Disagree Neutral Agree Strongly agree
I feel confident implementing RAP in my work 18.9 32.4 29.7 16.2 2.7
I feel supported to implement RAP in my work 10.8 21.6 35.1 27.0 5.4
I know where to find resources to help me implement RAP 13.5 18.9 27.0 35.1 5.4
I understand what the key components of the RAP methodology are 5.4 29.7 16.2 40.5 8.1
I think it is important to implement RAP in my work 8.1 10.8 21.6 40.5 18.9
I and/or my team are currently implementing RAP 24.3 35.1 18.9 10.8 10.8
I or my team are planning on implementing RAP in the next 12 months 18.9 29.7 24.3 13.5 13.5
a Sample size = 37


RAP scores

RAP component scores “regularly” or “all the time” to the relevant questions. For documentation, this includes both code comments and README files. For the continuous integration and dependency management components we only collected “yes”, “no” or “I don’t understand the question” responses. As such, we gave “yes” responses a score of 1. The sum total of each respondent’s scores is presented here as “RAP scores”.

A score of one for each RAP component is derived where respondents answered “regularly” or “all the time” to the relevant questions. For documentation, this includes both code comments and README files. For the continuous integration and dependency management components we only collected “yes”, “no” or “I don’t understand the question” responses. As such, we gave “yes” responses a score of 1. The sum total of each respondent’s scores is presented here as “RAP scores”. “Basic components” are the components which make up the RAP MVP. “Advanced components” are components that help improve reproducibility, but were are considered part of the minimum standard.


RAP components

Show chart Show table
Component Type Count
AQUA book guidance Basic 16
Documentation Basic 8
Peer review Basic 26
Team open source code Basic 2
Use open source software Basic 7
Version control Basic 8
Code packages Advanced 8
Continuous integration Advanced 4
Dependency management Advanced 3
Follow code style guidelines Advanced 24
Function documentation Advanced 9
Functions Advanced 17
Unit testing Advanced 14
a Sample size = 48


Basic RAP scores

Show chart Show table
Basic RAP score Count
0 11
1 15
2 15
3 6
4 1
a Sample size = 48


Advanced RAP scores

Show chart Show table
Advanced RAP score Count
0 11
1 15
2 9
3 9
4 2
5 1
6 1
a Sample size = 48