The publication of the revised National Planning Policy Framework in December 2024 was accompanied by a new set of Housing Targets for each of England’s unitary, district and borough authorities.
Much has been made of the new targets by rural councils, claiming that these will require large areas of countryside will need to be “concreted over” to deliver the number of homes demanded of them. This is, of course, hyperbolic nonsense: the amount of land required is often a fraction of one percent of the total area of the council areas, even if this assumed that all the new homes were to be built on virgin land. In fact, most of the new homes will come from urban and suburban intensification.
For each authority in England, I’ve created a single map tile which demonstrates how much land would be needed to deliver the local housing targets were they built at a modest density of 40 dwellings per hectare (dph). Clearly in urban areas this figure would be significantly higher, and lower in rural regions; but sticking with a consistent density makes a comparison easier.
Here are all of the authorities in England and the number of new homes required per hectare needed to meet the new targets. As you’d expect, the highest concentration of homes is in London, with the Royal Borough of Kensington & Chelsea topping the league with a requirement to find 20 new homes per hectare (bearing in mind that many parts of outer London are already below this density, even consider existing homes).
For the last few months I’ve been attempting to map every bit of “grey belt” land within London’s green belt. As there’s been some uncertainty over exactly what qualifies as grey belt, I’ve adopted a cautious definition, including areas which appear to have been previously developed, including car parks and yards, earthworks and hard standing.
This builds on work I’ve previously undertaken which attempts to use AI to locate small site development opportunities in London’ssuburban areas. You can read more about this here.
I used our existing learning model (based on the InceptionV3 convoluted neural network) to assess an area of green belt using higher-resolution aerial photography provided to me by aerial mapping specialist Bluesky with the purpose of categorising the types of land cover. Our learning model was trained on aerial photography / satellite imagery from Google.
The following describes the methodology followed and some of the initial findings.
The area of sample data provided by Bluesky is contained within the white rectangle shown within the following image. The pink masked area is not within metropolitan green belt; the remainder is. The total area of the sample data provided by Bluesky covers an area of approximately 390ha, with just over three quarters of this within the metropolitan green belt.
Within this area I created a grid of points at 25m centres (i.e. a 625sqm tile). The image to the right shows the distribution of these within the examination area.
Using our existing learning model (based on Google satellite data), I assessed the ground conditions at each data point. The image on the right shows how the AI learning model has categorised the landscape features at each point. The pattern of ground cover can be seen here, with dark green representing woodland and tree cover, blue corresponding with areas of water (although there are some limitations to this, described below).
One of the limitations of this is that using the higher-resolution Bluesky photography some tiles contain multiple “types” and the AI struggles to distinguish between, say, a domestic garden and a residential house. Because of the limited resolution of the Google imagery this was not a significant issue, but at higher resolutions the data points can miss key features, such as entire houses, classifying the gardens to the front and rear, but not the structures themselves.
To address this, I introduced a data point grid with double the resolution, with points at 12.5m intervals. The image here shows the distribution of these on the same map. Given that the 25m grid across the entirety of London’s metropolitan green belt results in more than 8m data points, adopting a 12.5m grid will result in four times this amount.
Running the AI again using data with the Bluesky imagery results in this image, which includes the non-green belt area in the south west of the map, (which was included in error, but demonstrates the efficacy of the process). The patterns of land cover are clearly visible at this scale.
The image below shows a small area of the sample data. Moving the slider across the page reveals the AI predictions for each point at 12.5m intervals.
Golf courses are generally identified correctly, likely due to the presence of sand bunkers and the curved shapes of the fairways visible in the image, although there are some odd predictions for “water” which need further investigation.
Water is being identified correctly, as this lake within Aldwickbury Park Golf Club shows. There are some anomalous categorisations occurring around the periphery of the lake, likely due to surface markings fooling the AI into thinking these are sports pitches, as the pale line around the edge resembles the white lines of a football pitch.
As with the golf course, crops and woodland which appear as dark areas of consistent colour are incorrectly being identified as water. An improved learning model will be needed to account for this, as with this resolution these types should be distinguishable from bodies of water.
Car parks appear to be identified correctly, although due to the resolution of the image some areas within the car parking areas are being incorrectly identified. In some cases the markings on roads, or areas where no cars are parked, are fooling the AI into thinking that these are hard courts. The presence of parked cars is generally necessary for the AI to correctly identify land used for this purpose.
Domestic gardens need to be included within the “allotments and garden centres” type. Previously, most gardens fell within the “buildings – domestic” type as the 25m tile would generally include both. At 12.5m, the two are usually found in separate image tiles and therefore correctly identified.
For an unknown reason, some crops appear to be categorised as golf courses. Unclear as to why. Further investigation required. It is likely that adding these areas to the test and training data will teach the AI to distinguish between these with a greater degree of accuracy.
Fallow fields appear to be correctly identified, although the sample data from Bluesky does not include and “earthwork” sites so the accuracy of this prediction cannot be assessed. Note some anomalous identifications as “water” to the right of this image. Futher training of the learning model using a larger sample of the Bluesky data should assist in correcting these.
This exploratory exercise has demonstrated that higher-resolution mapping data, with imagery taken within a narrower seasonal timeframe, results in more accurate predictions, although to take advantage of this, a significant increase in data points will be required. The length of time required to generate the imagery and process the data will be significant and demand an increase in processing power.
The following describes a potential methodology for establishing the extent of “grey belt” using a geometrical approach to settlements within London’s metropolitan green belt.
1. This example uses Brentwood in Essex. The settlement is entirely enclosed by London’s metropolitan green belt.
In this diagram the orange colour shows the extent of Brentwood which is not within this green belt.
2. Creating a new boundary around the furthest extents of Brentwood’s boundary.
This identifies those areas which do not contribute to the five purposes of the green belt according to the NPPF. These areas could be developed without the settlement encroaching into the countryside.
3. To ensure that neighbouring settlements are prevented from “merging”, establish a 250m buffer where no development should occur.
Should this distance vary depending on the nature of the landscape? Is 250m too great, or too little?
4. Those areas within easy access of a station should also be developable, provided that this does not encroach on the buffer zone established above. Here, this has been set at 1,600m from existing stations: approximately 5 minute’s cycle ride.
4. Subtracting the geometries above, arrive at a net area which is contained entirely by the outer extents of the settlement, but excludes those areas not covered by green belt protection.
5. From this area remove any protected habitats such as SSSIs, Ancient Woodlands, Areas of Outstanding Natural Beauty, and so on.
6. Here the final geometry is shown with an 800m / 10-minute walking distance and 1,600m 5-minute cycle distance.
7. And then overlaid onto a satellite photo of Brentwood, showing the landscape around the town which could be considered as grey belt.
Following on from my experiments using AI to identify and classify small site development opportunities in London’s suburbs, I’ve been deploying the same pattern recognition software on aerial photography to see if it’s possible to categorise, and quantify, how much of London’s green belt has the potential for new homes.
The new government has made much of the potential for so-called “grey belt” to meet the country’s housing need, particularly where this is located close to public transport. The Labour manifesto stated that “the release of lower quality ‘grey belt’ land will be prioritised and we will introduce golden rules’ to ensure development benefits communities and nature.”
This was followed by a consultation version of the National Planning Policy Framework (NPPF) in July 2024, which established a definition for what grey belt is:
“For the purposes of plan-making and decision-making, ‘grey belt’ is defined as land in the green belt comprising Previously Developed Land and any other parcels and/or areas of Green Belt land that make a limited contribution to the five Green Belt purposes (as defined in para 140 of this Framework), but excluding those areas or assets of particular importance listed in footnote 7 of this Framework (other than land designated as Green Belt).”
The Green Belt today accounts for more land in England than land that is developed – around 13 per cent compared to 10 per cent. Yet as many assessments show, large areas of the Green Belt have little ecological value and are inaccessible to the public. Much of this area is better described as ‘grey belt’: land on the edge of existing settlements or roads, and with little aesthetic or environmental value. It is also true that development already happens on the Green Belt, but in a haphazard and non-strategic way, leading to unaffordable houses being built without the amenities that local people need.
There are wildly varying estimates as to how much of the green belt could be considered as “grey”. Property technology outfit LandTech suggested it could be around 0.4%, but CBRE estimated it as 7%. Bearing in mind that England’s metropolitan green belt covers some 16,384 square kilometres (with London’s 5,085sqkm of this), even at 2% this would be enough for more than 1.6m homes. It’s a lot.
Setting aside for one moment the varying definitions of grey belt, I’ve been trying to calculate how much of the green belt is covered by stuff that’s clearly not green. This isn’t necessarily an attempt to find locations to build new homes, but simply an exercise to find the car parks, breaker’s yards, quarries, waste transfer depots and landfill sites that are currently protected from development by virtue of green belt protection.
Counting the Dots
Using GIS mapping I created a grid with intersections every 25m and applied this to the entirety of London’s green belt. In total that’s approximately 8.1 million data points. This is too much to handle in one go, so I divided this down according to planning authority; there’s 72 district councils, unitary authorities and London boroughs who have at least some of the capital’s green belt within them.
The unitary authority of Buckinghamshire has the greatest area of green belt, with 15,630 hectares. This map is taken from article I wrote about the impact of new housing targets on each of England’s planning authorities, and provides a quick overview of the green belt in each:
The smallest is the Royal Borough of Greenwich, which has a tiny area of green belt in its south east corner (so small it can’t be seen in the map below).
At every data point within an authority area I exported a Google satellite image covering an area of 625sqm (i.e. contained within a 25m x 25m tile). I then manually grouped a number of these into categories. Here’s an example of each classification showing the types of images generated:
Armed with a large set of images sorted into folders according to what I could see in each tile, I then trained an AI learning model on the resulting data. Applying this learning model to the remaining images and returning this data to the GIS software, I was able to map the location of every data point and show the distribution of different land types across each authority area.
I started with St Albans City and District Council (SACDC). Below you can see an area in the south of the district, close to the border with Hertsmere. Each coloured circle corresponds to a different designation. The outer colour shows the AI’s best estimate at what’s on the ground, the inner colour are those categories I have determined could be classified as “grey belt”. Tarmac’s Harper Lane asphalt plant appears as a concentration of red and orange blobs; woodland as dark green rings, fields as lighter green, and agriculture in a lighter shade. The grey rings represent buildings, which you can see to the right of the image, with car parks among these in red.
An AI identification of data points in south SACDC showing how the learning model interprets satellite imaging using 25m x 25m tiles.
Although it’s possible to spot some inaccuracies in the classifications, overall this appears to be pretty convincing. One of the issues is that Google’s satellite data varies in resolution across the country, and at which time of the year it was taken. This can lead to agricultural land being misidentified as earthworks, for example, when the picture was taken in winter and the crops harvested. In some remote places the ground is obscured by cloud or vapour trails from passing planes. The use of better aerial photography will help iron out these inconsistencies.
Zooming further into the image you can see numbers appearing within the centre of each dot. This is the “confidence level” that the AI has ascribed to each identification. A score of, say, 60%, means that the AI thinks the tile meets the characteristics of a classification with this degree of confidence. The higher the figure, the greater the chance it’s correct.
This is useful, because the AI also then includes a secondary and tertiary prediction with reducing levels of confidence. Because the tiles cover an area of 625sqm it is likely that they might include different types of ground cover: part car park, part woodland, for example. By multiplying the percentage confidence level by 625 we can get an approximation of how much of the tile is covered by each category. So, a tile that the AI has identified as having a primary classification of “tree cover” with a confidence level of 80% and a secondary classification as “car park”, with 20% confidence, might contain 500sqm of the former and 125sqm of the latter.
While individual data points can display odd results, there are ways to even out the anomalies. Grey belt land uses tend to cover a wider area than just 625sqm, and as can be seen above, the significant opportunities tend to be where a number of these are grouped together. Using a clustering algorithm in GIS we can find those parts of the study area in which many of the positive data points appear. Although these are not particularly useful for analysis, they can help signpost parts of the green belt which are worthy of further investigation.
The map below shows the southeastern corner of Buckinghamshire, a unitary authority. Here I’ve used a clustering algorithm to create a series of circles where groups of grey belt points occur. The grey areas in this image are not within the green belt – everything else is.
Zooming into this area further shows that these clusters are broadly correct. There are some large solar farms in this area, something that’s not found in St Albans (and the dataset on which the learning model was trained), so the AI is incorrectly identifying these as water. A further refinement will be to include a new category for photovoltaic panels. I’ve masked the areas not covered by green belt protection, which are shown in grey.
In the image comparison above you can see the large earthworks (Cemex’s Langley quarry) in the centre of the frame, and the eastern part of Thorney business park to the north. The two areas to the west are Traveller sites, with a solar farm misidentified as water in the bottom right. What this shows is that the use of clustering can help in identifying the areas of potential grey belt. The M25 can be seen running north to south on the right-hand side, and the Elizabeth Line across the centre, broadly parallel to the Grand Union Canal above it. Importantly, Iver Station is just about visible to the left of the intersection between the railway and motorway. Trains stop here every few minutes, so one wonders whether this not make a good location for new homes: the area to the north of the station is clearly a candidate for new development were it not protected by virtue of its green belt designation.
We know that each data point represents an area of 625sqm, so by counting the number of these within each category as determined by the AI, we can quantify the total amount of land occupied by each.
The following table shows the classification of each of the 210,283 points within St Albans District. I’ve highlighted in green those categories of land that I think could be included within a “grey belt” designation.
Category
Data points
Area (ha)
% of Total
Grey Belt? (ha)
Allotments & Garden Centres
2,102
131
1.00%
0
Buildings – Commercial
2,035
127
0.97%
0
Buildings – Domestic
5,208
326
2.48%
0
Car Parks
1,963
123
0.93%
123
Cemeteries
202
13
0.10%
0
Earthworks
1,066
67
0.51%
67
Golf Courses
8,217
514
3.91%
0
Highway
11,612
726
5.52%
0
Open Space – Agriculture
65,846
4,115
31.31%
0
Open Space – Fallow Fields
11,349
709
5.40%
0
Open Space – General
38,778
2,424
18.44%
0
Open Space – Wasteland
171
11
0.08%
11
Polytunnels & Greenhouses
1,325
83
0.63%
0
Railways
1,205
75
0.57%
0
Sewage Farms
4
0
0.00%
0
Sports – Hard Courts
4,493
281
2.14%
0
Sports – Pitches & Fields
2,868
179
1.36%
0
Tree Cover
40,772
2,548
19.39%
0
Water
9,680
605
4.60%
0
Yards
1,387
87
0.66%
87
Total
210,283
13,143
100.00%
287
2.18%
Based on these figures we can see that 287 hectares (or 2.18%) of St Alban’s green belt is being classified by the AI as “grey”. This includes earthworks, some of which are likely to have been misidentified due to the similarity between this and dry / fallow fields.
Looking generally at the distribution of these “earthworks” classifications, it appears that around half are likely identified correctly. That leaves around 250 hectares of land with which there’s a high level of certainty over the nature of the surface. That’s just shy of 2% of St Alban’s green belt: or 12,500 homes, if this were built out to a notional capacity of 50dph.
Of course not all of this land will be suitable for development—some is far from public transport or the road network, or otherwise occupied by productive pourposes (some of those concrete batching plants will actually be needed to build these homes!). But even if we generously discount half of the space identified, that still leaves a huge area of grey belt that can be developed for new homes. St Albans’ new five-year housing target, under the new Standard Method, is 7,720 homes. Most will be delivered, one assumes, through urban and suburban intensification. But let it not be argued that the district doesn’t have enough land to deliver these homes.
England’s metropolitan green belt covers an area of 1.64m hectares. Even if just one percent of this falls under the commonly-accepted decision of “grey belt”—car parks, breaker’s yard and landfill—then at over 800,000 homes that’s more than half of the new government’s five-year housing target. This is clearly an opportunity not to be missed.