Budget Galaxy
| Rank | Country | Records | Organizations | Total Funding | Top NGO | Sources |
|---|---|---|---|---|---|---|
Top 20 Countries by Records
Regional Distribution
Funding by Ministry
Yearly Trend
Organizations
Data Sources
Transparency Comparison: EU vs Germany
How the EU tracks every euro to a named beneficiary — while Germany reports 92% as anonymous flows
About Budget Galaxy
Budget Galaxy aggregates UK public spending and tax data from the official primary sources. Every number traces back to a named file we committed with a SHA256 hash.
- HM Treasury OSCAR II: ~55 departments, executive agencies and bodies. The 2023 OSCAR edition was reprocessed to correct a £114B NHS artifact that would otherwise double-count health spending. The same n/a-trust artifact was also removed for 2020-2022.
- NHS: trust-level operating cost breakdowns for 206 provider trusts (118 acute, 45 mental health, 18 community, 15 specialist, 10 ambulance), with workforce and cost-type metadata where published. NHS commissioner (ICB) and provider (trust) values are netted against each other to avoid double-counting.
- HMRC tax receipts 2005-2024 (annual bulletin): full breakdown by tax type — Income Tax, NI, VAT, Corporation Tax, fuel duty, stamp taxes, IHT, CGT, APD, IPT, duties and more.
- Income Tax + NI bands for rUK and Scotland, fiscal years 2017-18 through 2024-25, from HMRC's "Rates and thresholds for employers" and Scottish Government publications.
- VAT & indirect tax estimation: ONS "Effects of Taxes and Benefits" decile consumption shares.
- 401 English local authorities from MHCLG Revenue Outturn (2017/18-2024/25), plus 32 Scottish councils from gov.scot for council tax bands.
- 19 English councils with supplier-level spend parsed from their own published registers and classified into MHCLG service categories. An additional 7 police and fire authorities are covered as separate entities.
- Population data: ONS mid-2023 local authority estimates for per-capita analysis.
- OBR Historical Public Finances Database: Public Sector Net Borrowing (PSNB) and Net Debt (PSND) from 2000-01 through 2022-23, with nominal GDP for % calculations.
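The "% calculations" mentioned for the OBR series amount to dividing a fiscal aggregate by nominal GDP for the same year. A minimal sketch, assuming both values are in the same unit (for example £ billion); the function name and the figures are illustrative and are not taken from the OBR database or the project pipeline:

```python
def pct_of_gdp(amount_bn: float, nominal_gdp_bn: float) -> float:
    """Express a fiscal aggregate (e.g. PSNB or PSND) as a percentage of nominal GDP."""
    if nominal_gdp_bn <= 0:
        raise ValueError("nominal GDP must be positive")
    return 100.0 * amount_bn / nominal_gdp_bn


# Illustrative figures only, not real OBR values.
example_borrowing_bn = 50.0
example_gdp_bn = 2000.0
print(pct_of_gdp(example_borrowing_bn, example_gdp_bn))  # 2.5
```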
The full data pipeline is in the project repository. Every number on this site can be traced back to a specific line in a specific source file. See the council spend source manifest and the tax data source manifest for the three-layer audit trail: live landing pages, Wayback Machine snapshots, and SHA256-hashed raw files committed to git.
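The last layer of that audit trail, checking a raw file against its recorded SHA256 hash, can be sketched as follows. This is an assumption-laden illustration rather than the project's own tooling: the function names are hypothetical, and how the manifests store the expected hash is not specified here.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_manifest(path: Path, expected_hex: str) -> bool:
    """Compare a file's digest with the hash recorded for it (hypothetical manifest value)."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

A mismatch means the committed file differs from the one that was hashed, so any figure derived from it should be re-checked against the landing page or the Wayback Machine snapshot.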
Public spending data has gaps, inconsistencies, and quirks that most transparency tools hide. Budget Galaxy documents them here because they matter for how you read the numbers.
When additional countries are enabled (US / Germany / France / Brazil are next on the roadmap), each will get its own gap-disclosure section below.
When you click any node in Explore & Compare or Budget Galaxy, you see a contextual "About this programme" panel. The content comes from one of four tiers (A to D), documented here because their quality varies.
- Tier A: Hand-curated (36 programmes). Written from six batches of research against official publishers: DWP Stat-Xplore, HMRC Annual Report, NHS England TAC, ONS, IFS, OBR, NAO, House of Commons Library briefings. Each entry has a description, beneficiary count, legal basis, 7–11 key stats, recent-change notes, and source citations. Caveat: the underlying research was prepared by LLM agents using their training data (through early 2026). Numbers are cross-checked against our own tree JSON by `scripts/verify_enrichment.py`; the 2026-04-20 audit caught ten stat labels where scope wasn't clear (e.g. "total DSG" nationally vs the specific sub-line a user might be clicking). Those labels now include scope clarifiers. Not every source has a URL yet (31 of 36 entries cite the publisher + title but not a clickable link); URL enrichment is on the post-launch list.
- Tier B: Imported (2,225 nodes). Pre-existing enrichment content in the repo: `program_enrichment.json` (779 compact entries, schema year / description / beneficiaries) and `enrichment_top50.json` (651 rich entries, adding legal basis / notes / OSCAR category). These cover 25% of the tree. Caveat: most of this content was LLM-drafted at an earlier stage and not re-audited. The 2026-04-20 audit flagged 11 top-level departments with incorrect founding years (Home Office listed as founded in 2024 vs actual 1782, MoJ as 2024 vs 2007, etc.). The frontend now suppresses any Established: 2024 field coming from this layer, treating it as a sentinel for "unknown", but other factual details in these 2,225 entries are not individually verified.
- Tier C: Procedural (~1,500 council + council-service nodes). Generated at render time from our own data: `_top_suppliers` metadata joined with ONS mid-2023 local-authority populations (345 England + Wales authorities) to show population, spend per resident, tier-average comparator, top supplier, supplier concentration, and top purposes. 69% of councils in the tree match an ONS population row; the other 31% are police & crime commissioners, fire authorities and combined authorities, which don't have population rows by design (they serve multiple councils' populations combined). Those still get Tier D.
- Tier D: Contextual fallback (every remaining node). For any node that doesn't match A/B/C, the frontend generates a card showing the ancestor chain ("part of X, part of Y"), the node's rank among siblings, and the neighbours immediately above and below it in size. Purely from the tree; zero external data needed. This guarantees every single node in the UK tree shows some context, with no bare "MEH" card anywhere.
- Scotland, Wales and Northern Ireland councils get Tier D (not C) because ONS publishes England & Wales population data only; SG/WG/NISRA equivalents are not yet loaded.
- Combined Authorities (GLA, Greater Manchester, West Midlands etc.) get Tier C via a heuristic but the "tier comparator" line uses the wrong reference group. Cosmetic, not factual.
- Most Tier A source citations point at a publisher + title but not a clickable URL. The publications are all real and findable, but users can't one-click.
- The Tier B 2,225-entry layer has not been individually fact-checked. A user may encounter a date or figure that's out of sync. Please file an issue; the verifier script (`scripts/verify_enrichment.py`) is kept in the repo to make each flag quick to reproduce.
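The Tier D fallback described above needs nothing but the tree itself. A minimal sketch of how such a card could be derived, assuming a simple in-memory tree; the `Node` class, its field names, and the shape of the returned dictionary are hypothetical and do not reflect the actual frontend code:

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass(eq=False)
class Node:
    """Hypothetical tree node: a name, a spend value, and parent/child links."""
    name: str
    value: float
    parent: Optional[Node] = None
    children: list[Node] = field(default_factory=list)

    def add(self, child: Node) -> Node:
        child.parent = self
        self.children.append(child)
        return child


def tier_d_card(node: Node) -> dict:
    """Build a contextual card from the tree alone: ancestors, sibling rank, size neighbours."""
    # Ancestor chain, nearest ancestor first ("part of X, part of Y").
    chain = []
    ancestor = node.parent
    while ancestor is not None:
        chain.append(ancestor.name)
        ancestor = ancestor.parent

    # Rank the node among its siblings by value, largest first.
    siblings = node.parent.children if node.parent else [node]
    ranked = sorted(siblings, key=lambda n: n.value, reverse=True)
    idx = next(i for i, n in enumerate(ranked) if n is node)

    return {
        "part_of": chain,
        "rank": idx + 1,
        "sibling_count": len(ranked),
        "next_larger": ranked[idx - 1].name if idx > 0 else None,
        "next_smaller": ranked[idx + 1].name if idx + 1 < len(ranked) else None,
    }
```

Because every node has at least an ancestor chain or a rank, a function of this shape always returns something to display, which is the property the fallback tier relies on.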
The verifier is reproducible: `py scripts/verify_enrichment.py` writes `data/uk/_enrichment_audit.json` with per-entry flags for tree-match, scope discrepancy, missing URLs, and suspicious founding years. It runs in under a second.
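The founding-year handling (the "Established: 2024" sentinel in Tier B and the verifier's suspicious-year flag) can be illustrated with a small sketch. The `established` field name and the function are assumptions made for illustration; the real schema and the real frontend logic may differ:

```python
from typing import Optional

# Per the text above, 2024 in the imported layer is treated as "unknown".
SENTINEL_YEAR = 2024


def displayable_established(entry: dict, sentinel: int = SENTINEL_YEAR) -> Optional[int]:
    """Return the founding year to display, or None when it is missing or the sentinel."""
    year = entry.get("established")  # hypothetical field name
    if year is None or year == sentinel:
        return None
    return year


print(displayable_established({"established": 2024}))  # None
print(displayable_established({"established": 1782}))  # 1782
```

Suppressing the sentinel hides a known-bad value without asserting a corrected one, which is why the other Tier B fields still need individual checking.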
Budget Galaxy is built and maintained by an independent developer. The project is open and every number is traceable to an official source.
The project exists because public spending data is technically published but practically unreadable. Central government datasets are scattered across official portals, tax receipts are in monthly ODS bulletins, local government data is fragmented across 400+ councils, and nothing connects your specific tax bill to the services it funds. Budget Galaxy connects all of it for the UK; more countries are being onboarded with the same depth.
The data, the pipeline, and the full source are in the project repository.
What this dataset is
This is a compiled dataset from 13 official public primary sources documenting German public funding flows to non-governmental organizations. It covers the period 2010–2025 and contains 528,000+ records.
Every record carries a confidence_level field (verified, estimated, or inferred) and a traceable source_url linking to the original primary source. The dataset documents funding flows — it does not attribute conduct, intent, or wrongdoing to any organization.
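A consumer of the dataset may want to check those two fields before using a record. The sketch below is an illustration under stated assumptions: records are taken to be plain dictionaries, and the helper name is hypothetical; only the field names `confidence_level` and `source_url` and the three allowed values come from the description above.

```python
from __future__ import annotations

from urllib.parse import urlparse

ALLOWED_CONFIDENCE = {"verified", "estimated", "inferred"}


def record_problems(record: dict) -> list[str]:
    """Return a list of problems found in a record; an empty list means both fields look valid."""
    problems = []

    if record.get("confidence_level") not in ALLOWED_CONFIDENCE:
        problems.append("confidence_level must be verified, estimated, or inferred")

    # A traceable source needs at least an http(s) scheme and a host.
    parsed = urlparse(record.get("source_url") or "")
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        problems.append("source_url must be an absolute http(s) URL")

    return problems
```

This only checks that the fields are well-formed; it does not confirm that the URL still resolves or that the source supports the record.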
Statistical patterns in the dataset are detected by automated queries. They indicate data points that warrant further investigation by qualified journalists or researchers. They are not accusations.
Press Standards (Pressekodex Compliance)
| Pressekodex section (Ziffer) | Requirement | This dataset |
|---|---|---|
| Ziffer 1 Wahrhaftigkeit (truthfulness) | Facts must be verified | Every claim links to official primary source URL |
| Ziffer 2 Sorgfaltspflicht (due diligence) | Independent verification | 13 sources cross-checked; confidence_level per record; limitations documented |
| Ziffer 2 Sorgfaltspflicht (due diligence) | Right of reply | Dataset is factual; no conduct attributed to any organization |
| Ziffer 3 Richtigstellung (correction) | Correct errors promptly | Public versioning log with timestamps |
| Ziffer 8 Persönlichkeitsrechte (personality rights) | Protect personal data | All data relates to organizations, not natural persons |
| Ziffer 13 Unschuldsvermutung (presumption of innocence) | No prejudgment | Statistical patterns only; no accusations |
Data Source Licenses
| Source | License | Attribution Required | ShareAlike | Commercial Use |
|---|---|---|---|---|
| OECD CRS | OECD Terms (open, attribution) | Yes | No | ✓ |
| IATI Datastore | ODC-PDDL v1.0 (public domain) | Not required | No | ✓ |
| EU FTS | CC BY 4.0 / CC0 (raw data) | Yes — cite EC | No | ✓ |
| Berlin ZDB | CC BY-SA | Yes | Yes | ✓ |
| Bundeshaushalt | Datenlizenz Deutschland Zero v2.0 | Not required | No | ✓ |
| Bundestag DIP | Public parliamentary docs (§5 UrhG) | Cite Drucksache number | No | ✓ |
| FragDenStaat | CC0 / public domain | Cite fragdenstaat.de | No | ✓ |
| Bremen ZEBRA | Datenlizenz Deutschland | Yes | No | ✓ |
| Demokratie leben! PDFs | Public government docs | Cite BMFSFJ | No | ✓ |
| NGO Monitor | Editorial/research use | Cite ngo-monitor.org | No | ⚠ Research only |
You are free to share, copy, redistribute, adapt, and build upon this material for any purpose, including commercial use, under the following terms:
- Attribution: Cite as 'German Budget Galaxy (2026)' + cite the original primary source for each record used
- ShareAlike: Derivative datasets must use the same CC BY-SA 4.0 license
- No endorsement: Use of this data does not imply endorsement by any source organization
Acceptable Use & Limitations
Permitted uses:
- Journalism, academic research, public policy analysis
- Personal, educational, and non-profit use
- Reproduction with attribution to primary source
Prohibited uses:
- Attributing criminal, fraudulent, or immoral conduct to any organization based solely on this dataset
- Presenting data as “exclusive” when the primary source is publicly available
- Combining with personal data without appropriate legal basis
GDPR / DSGVO note:
All data relates to organizations, not natural persons. Journalistic and research use is covered under Art. 85 DSGVO (GDPR) and § 41 BDSG (Medienprivileg, the media privilege). Contact for corrections: pookie.data@proton.me
Dataset corrections log:
No corrections recorded. Dataset version: 1.0 (2026-03-30)