r/dataisbeautiful • u/oscarleo0 • 38m ago
r/dataisbeautiful • u/AutoModerator • 24d ago
Discussion [Topic][Open] Open Discussion Thread — Anybody can post a general visualization question or start a fresh discussion!
Anybody can post a question related to data visualization or discussion in the monthly topical threads. Meta questions are fine too, but if you want a more direct line to the mods, click here
If you have a general question you need answered, or a discussion you'd like to start, feel free to make a top-level comment.
Beginners are encouraged to ask basic questions, so please be patient responding to people who might not know as much as yourself.
To view all Open Discussion threads, click here.
To view all topical threads, click here.
Want to suggest a topic? Click here.
r/dataisbeautiful • u/paveloush • 17h ago
OC [OC] I visualized 52,323 populated places in European part of Spain and accidentally uncovered a stunning demographic phenomenon.
r/dataisbeautiful • u/TeslaTorah • 3h ago
71% of US workers now have pets and most aren’t returning to the office [2025 Research]
It’s not just about flexibility anymore. For the majority of American workers, remote work means being better employees and better pet parents.
r/dataisbeautiful • u/IdkJustPickSomething • 46m ago
OC [OC] My 18k wedding for ~80 people
Trying this again when it's Monday for my [OC]. My data source was manually tracked expenses and categorized into SankeyMATIC.com I love a Sankey. Other graphs were from Excel. Please be kind if I made a mistake, I am a human.
My total headcount given was 79 adult guests, 96 with vendors and children (the math to count kids was weird). Honestly most of our guests were married couples, a few kids, and 4 single people total.
Sankey: We planned a wedding we wanted, not expecting anything from parents. We are very grateful of their unexpected contributions. *Most* of the contributions came with no strings attached, which was very stress free. Ask away, this is the bulk of the info!
Excel graphs:
We had very few no shows: one couple missed their flight and one plus one didn't show. One coworker randomly sent me $20 on venmo the morning of my wedding, so she's the "not invited" and man do I feel bad about not inviting her!
Day of, we had 2 gifts to take home. The rest were sent before or slightly after. Just a bunch of cards!
I excluded the monetary gifts noted on the left of the Sankey in an effort to not distort the data, so you could see how much was actually given by guests. As you can see, most cards represented two people (as mentioned, mostly couples), so the amount is how much was given by the couple. One 0 was the coworker who sent money, the other 0 was the no show couple (kept them on the list to send a thank you, since they tried).
I'm not sharing this to comment on the price of weddings in general, or any commentary on the wedding industry. Don't come at me for spending money that you wouldn't spend. I'm voluntarily sharing data, so don't judge my choices.
r/dataisbeautiful • u/WhySoTedious • 3h ago
OC [OC] My [41M] on/off ~2.5-year dating journey with people I met on OLD apps
r/dataisbeautiful • u/Sarquin • 1h ago
OC [OC] Distribution of Prehistoric Mines and Lithic Assemblages across Ireland
Using National Monument Service data for Ireland and Department for Communities data for Northern Ireland, here’s my attempt at mapping out prehistoric mine locations across the island. I’ve also added in lithic assemblages as a possible proxy for flint locations though appreciate that’s more of a stretch.
It’s worth noting that the DfC data (Northern Ireland) doesn’t include the same breakdown for mine locations so it’s not a like for like comparison.
The map was built using some PowerQuery transformations and then designed in QGIS. I’m still learning so this is just my latest attempt and hopefully they’ll keep getting better.
Feedback always welcome.
r/dataisbeautiful • u/jargs92 • 1h ago
OC [OC] The semantic embedding and visualization of the entire corpus of cancer research (2.5 million papers)
I created an interactive map of the entire corpus of cancer research from 2010 to 2025, representing ~ 2.5 million papers. The map is based on the titles and abstracts of papers, which were embedded using a transformer neural network, projected with UMAP, and clustered with Leiden.
The atlas is available for full exploration on my website: https://www.litletter.net/cancer-atlas, where you can zoom into any area of the atlas, and click on paper titles to read them
There are 46 distinct communities, each representing a core area of focus within the field.
These clusters span the breadth of cancer research, including:
- Cancer types: Breast, lung, prostate, pancreatic, glioma, colorectal, melanoma, and more
- Treatment strategies: Immunotherapy, targeted therapies, neoadjuvant approaches, drug delivery systems
- Molecular and cellular biology: Signaling pathways, non-coding RNAs, epigenetics, metabolism
- Clinical and diagnostic domains: Patient outcomes, imaging, diagnostics, risk assessment
- Cross-cutting and emerging themes: Tumor microenvironment, inflammation, viral therapies, AI in oncology
r/dataisbeautiful • u/oscarleo0 • 1d ago
OC [OC] Percentage of people who say that Religion is very or rather important in their life
r/dataisbeautiful • u/frozenpandaman • 15m ago
OC [OC] I passed 15,000 unique km of railways traveled in Japan, including six prefectures and 52 companies' lines ridden completely!
I tend to think the prefectural symbols (used on flags) and railway company logos are both pretty cool-looking, heavily based on – or sometimes directly taken from – [kamon](https://en.wikipedia.org/wiki/Mon(emblem))_, emblems used by families and clans beginning in the Heian period and throughout Feudal & Early Modern Japan. Besides updating the map with some new milestones, I decided to show the prefectures, major distance markers, and companies that I've "completed" as well. Figured people might enjoy looking through the various symbols!
r/dataisbeautiful • u/be_data • 51m ago
OC [OC] Europe 2024: Higher GDP → Shorter Weeks, Longer Careers — Longer Weeks in Central & Eastern Europe
r/dataisbeautiful • u/Luton_Enjoyer • 23h ago
Horatio Hornblower's rank in each story and year of publication
r/dataisbeautiful • u/birdbirdeos • 21m ago
OC [OC] Applications for PhD in Molecular Microbiology
r/dataisbeautiful • u/Proud-Discipline9902 • 6h ago
OC [OC]Japan’s Listed White Goods Leaders — Market Cap Trends at a Glance
Data Sources: Market Capitalization: Sourced from MarketCapWatch.
We selected only publicly listed Japanese companies (Market Cap>USD $10 Billion) with significant operations in the white goods sector (large household appliances such as refrigerators, washing machines, and air conditioners). Market cap data as of Aug 22, 2025 (converted to USD where appropriate).
FAQ — Understanding This Chart
Q1: Why isn’t Sony included? Sony’s core business is in consumer electronics, gaming, entertainment, and financial services. It does not operate at scale in the white goods category (large household appliances like fridges, washing machines, ovens, and air conditioners), so it falls outside the scope of this chart.
Q2: Why isn’t Toshiba included? Toshiba was delisted from the Tokyo Stock Exchange in December 2023 after a buyout and is now privately held. Because our dataset only covers publicly listed companies, Toshiba is excluded.
Q3: What exactly counts as “white goods” in this analysis? We define white goods as large household appliances for cooling, cleaning, and cooking — e.g., refrigerators, washing machines, ovens, dishwashers, and air conditioners. Companies must have significant sales in these product categories to qualify.
Q4: Why are companies mainly HVAC specialists (e.g., Daikin) included? HVAC products — particularly large air conditioning systems — fall under the “white goods” umbrella in many industry classifications. Companies with significant domestic appliance presence in HVAC are included if they are publicly listed in Japan.
Q5: Why does the market cap vary so much between companies? Some giants (like Hitachi or Mitsubishi Electric) are diversified conglomerates with revenue streams far beyond appliances.
r/dataisbeautiful • u/ramnamsatyahai • 2d ago
OC [OC] Night-time Light in Asia, 2014 vs 2024 Comparison (Updated)
Reposting with updated data , the 2012 composite used a different method and partial coverage, which made some regions (like Thailand) appear darker. This version uses average annual masked VIIRS data for a fairer 2014–2024 comparison.
r/dataisbeautiful • u/Proud-Discipline9902 • 2d ago
OC [OC]Top 20 Global Defense Contractors by Market Capitalization
Methodology & scope:
- Universe: Publicly traded companies with ≥25% of revenue from defense‑related products/services.
- Source: Market capitalization (USD) as of Aug 2025, sourced from MarketCapWatch, Nasdaq’s 2025 defense stock review, and Forbes’ 2025 defense picks, cross‑checked with recent filings.
- Inclusions: Dual‑sector aerospace & defense firms (e.g., Boeing, Safran) where defense is a major revenue driver.
- Exclusions: Fully private/state‑owned entities (e.g., Rostec, NORINCO) without a listed arm.
r/dataisbeautiful • u/willkoeppen • 2d ago
OC [OC] The July 4 flash flood on the upper Guadalupe River (water level heights above normal)
This animation shows water levels on the upper Guadalupe River from midnight July 4, 2025, to 6 p.m. July 5 (local time). The flood killed 119 people in Kerr County, including 25 girls and two teenage counselors at Camp Mystic.
Data sources
- Raw stream gauge data from the USGS was downloaded and processed to be consistent 5-minute data; it was then normalized to the average July water level at each station to get "height above normal."
- The basemap was created using data from Natural Earth, the National Hydrography Dataset, and the U.S. Census Bureau's TIGER database
Tools:
- Python for data harvesting, processing, and basemap generation
- Svelte 5, D3, and custom JavaScript for visualization
Interactive version with contextual information: https://www.willkoeppen.com/datavis/guadalupe-floods/
r/dataisbeautiful • u/TheHonestRedditer • 2d ago
OC Emotional Categories in 1548 Anonymous Daily Letters Exchanged Between Strangers [OC]
Data source: Collected from my web app Daylettr, where users anonymously write one daily note for the next user and receive a random one from the previous one. This captures raw human thoughts under guaranteed anonymity (no logins, no tracking). Full dataset: 1548 messages
Tools: Python (pandas for processing, seaborn/matplotlib for visualization). Emotions classified via keyword matching (e.g., 'hope' for words like 'hope', 'better'; expandable for nuance).
Insights: Anonymity seems to encourage positivity (even if it seems that it might do the opposite), over 60% of messages fall into uplifting categories like kindness, gratitude, and hope. But there's depth: reflection dominates when people ponder life, with rare but raw sadness or humor peeking through. It shows humanity's spectrum: supportive yet vulnerable.
r/dataisbeautiful • u/USAFacts • 2d ago
OC Charter school enrollment (percentage of students) by state [OC]
r/dataisbeautiful • u/haydendking • 2d ago
OC [OC] Housing and Utilities Expenditures in the US
r/dataisbeautiful • u/FluidModeNetwork • 22h ago
OC [OC] Overall ranking for 51+ Countries
My sheets document includes the sources, but the ranking uses 13 different sources. Sadly, not every country is included in every source so you will see blank spaces for countries that are left out in the data. I've also created a correlation index to see how different metrics matched up with each other and you can see the data I used for each ranking.
https://docs.google.com/spreadsheets/d/1YbfVevxEthNgDtK69P48Xm39bXLHi8eqfeFwxTTYEJE/edit?usp=sharing
Hope you like it, lemme know if you have any questions.
r/dataisbeautiful • u/andtitov • 22h ago
OC [OC] 14 days of unbelievable mental and physical rollercoaster captured in one graph
I tracked my body composition before a 7-day water fast, right after, and then after 7 days of refeeding.
- Total weight dropped from 162.1 → 150.4 lbs, then came back up to 157.2 lbs.
- Fat mass went down 21.4 → 16.8 lbs, then only partially returned (17.3 lbs).
- Lean tissue dipped during the fast but mostly came back after refeed.
- Bone mass stayed stable.
One picture shows just how extreme - and fascinating - the changes were 😊
r/dataisbeautiful • u/Inboxmeyourcomics • 1d ago
OC [OC] The cascading file folders naturally became a galaxy
When using the file visualization graph view, the files from this subset naturally form a two-arm galaxy. Data source shown in following images. Tools used: obsidian MD
r/dataisbeautiful • u/ppsreejith • 1d ago
Who’s Really Getting Green Cards? A Look at 200K+ PERM Certifications (2020-2024)
A dataset of PERM applications from the US Dept of Labor & AI chat to allow you to explore the data
r/dataisbeautiful • u/Rauram99 • 2d ago
OC [OC] Housing prices and salaries - Three immigration levels (2023-2024)
Notes:
I only included countries with >0.830 HDI >5 Millions population.
Net migration rates are a cumulative average for the last 5-10 years.
r/dataisbeautiful • u/rsrgrimm • 2d ago
How did draft position affect fantasy football league performance in 2024? (12-man leagues, snake draft)
To assess how draft position affected league performance, I looked into over 400 12-man leagues (all snake drafts) and plotted win ratio, normalized points earned (normalized within a given league to account for various scoring and roster settings), and final league ranking for each draft position.
Surprisingly, 1st pick performed worst on average across all metrics.
League data collected from Sleeper API.