Datasets

30 datasets

YouTube Shorts Engagement and Growth Velocity (799 Videos)

Engagement rates, views per day, and growth velocity across 799 YouTube Shorts from diverse channels and view counts.

799 rows · 11 columns

AI Job Market: 15,000 Positions Across 20+ Roles and 30+ Countries

Salary, skills, experience level, and remote work data for 15,000 AI job postings spanning 20 roles and 30+ countries.

15000 rows · 19 columns

Adult Census Income Dataset (1994)

32,561 US census records with demographics, education, occupation, and income classification (above/below $50K).

32561 rows · 15 columns

Pima Indians Diabetes Risk Factors (768 Patients)

Clinical data for 768 Pima Indian women with glucose, BMI, age, and diabetes outcomes showing strong glucose-diabetes correlation.

768 rows · 9 columns

US Population by State (2020 Census)

Population of all 50 US states and DC from the 2020 Census, ranked by size with percentage of national total.

52 rows · 5 columns

Motor Function in α-Synuclein Mice Across Gut Microbiome Treatments

Motor function tests (beam, pole, adhesive removal, hindlimb clasping) comparing wild-type and α-synuclein overexpressing mice across 5 microbiome con...

106 rows · 6 columns

Mouse Gut Microbiome and Motor Function (Hindlimb Clasping Scores)

Hindlimb clasping scores across 5 gut microbiome treatments in α-synuclein mice, showing microbiome depletion reduces motor deficits.

108 rows · 6 columns

AlphaZero-Style RL Training Metrics (13 Iterations)

Policy and value network training logs tracking losses, game length, MCTS agreement, and value calibration across 13 self-play iterations.

13 rows · 51 columns

AlphaZero-Style Game Agent Training Metrics (13 Iterations)

Policy and value loss, game outcomes, and MCTS statistics from a reinforcement learning agent training run over 13 self-play iterations.

13 rows · 1 column

Catan AI Self-Play Training Metrics (171 Iterations)

AlphaZero-style training run for Catan: policy/value losses, game lengths, MCTS agreement, and value calibration across 171 self-play iterations.

171 rows · 38 columns

Sleep Actigraphy and Cognitive Scores — Winter 2020 Class Study

Actigraphy sleep metrics and Cambridge Brain Sciences cognitive scores for 58 participants in a university sleep class.

60 rows · 15 columns

College Sleep Quality: Winter 2020 Actigraphy & Cognitive Scores

Actigraphy sleep metrics and CBS cognitive scores for 58 college students, showing wide variation in sleep efficiency (47–93%) and sleep duration.

60 rows · 15 columns

Average Height by Country, Age Group, and Gender (2024)

Average heights for boys and girls at ages 5, 10, 15, and 19 across 195 countries, from the Netherlands (183.8 cm) to Timor-Leste (160.1 cm).

195 rows · 9 columns

AlphaZero Training Metrics: Policy and Value Loss Over 13 Iterations

Reinforcement learning training run tracking policy/value losses, game length, MCTS simulations, and value calibration across 13 self-play iterations.

13 rows · 51 columns

GATE Exam Qualifying Cutoff Scores by Discipline (2015–2020)

GATE cutoff marks across 24+ engineering and science disciplines for General, OBC, and SC/ST categories from 2015 to 2020.

93 rows · 8 columns

US Census Adult Income Data (1994)

Demographics, education, and occupation for 32,561 adults from the 1994 Census, with income above or below $50K.

32561 rows · 15 columns

Fisher's Iris Dataset — Sepal & Petal Measurements for 3 Species

150 iris flowers measured by sepal and petal length/width across Setosa, Versicolor, and Virginica species.

150 rows · 5 columns

Social Media Self-Censorship Survey (MTurk, 2022)

Survey of 50 MTurk workers on self-censorship fears, personality traits, and political orientation, showing right-leaning respondents worry 2.6x more ...

50 rows · 67 columns

Annual CO₂ Emissions by Country (1750–2024)

Annual CO₂ emissions in tonnes for 217 countries from 1750 to 2024, showing how global emissions grew from under 1B to 38.6B tonnes.

29384 rows · 4 columns

pEEG-Guided Anesthesia RCTs: Study Characteristics (2002–2025)

Extracted characteristics from 96 randomized controlled trials comparing processed EEG-guided vs. standard anesthesia monitoring across 31,662 patient...

96 rows · 34 columns

Humanized Mouse Motor Function: ASO vs Wild-Type in Parkinson's Disease Model

Motor function tests (beam cross, pole descent, adhesive removal, hindlimb score) comparing ASO and WT mice across healthy control and PD cohorts.

107 rows · 6 columns

AlphaZero-Style Training Run (177 Iterations)

Reinforcement learning training metrics tracking policy loss, value loss, game length, and MCTS agreement over 177 self-play iterations.

176 rows · 41 columns

YouTube Shorts Engagement vs. Virality (799 Videos)

Engagement rates, views per day, and growth velocity for 799 YouTube Shorts showing how engagement drops 3x as videos go viral.

799 rows · 11 columns

AlphaZero-Style Self-Play Training Metrics (177 Iterations)

Policy loss, value loss, game length, and MCTS agreement tracked over 177 self-play iterations of AlphaZero-style reinforcement learning.

176 rows · 41 columns

AlphaZero-Style Training Metrics (177 Iterations)

Self-play reinforcement learning run tracking policy loss, value loss, game length, and MCTS agreement across 177 training iterations.

176 rows · 41 columns

AlphaZero-Style Training Run: 171 Iterations of Self-Play

Policy and value network training metrics over 171 iterations, tracking loss convergence, game length, MCTS agreement, and value calibration.

171 rows · 38 columns

Catan RL Training — Implementation

171 training iterations of a Catan RL implementation. Tracks policy and value loss convergence, game length evolution, and self-play performance metri...

171 rows · 38 columns

Available .com One-Word Domains

10,000 one-word .com domains with availability status, attractiveness score, demand rating, and registrar details.

10000 rows · 8 columns

AlphaZero Training Run: Policy and Value Network Convergence

212 iterations of AlphaZero-style self-play training tracking policy/value loss, MCTS agreement, game outcomes, and value calibration.

212 rows · 38 columns

AI One-Word .ai Domain Availability (10K Domains)

Availability, demand, and registrar data for 10,000 one-word .ai domains — shorter names are dramatically more likely to be taken.

10000 rows · 8 columns