Our Story
Common Crawl is a 501(c)(3) non-profit founded in 2007, dedicated to making wholesale extraction, transformation, and analysis of open web data accessible to researchers.
EIN: 26-1635908 · Beverly Hills, CA
In 2014, Common Crawl added 1.3 petabytes of data, comprising 18.5 billion web pages. Its corpus has been cited in over 10,000 research papers.
Donations to Common Crawl fund the expansion of its data corpus and its educational programs.
Your contribution, whether in cryptocurrency or cash, empowers researchers and technologists to access vital data, fostering innovation and education. Join the mission to enhance data accessibility!