Skip to content

Yahoo Releases Largest-ever Web Browser Data Set to Researchers

Yahoo! data set

Recently, Yahoo, Inc. released the largest-ever web browser data set to university researchers. With 13.5 terabytes of anonymized information on the browsing and reading habits of approximately 20 million users, the move benefits both academic researchers and the struggling technology company.

According to Yahoo, the data dump is a way to “promote independent research in the fields of large-scale machine learning and recommender systems, and to help level the playing field between industrial and academic research.”

But it also could result in attracting more researchers to the field of artificial intelligence—specifically in machine learning and deep learning. These areas of focus involve “training machines to mine massive data sets so they can respond to complex queries or make predictions,” as the Wall Street Journal points out.

Those interested in accessing the Yahoo News Feed dataset, may do so through Yahoo Webscope.

The University of Tennessee, Knoxville maintains a list of public use data sets for researchers. Investigators can request a dataset be added to the Institutional Review Board’s list by filling out and submitting a nomination form.

The flagship campus of the University of Tennessee System and partner in the Tennessee Transfer Pathway.