Freebase

Data Information

Freebase is a large collaborative knowledge base whose data is contributed mainly by its community members. It is an online collection of structured data harvested from sources such as Wikipedia, NNDB, FMD, and MusicBrainz, together with wiki-style contributions submitted by individual users.
The structured data is licensed under the Creative Commons Attribution License, and a JSON-based HTTP API lets programmers build applications on any platform that use the Freebase data.

We provide two sampled datasets extracted from Freebase.

freebase_music:
A sampled RDF dataset of (subject entity, relation, object entity) triples from Freebase whose entries are related to music.

freebase_sampled:
A sampled RDF dataset of (subject entity, relation, object entity) triples from Freebase whose entries are related to music, books, TV shows, film, people, and sports.

Data Statistics
             freebase_music                   freebase_sampled
Mode         3                                3
Dimension    23,344,784 × 166 × 23,344,784    38,955,429 × 532 × 38,955,429
Nonzero      99,546,551                       139,920,771
Format       <Subject entity> <Relation> <Object entity>
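
To make the Format and Dimension entries concrete, here is a minimal Python sketch of how such triples could be turned into coordinates of a 3-mode sparse tensor. It rests on assumptions not stated on this page: that each line holds exactly three whitespace-separated tokens, and that subjects and objects share one entity index (suggested by the equal first and third dimensions). The names index_triples and density are hypothetical helpers, not part of the dataset or of HaTen2.

```python
from typing import Dict, Iterable, List, Tuple


def index_triples(lines: Iterable[str]):
    """Map (subject, relation, object) tokens to integer tensor coordinates.

    Assumption: one triple per line, three whitespace-separated tokens.
    Subjects and objects share a single entity index (an assumption based
    on the first and third tensor dimensions being equal).
    """
    entity_ids: Dict[str, int] = {}
    relation_ids: Dict[str, int] = {}
    coords: List[Tuple[int, int, int]] = []
    for line in lines:
        parts = line.split()
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        subj, rel, obj = parts
        # setdefault assigns the next free index the first time a token is seen
        s = entity_ids.setdefault(subj, len(entity_ids))
        r = relation_ids.setdefault(rel, len(relation_ids))
        o = entity_ids.setdefault(obj, len(entity_ids))
        coords.append((s, r, o))
    return coords, entity_ids, relation_ids


def density(nonzeros: int, dims: Tuple[int, int, int]) -> float:
    """Fraction of tensor cells that are nonzero."""
    return nonzeros / (dims[0] * dims[1] * dims[2])
```

Calling density(99_546_551, (23_344_784, 166, 23_344_784)) with the freebase_music figures listed above gives a value on the order of 1e-9, which is why a coordinate list rather than a dense array is the natural representation here.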

Source

www.freebase.com

Citation


@inproceedings{haten2_ICDE2015,
  title={HaTen2: Billion-scale Tensor Decompositions},
  author={Inah Jeon and Evangelos E. Papalexakis and U Kang and Christos Faloutsos},
  booktitle={IEEE International Conference on Data Engineering (ICDE)},
  year={2015},
}

Files

File                       Description
freebase_music.tar.gz      Freebase tensor restricted to music-related entries
freebase_sampled.tar.gz    Sampled Freebase tensor
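
Because the archives hold on the order of a hundred million triples, it may be preferable to stream them rather than unpack them to disk first. The sketch below uses Python's standard tarfile module. The internal layout of the archives is not documented here, so the function simply reads every regular file it finds and assumes UTF-8 text; iter_archive_lines is a hypothetical helper name.

```python
import tarfile
from typing import Iterator


def iter_archive_lines(path: str) -> Iterator[str]:
    """Yield text lines from every regular file inside a .tar.gz archive.

    Assumption: members are plain text; undecodable bytes are replaced
    rather than raising an error.
    """
    with tarfile.open(path, mode="r:gz") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories, links, and other non-file entries
            handle = archive.extractfile(member)
            if handle is None:
                continue
            for raw in handle:
                yield raw.decode("utf-8", errors="replace").rstrip("\r\n")
```

For example, iter_archive_lines("freebase_music.tar.gz") yields one string per line, which can then be split into subject, relation, and object tokens.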