Princeton
University
COS IW01 Fall 2015:
Analyzing
relationship networks: social
networks and beyond
Resources for Projects
This page contains a
selection of resources from the COS435
list of data and software resources for projects.
The selection contains those that are most
applicable to this seminar. You are
welcome to use other resources on the larger list or find some
on your own.
Data Sets
- 4
universities data set: from CMU. CS
department Web pages from various universities, hand-classified
into 7 categories.
- Facebook social graph data from the
Online
Social Networks Project at UC Irvine.
(No experience with these datasets.)
- Two sources of the social network for a sample of Twitter:
-
data for "What is Twitter, a Social Network or a News Media?"
by Kwak, Lee, Park, and Moon, Inter. World Wide Web (WWW)
Conf., 2010. See
http://an.kaist.ac.kr/traces/WWW2010.html
- data for "
Measuring User Influence in Twitter: The Million Follower
Fallacy by Cha, Haddadi, Benevenuto, and Gummadi, Inter.
AAAI Conf. on Weblogs and Social Media (ICWSM), 2010.
See http://twitter.mpi-sws.org/
Software
-
Tweepy: "
A Python library for accessing the Twitter API."
-
HTMLAsText
utility "converts HTML documents to simple text files, by
removing all HTML tags and formatting the text according to your
preferences." (copyright Nir Sofer).
-
Stanford
Network Analysis Platform (SNAP) From the site: "
a general purpose, high performance system for analysis and
manipulation of large networks." The core of SNAP is
written in C++ . There is a Python interface called Snap.py
- GraphStream A
Dynamic Graph Library
- Gephi: an open-source
network visualization and analysis platform. This is a
volunteer effort, but students have been pleased with it
lists last updated Fri
Jul 10 15:56:26 EDT 2015
Copyright
2008-2015
Andrea S. LaPaugh