WPI Worcester Polytechnic Institute

Computer Science Department
------------------------------------------

DS504/CS586 - Big Data Analytics - Spring 2020

------------------------------------------

Home Class Info Schedule Projects
Grading Reviews Resources

------------------------------------------

Useful Resources

Where to find relevant material and resources?

    1. WPI Library contains subscriptions to most of the resources below, including free access to search services to the IEEE digital library and the ACM digital library, citation indices. You get free text-search access to these resources, including the ones below, if you are a student at WPI and you log on at a WPI machine or via a proxy. George C. Gordon Library http://www.wpi.edu/academics/library.html
    2. ACM Digital Library for ACM conferences (SIGKDD, SIGMOD, etc) and journals (ACM KDD, ACM TODS, etc). ACM Digital Library: http://dl.acm.org/dl.cfm
    3. IEEE Xplore for IEEE conferences (ICDE, ICDM) and journals (TKDE, etc): IEEE Xplore: http://ieeexplore.ieee.org/Xplore
    4. Springer and Kluwer publications: DBLP (Digital Bibliography and Library Project: http://link.springer.com
    5. Michael Ley's DBLP bibliography server for CS bibliography linking to citations (and in some cases on-line papers) for conferences and journals: DBLP (Digital Bibliography and Library Project: http://www.informatik.uni-trier.de/~ley/db/index.html
    6. Google scholar also holds some citations and papers: Google Scholar - http://scholar.google.com

What journals and conference proceedings to consider:

    1. IEEE International Conference on Big Data
    2. ACM Special Int. Group on Data Management (ACM SIGMOD)
    3. ACM Special Int. Group on Knowledge and Data Discovery (ACM SIGKDD)
    4. ACM Conferece Transactions on Database Systems (ACM TODS)
    5. IEEE International Conference on Data Mining (ICDM)
    6. IEEE Int. Conf. on Data Engineering (ICDE)
    7. Int. Conf. on Very Large Databases (VLDB)
    8. IEEE Transactions on Knowledge and Database Systems (ACM TKDE)
    9. ACM Transactions on Knowledge Discovery from Data (ACM KDD)
    10. ACM Transactions on Database Systems (ACM TODS)
    11. Data and Knowledge Engineering Journal (DKE)
    12. Information Systems Journal
    13. Int. Conf on Information and Knowledge Management (CIKM)
    14. WWW Conference
    15. ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (GIS)
    The above lists are just a subset and other good sources for big data material exist.

Subset of Sources of Data

    Below to be expanded.

    1. Urban Network Data Rutgers University, NYC Taxi Trip Data, Boston Bike Sharing System Data, Open Streat Map.
    2. The Lawrence Berkeley Laboratory has wide-area TCP trace data available. You can view it at : this location.
    3. Scientific data such as meteorological measurements are also possible. Their data rates tend to be slow (e.g., one measurement every ten minutes). See NOAA for possible source of such data. .
    4. DBES challenge cups, such as, DBES2013 live sports analytics data set.
    5. Twitter fire hose.
    6. KDD challenge cup; in fact, most conferenes release data sets once a year for experimentation.
    7. You could use data generated for the Linear Road benchmark. View Linear Road Website here.
    8. You could look at data from the temperature sensors from the CMU SensorNets project, accessible on the web at this location.
    9. At the University of Wisconsin, some online auction data from eBay (crawled in late 2001) had been collected. They are now stored at CMU. See about.txt.. The actual files are : items.dtd; ; items-snippet.xml; and items.zip;
    10. Stanford University Dataset on Complex Networks: Data

Links:

Writing Skills

    This class requires you to submit various written exercises, from critiques to reports. If you find that you need assistance, the Writing Center on campus can help. They have experienced peer tutors that can help you to organize and revise written work, if you give them ample lead time. Please take advantage of this precious resource - It is free for WPI students.



yli15 at wpi.edu