Cloud Computing and Big Data Study Group

Group Meeting Information

Time: Every Wednesday morning, 10:00-12:00
Location: Conference room, 4th floor, Computing Center

Group Meeting Record
Date Papers Presenter
2013/1/10 Introduction to research directions and vacation assignments
slides
张岩峰
2013/1/18 MapReduce: Simplified Data Processing on Large Clusters [OSDI 2004]
slides
李士炜
2013/1/24 Google File System [SOSP 2003]
slides
李晨
2013/1/25 PageRank
slides
王春磊
2013/3/8 Vacation assignments
常栋_slides 王春磊_slides 李晨_slides 韩雪_slides
常栋, 王春磊, 李晨, 韩雪, 王强
2013/3/15 Pregel: A System for Large-Scale Graph Processing  [SIGMOD 2010]
slides
Bigtable: A Distributed Storage System for Structured Data [OSDI 2006]
slides
常栋

王强
2013/3/22 Themis: An I/O-Efficient MapReduce [SOCC 2012]
slides
Balancing Reducer Skew in MapReduce Workloads using Progressive Sampling [SOCC 2012]
slides
魏彦婧

刘芳
2013/3/29 Review Spam Detection via Temporal Pattern Discovery [KDD 2012]
slides
Sailfish: A Framework For Large Scale Data Processing [SOCC 2012]
slides
李晨

李士炜
2013/4/12 Graph Storage 林涵
2013/4/19 Delta-SimRank Computing on MapReduce [BigMine 2012]
slides
王春磊
2013/4/26 Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud [VLDB 2012]
slides
常栋
2013/5/3 Pig Latin: A Not-So-Foreign Language for Data Processing [SIGMOD 2008]
slides
王强
2013/5/10 Omega: flexible, scalable schedulers for large compute clusters [EuroSys 2013]
slides
刘芳
2013/5/17 Lifetime Management of Flash-Based SSDs Using Recovery-Aware Dynamic Throttling [USENIX ATC 2013]
slides
林涵
2013/5/31 YSmart: Yet Another SQL-to-MapReduce Translator [ICDCS 2011]
slides
魏彦婧
2013/6/7 Undergraduate thesis project pre-defense 王春磊, 常栋, 林涵
2013/6/21 MRPGA: An Extension of MapReduce for Parallelizing Genetic Algorithms [eScience 2008]
slides
李晨
2013/6/28 Twister: A Runtime for Iterative MapReduce [MAPREDUCE 2010]
slides
李士炜
2013/7/5 Accelerate Large-Scale Iterative Computation through Asynchronous Accumulative Updates [ScienceCloud 2012]
slides
王春磊
2013/9/6 VLDB Summer School summary
slides
林涵, 王春磊
2013/9/15 Spanner: Google’s Globally-Distributed Database [OSDI 2012]
slides
李晨
2013/9/22 Hystor: Making the Best Use of Solid State Drives in High Performance Storage Systems [ICS 2011]
slides
常栋
2013/9/28 Write Policies for Host-side Flash Caches [FAST 2013]
slides
韩冰
2013/10/13 SILT: A Memory-Efficient, High-Performance Key-Value Store [SOSP 2011]
slides
王春磊
2013/10/20 RCFile: A Fast and Space-efficient Data Placement Structure in MapReduce-based Warehouse Systems [ICDE 2011]
slides
林涵
2013/10/27 Distributed Matrix Factorization with MapReduce using a series of Broadcast-Joins [RecSys 2013]
slides
李士炜
2013/11/3 Clustering by Passing Messages Between Data Points [Science 2007]
slides
魏彦婧
2013/11/6 SkewTune: Mitigating Skew in MapReduce Applications [SIGMOD 2012]
slides
王强
2013/11/13 Bi-Hadoop: Extending Hadoop To Improve Support For Binary-Input Applications [CCGRID 2013]
slides
李晨
2013/11/20 FlashStore: High Throughput Persistent Key-Value Store [VLDB 2010]
slides
常栋
2013/11/27 MadLINQ: Large-Scale Distributed Matrix Computation for the Cloud [EuroSys 2012]
slides
李士炜
2013/12/04 Asyn-SimRank: An Asynchronous Large-Scale SimRank Algorithm
slides
王春磊
2013/12/11 Presto: Distributed Machine Learning and Graph Processing with Sparse Matrices [EuroSys 2013]
slides
林涵
2013/12/18 Naiad: A Timely Dataflow System [SOSP 2013]
slides
王强
2013/12/25 From "Think Like a Vertex" to "Think Like a Graph" [VLDB 2014]
slides
韩冰
2014/01/02 K-AP: Generating Specified K Clusters by Efficient Affinity Propagation [ICDM 2010]
slides
魏彦婧
2014/01/08 Paper progress report

李晨
2014/01/15 TAO: Facebook’s Distributed Data Store for the Social Graph [USENIX ATC 2013]
slides
常栋
2014/01/20 Fast Top-K Path-based Relevance Query on Massive Graphs [ICDE 2014]
slides
王春磊
2014/2/21 Minimal MapReduce Algorithms [SIGMOD 2013]
slides
林涵
2014/2/28 Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs [VLDB 2011]
slides
韩冰
2014/3/7 Hierarchical Affinity Propagation 
slides
魏彦婧
2014/3/14 Optimizing Graph Algorithms on Pregel-like Systems
slides
王强
2014/3/21 Copysets: Reducing the Frequency of Data Loss in Cloud Storage 
slides
李晨
2014/3/28 Analysis of HDFS Under HBase: A Facebook Messages Case Study 
slides
常栋
2014/4/3 Comparison of Hadoop, Spark, Pregel, and GraphLab, and the principles of GraphChi
slides
王春磊
2014/4/11 TurboGraph: A Fast Parallel Graph Engine Handling Billion-scale Graphs in a Single PC [KDD 2013]
slides
林涵
2014/4/18 Beyond DCG: User Behavior as a Predictor of a Successful Search [WSDM 2010]
slides
李帅
2014/4/25 GraphX: Unifying Data-Parallel and Graph-Parallel Analytics [Berkeley TR 2014]
slides
魏彦婧
2014/5/4 Scaling Memcache at Facebook [NSDI 2013]
slides
王强
2014/5/9 Second-year graduate student pre-defense

王强 李晨 魏彦婧
2014/5/16 Gelling, and Melting, Large Graphs by Edge Manipulation [CIKM 2012]
slides
韩冰
2014/5/24 Beliefs and Biases in Web Search [SIGIR 2013]
slides
黄云波
2014/5/31 A Dynamic Caching Mechanism for Hadoop using Memcached [2012]
slides
常栋
2014/6/6 A Dynamic Caching Mechanism for Hadoop using Memcached [2012]
slides
王春磊

Recommended Reading List

Introductory

New papers

Big Data Programming

Cloud Data Storage

Cloud Resource Management/Load Balancing

Privacy and Security

DBs in the cloud and moving stuff