The entire universe can be measured with a single step.
It can be contained within an atom. Know the measure of that knowledge.
– Philosopher Vethathiri Maharishi.

Course Content

The main objective of “Big Data Intelligence” is to understand all of us better in order to predict the future. Be it 4 billion Google queries a day or 1 billion Facebook users, we need smarter AI algorithms to learn and connect the dots from the ocean of data. With massive parallelism and the Map-Reduce technique, millions of servers take us one step closer to Turing’s “intelligent machine”. Near-AI success stories include Google, Facebook, Twitter, YouTube and Amazon. Let’s begin our journey to understand the basic data operations and mining techniques involved in extracting big data intelligence.

A dream (Memex) in 1945 led to the WWW and web-scale data.

Day        Topics
Day 1 FN   BDI: The Beginning
           DFS and Map-Reduce
           PageRank algorithm
           BDI Tools Landscape
Day 1 AN   Scala Basics for Map-Reduce apps
           Demo
Day 2 FN   Spark projects using Scala
           More fun with Scala
Day 2 AN   Recommendation systems
           (TF-IDF, Jaccard, Cosine and Collaborative filtering)
Day 3 FN   Finding similar items
           (Shingling, Minhashing and Locality Sensitive Hashing)
           Tweet sentiment classification using Naïve Bayes
Day 3 AN   Distributed Graph (Pregel)
           Dremel and BigQuery

Textbook: Jeffrey D. Ullman, Mining of Massive Datasets

Download the PDF version of the course content.

To access the links to the password-protected slide decks, please contact Prof. Ashok (+91-9943900101, ashok@zettab.com).

Slide Decks

  1. Big Data Intelligence: The Beginning [View]
    [Video] 1a. What is Big Data Intelligence
  2. DFS and Map-Reduce
  3. PageRank algorithm
  4. BDI Tools Landscape
  5. Scala Basics for Map-Reduce applications
  6. Spark Projects using Scala
  7. Recommendation systems
    (TF-IDF, Jaccard, Cosine and Collaborative filtering)
  8. Finding similar items
    (Shingling, Minhashing and Locality Sensitive Hashing)
  9. Tweet sentiment analytics
    (Naïve Bayes Classifier)
  10. Distributed Graph Processing using Pregel
  11. Dremel and BigQuery

Remote Internship Projects

After the course, I help selected mentees find and complete remote internship projects. If your institute is interested in setting up an exclusive Big Data Intelligence Lab under my supervision, please contact me to discuss it (+91-9943900101, ashok@zettab.com).