Department of Computer Engineering (2019 course) Course Name & Code: Data Science and Big Data Analytics Laboratory (310256) Group A: Data Science 1. Data Wrangling I 2. Data Wrangling II 3. Descriptive Statistics - Measures of Central Tendency and variability 4. Data Analytics I 5. Data Analytics II 6. Data Analytics III 7. Text Analytics 8. Data Visualization I 9. Data Visualization II 10. Data Visualization III Group B: Big Data Analytics – JAVA/SCALA 1. Write a code in JAVA for a simple Word Count application that counts the number of occurrences of each word in a given input set using the Hadoop MapReduce framework on local-standalone set-up. 2. Design a distributed application using MapReduce which processes a log file of a system. 3. Locate dataset (e.g., sample_weather.txt) for working on weather data which reads the text input files an...
Posts
Showing posts from January, 2025