Kursus: CCS5202 Big Data Technology

Section outline

Select section Welcome to Big Data Technology Micro-credential course

Tutup Buka
Welcome to Big Data Technology Micro-credential course

Tutup semua Kembangkan semua
Welcome to Big Data Technology Micro-credential course!

In today’s data-driven world, immense streams of information are generated every second. This course is designed to equip you with the essential concepts, tools, and architectures needed to capture, store, and analyze massive datasets. Whether you are looking to enhance your current technical skills or dive into the world of data engineering, you will gain practical insights into how modern organizations turn raw data into actionable intelligence.

This intermediate-level academic micro-credential programme offers comprehensive knowledge of big data technologies, including distributed processing, scalable platforms, data analytics, and real-world applications. It is meticulously designed to equip learners with the ability to design, evaluate, and apply robust big data solutions within digital intelligence environment.

Programme Structure: Earn as You Learn!

This macro-certificate is broken down into 3 specialized Micro-certificates and a total of 11 stackable Digital Badges. Completing the modules earns you digital badges, which stack up into micro-certificates, ultimately culminating in your full Macro-Certificate.

1. Micro-certificate 1: Foundations of Scalable Big Data Systems and Distributed Intelligence

Focuses on the core architecture and fundamental processing mechanics of large-scale data.

Digital Badge 1: Big Data Concepts (Characteristics, applications, and computing concepts).

Digital Badge 2: Big Data Modelling and Management (Hadoop architecture, HDFS, and NoSQL databases).

Digital Badge 3: Big Data Processing Technology (MapReduce frameworks and workflow logic).

Digital Badge 4: Real-Time Data Processing Technology (Data streaming concepts and Apache Storm).

2. Micro-certificate 2: High-Performance Big Data Platforms, Cloud Ecosystems, and Intelligent Analytics

Focuses on high-performance frameworks, modern cloud infrastructure, and turning data into actionable insights.

Digital Badge 5: Spark Technology (Spark SQL, Streaming, Scala, and PySpark workflows).

Digital Badge 6: Cloud-Based Big Data Management Technology (Cloud, Edge, Fog, and Quantum computing paradigms).

Digital Badge 7: Big Data Visualisation Technology (Data storytelling and dashboard design using tools like Power BI/Tableau).

Digital Badge 8: Big Data Analytics Technology (Machine learning integrations and the data analytics lifecycle).

3. Micro-certificate 3: Data-Driven Applications, Digital Economy Systems, and Cybersecurity

Focuses on industry-specific domain applications and critical data protection.

Digital Badge 9: Big Data in Healthcare (Healthcare data management, applications, and ethical challenges).

Digital Badge 10: Big Data in Finance (Fintech, Blockchain, and navigating the digital economy).

Digital Badge 11: Big Data Security (Data privacy, threat identification, and security tools).

The course balances 30 hours of guided virtual engagement (short lecture videos, podcast videos, live demo simulations, and synchronous breakout room discussions) with 90 hours of self-paced learning (hands-on lab work, case study reports, and mini-project development).

Assessments are completely coursework-based and practical ranging from interactive quizzes and visualization assignments to real-world architectural design tasks ensuring you build a portfolio ready for the digital industry!
- Select activity Announcements
  
  Announcements Forum

Select section Micro-certificate 2: High-Performance Big Data Platforms, Cloud Ecosystems, and Intelligent Analytics

Tutup Buka
Micro-certificate 2: High-Performance Big Data Platforms, Cloud Ecosystems, and Intelligent Analytics
- Tutup Buka
  Digital Badge 1: Spark Technology
  
  Apache Spark is an open-source, distributed data processing engine designed to handle massive datasets with incredible speed.
  
  If big data is a mountain of raw materials, Spark is the high-speed factory that processes it into finished goods. It is currently the industry standard for big data analytics, machine learning, and real-time data processing.
  
  Select activity Spark Technology (infographics)
  
  Spark Technology (infographics) Interactive Content
  
  The provided text offers a comprehensive technical summary of Apache Spark, an open-source framework designed for high-speed distributed computing. It highlights how the platform utilizes in-memory processing and parallel execution to overcome the performance limitations found in traditional disk-based systems.
  
  Select activity Real-time Data Processing (slides)
  
  Real-time Data Processing (slides) File
  
  Apache Spark is a robust open-source framework engineered to facilitate large-scale data processing through a distributed computing model. By utilizing in-memory computation, the system achieves significantly faster speeds than traditional disk-based methods, making it ideal for real-time analytics and complex machine learning tasks. The platform’s architecture relies on a driver program and executor nodes to manage tasks across vast clusters, ensuring both scalability and fault tolerance. Its diverse ecosystem includes specialized components for SQL queries, stream processing, and graph analysis, supporting multiple programming languages like Python and Scala.
- Tutup Buka
  Digital Badge 2: Cloud-Based Big Data Management Technology
  
  Cloud-Based Big Data Management Technology is the practice of storing, cleaning, organizing, and analyzing massive datasets using remote servers hosted on the internet, rather than using physical, on-premise servers located inside a company's own data center.
- Select activity Cloud-Based Big Data Management Technology video
  
  Cloud-Based Big Data Management Technology video Interactive Content
  
  The provided video outlines five distinct paradigms of computational architecture, detailing how each handles data processing and storage. Cloud computing serves as a centralized hub for scalable, remote resources, while edge and fog computing decentralize this power by moving tasks closer to the data source to minimize delay. In contrast, High-Performance Computing (HPC) utilizes massive clusters of traditional hardware to solve labor-intensive mathematical problems through brute-force parallel processing. The most radical shift is quantum computing, which leverages the unique principles of subatomic physics to perform calculations that are impossible for standard computers.
- Tutup Buka
  Digital Badge 3: Big Data Visualisation Technology
  
  Select activity Big Data Visualisation Technology (slides)
  
  Big Data Visualisation Technology (slides) File
  
  Here we explore the essential role of Big Data Visualization in converting overwhelming, high-speed datasets into actionable graphical formats. It emphasizes that traditional charting methods must be replaced by advanced techniques like heatmaps, treemaps, and data binning to prevent visual clutter and system crashes. The source highlights the importance of interactive design, allowing users to start with a broad summary before drilling down into specific details.
- Tutup Buka
  Digital Badge 4: Big Data Analytics Technology
  
  Big Data Analytics is the complex process of examining massive, fast-moving, and varied datasets to uncover hidden patterns, correlations, market trends, and customer preferences.
  
  Essentially, it is the science of turning raw, overwhelming "noise" into actionable intelligence that organizations can use to make better, faster business decisions.
  
  Select activity Big Data Analytics video tutorial
  
  Big Data Analytics video tutorial Interactive Content
  
  This video provides a comprehensive overview of Big Data Analytics, detailing how organizations transform massive volumes of complex information into actionable intelligence.
Select section Micro-certificate 3: Data-Driven Applications, Digital Economy Systems, and Cybersecurity

Tutup Buka
Micro-certificate 3: Data-Driven Applications, Digital Economy Systems, and Cybersecurity
This section focuses on building things with that data (Applications), understanding the broader marketplace where it operates (Digital Economy), and defending it all from attacks (Cybersecurity).
- Tutup Buka
  Assessment for Data-driven Applications, Digital Economy Systems and Cybersecurity
  
  Select activity Assessment Quiz
  
  Assessment Quiz
- Tutup Buka
  Digital Badge 1: Big Data in Healthcare
  
  Big Data in Healthcare refers to the massive volume of health-related information collected from various sources such as electronic health records (EHRs), medical imaging, genomic data, wearable devices, and patient feedback that can be analyzed computationally to reveal patterns, trends, and associations, especially relating to human health and disease.
  
  Select activity Big Data in Healthcare (slides)
  
  Big Data in Healthcare (slides) File
  
  Modern healthcare is being transformed by Big Data, which involves managing massive and complex sets of medical information through the lenses of volume, velocity, variety, and veracity. To process this information, institutions utilize advanced tools like cloud computing, artificial intelligence, and interoperability standards to bridge the gap between different software systems.
- Tutup Buka
  Digital Badge 2: Big Data in Finance
  
  Select activity Big Data in Finance (slides)
  
  Big Data in Finance (slides) File
  
  Modern financial institutions leverage Big Data and machine learning to transform traditional banking into a high-speed, intelligent ecosystem. By processing diverse information from market feeds and consumer behavior, organizations can detect fraudulent activities and manage complex financial risks in real time.
- Tutup Buka
  Digital Badge 3: Big Data Security
  
  This section provides the knowledge about the integration of big data analytics and artificial intelligence within the modern cybersecurity landscape to combat sophisticated digital threats. These technologies allow organizations to process massive datasets from diverse sources, enabling real-time monitoring, anomaly detection, and automated incident responses.
  
  Select activity Big Data Security slides (pt 1)
  
  Big Data Security slides (pt 1) File
  
  Select activity Big Data Security slides (pt 2)
  
  Big Data Security slides (pt 2) File

Section outline

Welcome to Big Data Technology Micro-credential course!