Data engineering is about developing the software (and hardware) infrastructure to support data science. This unit introduces software tools and techniques for data engineering, but not hardware. It will cover an introduction to big data processing, covering volume, variety, and velocity; large volume data processing using parallel technologies; variety data formats, including unstructured and semi-structured data, using NoSQL databases; and velocity data processing, covering data streaming.
Minimum total expected workload to achieve the learning outcomes for this unit is 144 hours per semester typically comprising a mixture of scheduled online and face to face learning activities and independent study. Independent study may include associated reading and preparation for scheduled teaching activities.
apply modern big data processing tools for various data structures;
solve complex big data streaming problems.
identify big data concepts and technologies;
design and develop parallel database processing algorithms and methods;
explain a variety of data formats in big data;
Big Data Velocity Innovation
Big Data Volume, Variety and Velocity
Big Data Volume Coding
Big Data Volume Coding
Big Data Volume, Variety and Velocity
