class: center, middle, inverse, title-slide .title[ # BAA1030 - Data Analytics and Story Telling ] .subtitle[ ## Lecture 1: Introduction ] .author[ ### Damien Dupré - Dublin City University ] --- # Module Contact Details .pull-left[ ## Damien Dupré, PhD - email: damien.dupre@dcu.ie - phone: 00353 (0)1 700 6360 - office: Q233 DCU Business School ] .pull-right[ <img src="https://pbs.twimg.com/profile_images/1221820283159490565/96a3XnSg_400x400.jpg" width="60%" style="display: block; margin: auto;" /> ] --- # About Me #### Developement of the DynEmo Facial Expression Database (Master) * Dynamic and spontaneous emotions * Assessed with self-reports and by observers #### Analysis of Emotional User Experience of Innovative Tech. (Industrial PhD) * Understand users' acceptance of technologies from their emotional response * Based on multivariate self-reports #### Evaluation of Emotions from Facial and Physiological Measures (Industrial PostDoc) * Applications to marketing, sports and automotive industries * Dynamic changes with trend extraction techniques (2 patents) #### Performance Prediction using Machine Learning (Academic PostDoc) * Application to sport analytics * Big Data treatment (> 1 million users with activities recorded in the past 5 years) --- # Module Content ## Knowledge - Data Organisation - Data Analytics Pipeline - Data Cleaning - Professional Visualisation Outputs - Academic Statistical Analyses - First Steps in Programming Languages ## Skills - EXCEL - PowerBI/Google Looker Studio - TABLEAU - R & RStudio or Python & Jupyter The lecture will alternate between **theory sessions** and **practice sessions**. --- # Module Content ## Targets - **Short Term**: Success in BAA1030 assignments and winning the EDHEC [Data Viz Challenge](https://www.tableau.com/academic/edhec-business-school-starts-data-viz-challenge-students-across-europe) - **Mid Term**: Success in all other DCU lectures (academic data analytics style and conventions) - **Long Term**: Professional data analytic expertise (CV optimised with essential data analytics skills) ## Professional Requirements
<i class="fas fa-bomb faa-burst animated faa-slow " style=" color:blue;"></i>
Data Analytics is the best, easiest and highly paid job you can obtain with your master, see by yourself: - [MSc Management Business – Deep Dive with Cathal Dunne, AIB, Data Quality Analyst](https://business.dcu.ie/msc-management-business-deep-dive-with-cathal-dunne-aib-data-quality-analyst/) - [Let's ask to ChatGPT](https://chat.openai.com/chat) - [Data Analyst Positions on Indeed.com](https://ie.indeed.com/jobs?q="data analyst"&l=Dublin%2C County Dublin) --- # Module Content <img src="https://c.tenor.com/DIWxMkR9kyUAAAAC/fortune-and-glory-indiana-jones.gif" width="100%" style="display: block; margin: auto;" /> --- # Module Organisation ### Lecture Every Week - Wednesday, between 11am and 1pm - Recordings on-demand and not shareable ### Practice at Home - Practice with the technologies and methods shown during the lectures - Essential for the assignments ### Optional but Strongly Adviced Online Trainings - Free and Short Courses Online (Kubicle, Udemy + other learning platforms) --- # Assessment Structure ### Assignment 1. Tableau Online Dashboard (40%) > Deadline 23/03/2025 > Design a professional looking data dashboard aiming to communicate a narrative (also called storytelling) ### Assignment 2. Quarto Report (50%) > Deadline 13/04/2025 (Only 3 weeks after Assignment 1) > Using R and RStudio, reproduce the results presented in the tableau dashboard with a Quarto report ### Kubicle Online Courses (10%) > Deadline 01/05/2025 --- # Assessment Structure ###
<i class="fas fa-exclamation-triangle faa-flash animated faa-slow " style=" color:red;"></i>
Warning - Each assignment's instruction will be uploaded on loop after the deadline of the previous assignment. - Please submit professional outputs, as I'm not expecting student's approximative work from you any more. - Please avoid any form of plagiarism, copy-pasting or group collaborations. I will easily spot rephrased sentences, similar figure design and code sharing. --- # Kubicle Courses #### Excel Essentials - [Data Manipulation and Formatting](https://app.kubicle.com/courses/data-manipulation-and-formatting) (150 Minutes) - [Formulas and Functions](https://app.kubicle.com/courses/formulas-and-functions) (210 Minutes) #### Data Analysis - [Lookups and Database Functions](https://app.kubicle.com/courses/lookups-and-database-functions) (150 Minutes) - [Pivot Tables](https://app.kubicle.com/courses/pivot-tables) (150 Minutes) #### Data Presentation Fundamentals - [Communicating Data Effectively](https://app.kubicle.com/courses/communicating-data-effectively) (90 Minutes) - [Telling Stories with Data](https://app.kubicle.com/courses/telling-stories-with-data) (60 Minutes) - [Presenting Your Data](https://app.kubicle.com/courses/presenting-your-data) (90 Minutes) #### Visualization Fundamentals - [Visual Data Thinking](https://app.kubicle.com/courses/visual-data-thinking) (60 Minutes) - [Applying Visual Data Skills](https://app.kubicle.com/courses/applying-visual-data-skills) (60 Minutes) - [Visualization in Practice](https://app.kubicle.com/courses/visualization-in-practice) (30 Minutes) --- class: inverse, mline, center, middle # Data Analytics and Big Data --- # What are Big Data? ### Definition The term Big Data corresponds to a table containing observations (i.e. database or dataset) that is **too long, too large or too complex to be handled by conventional tools** ### Microsoft Excel's Limits (v16.77 - Office 365): - Total number of rows: **1,048,576 rows** - Total number columns: **16,384 columns** .pull-left[ <img src="https://qph.fs.quoracdn.net/main-qimg-e4072574da0c7a785bc4b138b694189f" width="60%" style="display: block; margin: auto;" /> ] .pull-right[ *Have you ever tried to scroll down to the end of Excel? Because I did!* ] --- # Data vs. Information Without data, an organization could not successfully complete most business activities. However, organisations need to convert these data into meaningful information - Data consists of raw facts - Information is often confused with the term data ### Example: Sales Manager - Knowing number of sales for each representative (fact – data) - Knowing total monthly sales (transformed – information)
--- # Value of Information - **Goals**: Helps decision makers achieve organisational goals - **Performance**: Valuable information helps people and organisations perform - **Accuracy**: Inaccurate/Incomplete information leads to Poor Decisions and can result in High Cost for the organisation -- ### Data Analytics - The science of using data to build models that lead to better decisions that in turn add value to individuals, companies and institutions - The analysis of data, typically large sets of data, by the use of mathematics, statistics, and computer software <div class="figure" style="text-align: center">
<p class="caption">Data analytics provides an integrated view of business performance.</p> </div> --- # Styles of Data Analytics It uses a combination of historical information about past transactions or events and reference data about, for example, customers or products, to enable a wide variety of analyses and decision support techniques. - **Standard Reports:** Preformatted information for predefined backward-looking analysis. - **Academic Reports:** Application of research methods to business information using descriptive and inferential statistics. - **Dashboards:** Business performance metrics using specific variables presented in a tabular or graphical format. - **Alerts:** Communication to designated business people when a key business variable is outside a predefined performance standard or range. - **Predictive Analytics:** Application of historical business information to predict future the performance and potentially prescribe a favoured course of action. --- class: inverse, mline, center, middle # Data Analytics and Storytelling --- # Role of Data Storytelling > **Stories** are how we translate core, essential **content**<br>to different **forms**<br>for specific **audiences**. -- ### Purpose of stories: Visual communication plays an important role in a visual analytics process. No matter how advanced and sophisticated data visualisation techniques are used, if we failed to tell a compelling story by using the data visualisation designed, all the hard work and efforts will be wasted. -- ### Motivation: .pull-left[ **Exploratory analysis**: - exploring and understanding the data, conducting the analysis ] .pull-right[ **Explanatory analysis**: - explaining your findings from your analysis in a coherent narrative that leads to a call to action ] --- # Build your Story > When adding texts or visualisations, ask yourself: "Does this element support the point I want to make about the data?" -- ### Guiding Your Viewer Another way we can guide people through our visualization is by using **annotations**, which can be very helpful in guiding someone through our figure. However, think about only labelling the data that matters. -- ### Use your titles/captions! - Titles can guide people to the point of your figure - Primes people to know what to look for - "If there is a conclusion you want your audience to reach, state it in words" -- ### References - [Fundamentals of Data Visualization](https://clauswilke.com/dataviz/) - [Hands-On Data Visualization: Interactive Storytelling from Spreadsheets to Code](https://handsondataviz.org/) --- class: inverse, mline, left, middle <img class="circle" src="https://github.com/damien-dupre.png" width="250px"/> # Thanks for your attention and don't hesitate to ask if you have any question! [
@damien_dupre](http://twitter.com/damien_dupre) [
@damien-dupre](http://github.com/damien-dupre) [
damien-dupre.github.io](https://damien-dupre.github.io) [
damien.dupre@dcu.ie](mailto:damien.dupre@dcu.ie)