lOMoARcPSD| 23136115
Interacve features play a crucial role in visual analysis of big datasets, helping users to
derive insights from complex and voluminous data. Two of the most important interacve
features are:
1. Dynamic Filtering and Selecon:
Descripon:
Dynamic ltering and selecon allow users
ly modify what data is visible in a
based on certain criteria or selecons. This
iders, checkboxes, dropdown menus, or brush
cng ranges of data directly on the
tself. For example, in a large dataset
les data over several years, dynamic ltering a user to
focus on data from a specic year,
duct category.
This feature is crical because it enables down into large
datasets and focus on subsets rcular interest, making it
easier to uncover
nds, and outliers. It supports exploratory data lowing users
to form and test hypotheses a, iteravely rening their
quesons based on dback the data provides.
to interacve
visualizaon can
involve sl tools
for sele
visualizaon i
containing sa
could enable
region, or pro
users to drill that
are of pa
paerns, tre
analysis by al
about the dat
the visual fee
Importance:
Descripon:
Interacve dashboards and linked
visualizaons refer to the coordinated use of mulple data visualizaons
that update in response to user interacons. When a user interacts with
one visualizaon, such as selecng a segment of a pie chart or a range of
values in a histogram, other visualizaons on the dashboard update to
reect this selecon, showing only data related to the
This feature is crucial for analyzing big
datasets because it provides a holisc view of the data from mulple
perspecves simultaneously. It allows for cross-ltering, where interacng
with one element of the dashboard dynamically inuences the content of
others, facilitang a deeper understanding of the relaonships and
correlaons within the data. This mul-faceted approach to data
exploraon helps in idenfying paerns and insights that might not be
apparent from a single visualizaon, making it an essenal tool in the
analysis of complex
user's interacon.
datasets.
Importance:
lOMoARcPSD| 23136115
These features are integral to the visual analysis of big datasets because they empower
users to navigate and make sense of the complexity and scale of the data. By enabling
interacve exploraon, users can more eecvely idenfy relevant paerns, trends, and
insights, leading to more informed decision-making and analysis.
At university X, the late graduaon rate is high meaning that most students cannot nish
the program with the designed meline of the curriculum.
The reasons could be that:
- Students cannot pass courses aer nish aempt,
- Students did not register for courses in the designated semester,
- Students intenonally lengthen the leaning path by registering for fewer courses
than the suggested courses in the curriculum,
- Or the reasons
Hence, people would like to analyze the learning progress of students to make early
warnings to students and the dean of the school regarding the potenal late graduaon
problem.
- The data for the learning progress of student is grouped by enrolment batch.
- Each patch has an enrolment year and a major i.e. Batch 2020-Computer Science,
Batch 2021-Data Science, and Batch 2019-Informaon Technology
- Each batch has it curriculum inclosing a list of courses students of the batch should
take each semester like the following table:
+ Batch – Semester – Course
+ 2020-CS – 1 - Introducon to compung (3 credits), C++ Programing (4 credits)
+ 2020-CS – 2 – OOP (4 credits), CN (4 credits)
- Each bath has a summarized of grades of students by semester like the following
table:
+ Batch – Semester/Year – Course – Number of registraons/Total students of the
batch Number of students who passed the course (score > 50) + 2020-CS
Sem1/2020 – Introducon to Compung – 60/60 – 60
+ 2020-CS – Sem1/2020 – C/C++ Programming – 50/60 – 40
+ 2020-CS – Sem1/2020 – C/C++ Programming – 5/60 – 5
+ 2020-CS – Sem2/2020 – CN – 60/60 – 50
- In the table above, row 2 shows that 50 students of 60 students of batch
2020Computer Science registered for “C/C++ Programming in semester 1 of
2020.
Add only 40 of those 50 students passed the course.
- Row 3 show that in semester 2 of 2020, 5 students of batch 2020-Computer
Science registerd for “C/C++ Programmingand all those 5 students passed the
course?
lOMoARcPSD| 23136115
Your tasks are to design chart(s) to analyze the learning progress of students and
highlight potenal late graduaon rate.
a) (10pts) State the problem described above in terms of abstract task in
visualizaon.
b) (10pts) What are types of data? What are the marks and channels to use? Draw
the sample chart(s) to illustrates your design.
c) (10pts) Explain and describe how your chart(s) could help people detect the
potenal high late graduaon r
Interacve features play a crucial role in visual analysis of big datasets, helping users to derive insights
from complex and voluminous data. Two of the most important interacve features are:
1.
Dynamic Filtering and Selecon:
Descripon:
Dynamic ltering and selecon allow users to modify what data is
visible in a visualizaon based on
a or selecons. This can involve sliders, checkboxes,
enus, or brush tools for selecng ranges of data
e visualizaon itself. For example, in a large dataset
les data over several years, dynamic ltering could to focus on data
from a specic year, region, or
gory.
This feature is crical because it enables users to drill ge datasets
and focus on subsets that are of parcular ing it easier to uncover
paerns, trends, and outliers.
xploratory data analysis by allowing users to form and es about the
data, iteravely rening their quesons visual feedback the data
provides.
interacvely
certain criteri
dropdown m
directly on th
containing sa
enable a user
product cate
down into lar
interest, mak It
supports e test
hypothes based
on the
Importance:
2.
Interacve Dashboards and Linked Visualizaons:
Descripon:
Interacve dashboards and linked visualizaons refer
to the coordinated use of mulple data visualizaons that update in response to user
interacons. When a user interacts with one visualizaon, such as selecng a segment
of a pie chart or a range of values in a histogram, other visualizaons on the
dashboard update to reect this selecon, showing only data related to the
This feature is crucial for analyzing big datasets
because it provides a holisc view of the data from mulple perspecves
simultaneously. It allows for cross-ltering, where interacng with one element of the
dashboard dynamically inuences the content of others, facilitang a deeper
understanding of the relaonships and correlaons within the data. This mul-
faceted approach to data exploraon helps in idenfying paerns and insights that
might not be apparent from a single visualizaon, making it an essenal tool in the
analysis of complex
user's interacon.
datasets.
Importance:
lOMoARcPSD| 23136115
These features are integral to the visual analysis of big datasets because they empower users to
navigate and make sense of the complexity and scale of the data. By enabling interacve exploraon,
users can more eecvely idenfy relevant paerns, trends, and insights, leading to more informed
decisionmaking and analysis.
You
At university X, the late graduaon rate is high meaning that most students cannot nish the program
with the designed meline of the curriculum. The reasons could be that: - Students cannot pass
courses aer nish aempt, - Students did not register for courses in the designated semester, -
Students intenonally lengthen the leaning path by registering for fewer courses than the suggested
courses in the curriculum, - Or the reasons Hence, people would like to analyze the learning progress
of students to make early warnings to students and the dean of the school regarding the potenal late
graduaon problem. - The data for the learning progress of student is grouped by enrolment batch. -
Each patch has an enrolment year and a major i.e. Batch 2020-Computer Science, Batch 2021-Data
Science, and Batch 2019-Informaon Technology - Each batch has it curriculum inclosing a list of
courses students of the batch should take each semester like the following table: + Batch – Semester
– Course + 2020-CS – 1 - Introducon to compung (3 credits), C++ Programing (4 credits) + 2020-CS –
2 OOP (4 credits), CN (4 credits) - Each bath has a summarized of grades of students by semester like
the following table: + Batch – Semester/Year – Course – Number of registraons/Total students of the
batch – Number of students who passed the course (score > 50) + 2020-CS Sem1/2020 Introducon
to Compung – 60/60 – 60 + 2020-CS – Sem1/2020 – C/C++ Programming – 50/60 –
40 + 2020-CS – Sem1/2020 C/C++ Programming 5/60 – 5 + 2020-CS Sem2/2020 – CN – 60/60 –
50 - In the table above, row 2 shows that 50 students of 60 students of batch 2020-Computer Science
registered for “C/C++ Programming” in semester 1 of 2020. Add only 40 of those 50 students passed
the course. - Row 3 show that in semester 2 of 2020, 5 students of batch 2020Computer Science
registerd for “C/C++ Programmingand all those 5 students passed the course? Your tasks are to design
chart(s) to analyze the learning progress of students and highlight potenal late graduaon rate. a)
(10pts) State the problem described above in terms of abstract task in visualizaon. b) (10pts) What
are types of data? What are the marks and channels to use? Draw the sample chart(s) to illustrates
your design. c) (10pts) Explain and describe how your chart(s) could help people detect the potenal
high late graduaon rate of a
batch. ChatGPT
ChatGPT
a) Abstract Task in Visualizaon
The primary task is to design a visualizaon system that enables the idencaon and analysis of
factors contribung to late graduaon rates among university students by batch and major. This system
should facilitate the exploraon of course registraon paerns, pass rates, and adherence to the
prescribed curriculum meline, enabling stakeholders to make informed decisions and intervene
where necessary.
b) Types of Data, Marks, and Channels
T
ypes of Data:
1. Categorical Data:
Batch idencaon (e.g., "2020-CS"), majors
e.g., "Computer Science"), semesters (e.g., "Sem1/2020"), and courses (e.g.,
"Introducon to Compung").
(
lOMoARcPSD| 23136115
2. Quantave Data: Number of registraons per course, total students per batch, and
number of students who passed each course.
Marks and Channels:
Marks: Points, lines, bars.
Channels: Posion, length, color, and size.
Sample Chart Design:
1. Stacked Bar Chart for Course Registraon and Pass Rates by Semester:
X-Axis: Semesters (me-ordered).
Y-Axis: Number of students.
Bars: Each bar represents a course, with the length indicang the number of
registraons.
Segments within Bars: Dierenate between students who passed and
those who didn't using color (e.g., green for passed, red for not passed).
Faceng: Separate charts for each batch and major to allow comparison.
2. Line Chart for Cumulave Course Compleons: X-Axis:
Semesters.
Y-Axis: Cumulave number of courses completed by students.
Lines: Represent individual students or an average trend line for the batch.
Color: Dierenate lines by major or other relevant categories.
Sample Visualizaon:
Let's create a simplied illustraon of the stacked bar chart design for course registraon and pass
rates by semester, which will provide a visual representaon of how this concept might be applied to
the data described.
dalleCopy code
{ "prompt": "Create an illustrave stacked bar chart showing course registraon and
pass rates for a university batch. The x-axis should have labels for semesters, such
as 'Sem1/2020', 'Sem2/2020', and so on. Each bar represents a course, with the
length indicang the number of registraons. Within each bar, use green to show the
poron of students who passed and red for those who didn't. Include a legend
indicang what green and red represent. Facet the chart by batches and majors,
such as '2020-CS', '2021-Data Science', to allow comparison between dierent
groups. Add a tle 'Course Registraon and Pass Rates by Semester for University
Batches'.", "size": "1024x1024" }
c) Detecon of Potenal High Late Graduaon Rate
The designed charts would help in detecng potenal high late graduaon rates
in several ways:
1. Idenfying Challenging Courses:
The stacked bar chart can highlight courses with low pass rates, indicang
subjects that may be barriers to mely graduaon.
2. Trends in Course Registraons:
lOMoARcPSD| 23136115
By observing the registraon numbers across semesters, stakeholders can idenfy
paerns where students might be avoiding certain courses or registering for fewer
courses than suggested, potenally delaying graduaon.
3. Comparison Across Batches:
Faceng by batch and major allows for the comparison of course compleon rates
and potenal issues across dierent groups, helping to idenfy if certain batches or
majors are more prone to late graduaons.
4. Cumulave Progress Tracking:
The line chart showing cumulave course compleons provides a clear visual
representaon of students' progress towards their degree requirements over me.
Deviaons from the expected trajectory can prompt early intervenons.
5. Aconable Insights:
The visualizaon can serve as a basis for discussions among educators and
administrators about curriculum adjustments, support systems, and intervenons to
assist students at risk of late graduaon.
d) ate of a batch.

Preview text:

lOMoAR cPSD| 23136115
Interactive features play a crucial role in visual analysis of big datasets, helping users to
derive insights from complex and voluminous data. Two of the most important interactive features are:
1. Dynamic Filtering and Selection:
Description: Dynamic filtering and selection allow users to interactiv ly
e modify what data is visible in a visualization c b an ased
on certain criteria or selections. This involve sl to id
ols ers, checkboxes, dropdown menus, or brush for cti
sele ng ranges of data directly on the visualization
i tself. For example, in a large dataset containing
sa les data over several years, dynamic filtering a user to could
enable focus on data from a specific year, Importance: duct category.
This feature is critical because it enables down into large region, or pro
datasets and focus on subsets rticular interest, making it easier to uncover users to drill that
nds, and outliers. It supports exploratory data lowing users are of
pa to form and test hypotheses a, iteratively refining their patterns,
tre questions based on dback the data provides. analysis by al about the dat the visual fee
2. Interactive Dashboards and Linked Visualizations:
Description: Interactive dashboards and linked
visualizations refer to the coordinated use of multiple data visualizations tIha m t p up ort d a a n te
ce:i n response to user interactions. When a user interacts with
one visualization, such as selecting a segment of a pie chart or a range of
values in a histogram, other visualizations on the dashboard update to
reflect this selection, showing only data related to the user's interacti onTh
. is feature is crucial for analyzing big
datasets because it provides a holistic view of the data from multiple
perspectives simultaneously. It allows for cross-filtering, where interacting
with one element of the dashboard dynamically influences the content of
others, facilitating a deeper understanding of the relationships and
correlations within the data. This multi-faceted approach to data
exploration helps in identifying patterns and insights that might not be
apparent from a single visualization, making it an essential tool in the analysis of complex datasets. lOMoAR cPSD| 23136115
These features are integral to the visual analysis of big datasets because they empower
users to navigate and make sense of the complexity and scale of the data. By enabling
interactive exploration, users can more effectively identify relevant patterns, trends, and
insights, leading to more informed decision-making and analysis.
At university X, the late graduation rate is high meaning that most students cannot finish
the program with the designed timeline of the curriculum. The reasons could be that:
- Students cannot pass courses after finish attempt,
- Students did not register for courses in the designated semester,
- Students intentionally lengthen the leaning path by registering for fewer courses
than the suggested courses in the curriculum, - Or the reasons
Hence, people would like to analyze the learning progress of students to make early
warnings to students and the dean of the school regarding the potential late graduation problem.
- The data for the learning progress of student is grouped by enrolment batch.
- Each patch has an enrolment year and a major i.e. Batch 2020-Computer Science,
Batch 2021-Data Science, and Batch 2019-Information Technology
- Each batch has it curriculum inclosing a list of courses students of the batch should
take each semester like the following table:
+ Batch – Semester – Course
+ 2020-CS – 1 - Introduction to computing (3 credits), C++ Programing (4 credits)
+ 2020-CS – 2 – OOP (4 credits), CN (4 credits)
- Each bath has a summarized of grades of students by semester like the following table:
+ Batch – Semester/Year – Course – Number of registrations/Total students of the
batch – Number of students who passed the course (score > 50) + 2020-CS –
Sem1/2020 – Introduction to Computing – 60/60 – 60
+ 2020-CS – Sem1/2020 – C/C++ Programming – 50/60 – 40
+ 2020-CS – Sem1/2020 – C/C++ Programming – 5/60 – 5
+ 2020-CS – Sem2/2020 – CN – 60/60 – 50
- In the table above, row 2 shows that 50 students of 60 students of batch
2020Computer Science registered for “C/C++ Programming” in semester 1 of 2020.
Add only 40 of those 50 students passed the course.
- Row 3 show that in semester 2 of 2020, 5 students of batch 2020-Computer
Science registerd for “C/C++ Programming” and all those 5 students passed the course? lOMoAR cPSD| 23136115
Your tasks are to design chart(s) to analyze the learning progress of students and
highlight potential late graduation rate.
a) (10pts) State the problem described above in terms of abstract task in visualization.
b) (10pts) What are types of data? What are the marks and channels to use? Draw
the sample chart(s) to illustrates your design.
c) (10pts) Explain and describe how your chart(s) could help people detect the
potential high late graduation r
Interactive features play a crucial role in visual analysis of big datasets, helping users to derive insights
from complex and voluminous data. Two of the most important interactive features are:
1. Dynamic Filtering and Selection:
Description: Dynamic filtering and selection allow users to modify what data is
visible in a visualization based on interactively certain criteri
a or selections. This can involve sliders, checkboxes, dropdown m
enus, or brush tools for selecting ranges of data directly on t e
h visualization itself. For example, in a large dataset containing s les
a data over several years, dynamic filtering could to focus on data enable a use fr
r om a specific year, region, or gory. Importance:
This feature is critical because it enables users to drill ge datasets product cate
and focus on subsets that are of particular ing it easier to uncover
patterns, trends, and outliers. down into xp
lar loratory data analysis by allowing users to form and es about the interest, mak da
It ta, iteratively refining their questions visual feedback the data supports e tes pr t ovides. hypothes based on the
2. Interactive Dashboards and Linked Visualizations:
Description: Interactive dashboards and linked visualizations refer
to the coordinated use of multiple data visualizations that update in response to user in I ter m ac po tio rta ns.
nce :When a user interacts with one visualization, such as selecting a segment
of a pie chart or a range of values in a histogram, other visualizations on the
dashboard update to reflect this selection, showing only data related to the user's interactio Thi
n. s feature is crucial for analyzing big datasets
because it provides a holistic view of the data from multiple perspectives
simultaneously. It allows for cross-filtering, where interacting with one element of the
dashboard dynamically influences the content of others, facilitating a deeper
understanding of the relationships and correlations within the data. This multi-
faceted approach to data exploration helps in identifying patterns and insights that
might not be apparent from a single visualization, making it an essential tool in the analysis of complex datasets. lOMoAR cPSD| 23136115
These features are integral to the visual analysis of big datasets because they empower users to
navigate and make sense of the complexity and scale of the data. By enabling interactive exploration,
users can more effectively identify relevant patterns, trends, and insights, leading to more informed decisionmaking and analysis. You
At university X, the late graduation rate is high meaning that most students cannot finish the program
with the designed timeline of the curriculum. The reasons could be that: - Students cannot pass
courses after finish attempt, - Students did not register for courses in the designated semester, -
Students intentionally lengthen the leaning path by registering for fewer courses than the suggested
courses in the curriculum, - Or the reasons Hence, people would like to analyze the learning progress
of students to make early warnings to students and the dean of the school regarding the potential late
graduation problem. - The data for the learning progress of student is grouped by enrolment batch. -
Each patch has an enrolment year and a major i.e. Batch 2020-Computer Science, Batch 2021-Data
Science, and Batch 2019-Information Technology - Each batch has it curriculum inclosing a list of
courses students of the batch should take each semester like the following table: + Batch – Semester
– Course + 2020-CS – 1 - Introduction to computing (3 credits), C++ Programing (4 credits) + 2020-CS –
2 – OOP (4 credits), CN (4 credits) - Each bath has a summarized of grades of students by semester like
the following table: + Batch – Semester/Year – Course – Number of registrations/Total students of the
batch – Number of students who passed the course (score > 50) + 2020-CS – Sem1/2020 – Introduction
to Computing – 60/60 – 60 + 2020-CS – Sem1/2020 – C/C++ Programming – 50/60 –
40 + 2020-CS – Sem1/2020 – C/C++ Programming – 5/60 – 5 + 2020-CS – Sem2/2020 – CN – 60/60 –
50 - In the table above, row 2 shows that 50 students of 60 students of batch 2020-Computer Science
registered for “C/C++ Programming” in semester 1 of 2020. Add only 40 of those 50 students passed
the course. - Row 3 show that in semester 2 of 2020, 5 students of batch 2020Computer Science
registerd for “C/C++ Programming” and all those 5 students passed the course? Your tasks are to design
chart(s) to analyze the learning progress of students and highlight potential late graduation rate. a)
(10pts) State the problem described above in terms of abstract task in visualization. b) (10pts) What
are types of data? What are the marks and channels to use? Draw the sample chart(s) to illustrates
your design. c) (10pts) Explain and describe how your chart(s) could help people detect the potential
high late graduation rate of a batch. ChatGPT ChatGPT
a) Abstract Task in Visualization
The primary task is to design a visualization system that enables the identification and analysis of
factors contributing to late graduation rates among university students by batch and major. This system
should facilitate the exploration of course registration patterns, pass rates, and adherence to the
prescribed curriculum timeline, enabling stakeholders to make informed decisions and intervene where necessary.
b) Types of Data, Marks, and Channels T y pes of Data: 1. Categorical Data:
Batch identification (e.g., "2020-CS"), majors (e.
g., "Computer Science"), semesters (e.g., "Sem1/2020"), and courses (e.g., "Introduction to Computing"). lOMoAR cPSD| 23136115
2. Quantitative Data: Number of registrations per course, total students per batch, and
number of students who passed each course. • Marks and Channels:
Marks: Points, lines, bars. •
Channels: Position, length, color, and size. • Sample Chart Design:
1. Stacked Bar Chart for Course Registration and Pass Rates by Semester:
X-Axis: Semesters (time-ordered). •
Y-Axis: Number of students. •
Bars: Each bar represents a course, with the length indicating the number of registrations. •
Segments within Bars: Differentiate between students who passed and
those who didn't using color (e.g., green for passed, red for not passed). •
Faceting: Separate charts for each batch and major to allow comparison.
2. Line Chart for Cumulative Course Completions: X-Axis: Semesters. •
Y-Axis: Cumulative number of courses completed by students. •
Lines: Represent individual students or an average trend line for the batch. •
Color: Differentiate lines by major or other relevant categories. • Sample Visualization:
Let's create a simplified illustration of the stacked bar chart design for course registration and pass
rates by semester, which will provide a visual representation of how this concept might be applied to the data described. dalleCopy code
{ "prompt": "Create an illustrative stacked bar chart showing course registration and
pass rates for a university batch. The x-axis should have labels for semesters, such
as 'Sem1/2020', 'Sem2/2020', and so on. Each bar represents a course, with the
length indicating the number of registrations. Within each bar, use green to show the
portion of students who passed and red for those who didn't. Include a legend
indicating what green and red represent. Facet the chart by batches and majors,
such as '2020-CS', '2021-Data Science', to allow comparison between different
groups. Add a title 'Course Registration and Pass Rates by Semester for University
Batches'.", "size": "1024x1024" }
c) Detection of Potential High Late Graduation Rate The d in se esi ver gned c al way ha
s: rts would help in detecting potential high late graduation rates
1. Identifying Challenging Courses:
The stacked bar chart can highlight courses with low pass rates, indicating
subjects that may be barriers to timely graduation.
2. Trends in Course Registrations: lOMoAR cPSD| 23136115
By observing the registration numbers across semesters, stakeholders can identify
patterns where students might be avoiding certain courses or registering for fewer
courses than suggested, potentially delaying graduation.
3. Comparison Across Batches:
Faceting by batch and major allows for the comparison of course completion rates
and potential issues across different groups, helping to identify if certain batches or
majors are more prone to late graduations.
4. Cumulative Progress Tracking:
The line chart showing cumulative course completions provides a clear visual
representation of students' progress towards their degree requirements over time.
Deviations from the expected trajectory can prompt early interventions.
5. Actionable Insights:
The visualization can serve as a basis for discussions among educators and
administrators about curriculum adjustments, support systems, and interventions to
assist students at risk of late graduation. d) ate of a batch.