DS831: Programming for Data Science

The Study Board for Science

Teaching language: Danish or English depending on the teacher
EKA: N340103102
Assessment: Second examiner: Internal
Grading: 7-point grading scale
Offered in: Odense
Offered in: Autumn
Level: Master

STADS ID (UVA): N340103101
ECTS value: 10

Date of Approval: 15-03-2021


Duration: 1 semester

Version: Archive

Entry requirements

The course cannot be taken by students enrolled in the master programme in Computer Science.
The course cannot be taken by students who have taken DM561 or DM562 or another course with a similar content.

Academic preconditions

None

Course introduction

The aim of the course is to enable the student to solve data analysis tasks for a diversity of problems from different research areas. Next to algorithmic thinking, data analysis workflows include activities like data modeling, gathering, cleaning, processing, and means to visualize certain attributes in basic plots. This is important in regard to the remainder of the Data Science education as it provides the basis for carrying out data analysis projects.

The course gives an academic basis for studying the topics Data Mining and Machine Learning, Applied Machine Learning, Visualization and Deep Learning, that are part of the degree.

In relation to the competence profile of the degree it is the explicit focus of the course to:
  • Give the competence to develop solutions for data analysis tasks
  • Give the competence to apply and integrate existing modules for data processing
  • Give knowledge and understanding of the principles of programming
  • Give knowledge and understanding of algorithmically processing large amounts of data within different subject areas
  • Give skills in software development 
  • Give skills in data collection, cleaning, validation, integration and visualization

Among others, students partaking the course will particularly earn the following 21st Century Skills:

  • The ability to integrate and assess information
  • Competently find, utilise and assess information
  • Being able to execute and implement
  • Be flexible and adaptable
  • Co-create solutions to existing problems and work effectively in teams

Expected learning outcome

The learning objectives of the course is that the student demonstrates the ability to:
  • Apply learned problem solving strategies to different data processing tasks
  • Adapt existing solutions to related tasks across domains
  • Develop new data analysis strategies
  • Develop Python programs that implement data processing workflows
  • Find, select and utilize existing modules to collect, clean and process data
  • Collaboratively develop data analysis solutions in project teams

Content

The following topics are contained in the course:
1. Basics of computing and algorithmic thinking
2. Python programming
  • basic data types
  • branching
  • loops
  • functions
  • mutable data types
  • modules
  • file I/O, exceptions
  • classes
  • basic data visualization
3. Data analysis workflows with data sets from different domains, e.g.:
  • biography data
  • climate data
  • textual data
  • numerical data

Literature

See itslearning for syllabus lists and additional literature references.

Examination regulations

Exam element a)

Timing

Autumn and January

Tests

Portfolio

EKA

N340103102

Assessment

Second examiner: Internal

Grading

7-point grading scale

Identification

Student Identification Card

Language

Normally, the same as teaching language

Examination aids

Allowed, a closer description of the exam rules will be posted in itslearning.

ECTS value

10

Additional information

The portfolio exam includes small written assignments and a group project with a written report and an oral discussion.

Indicative number of lessons

90 hours per semester

Teaching Method

At the faculty of science, teaching is organized after the three-phase model ie. intro, training and study phase. The intro phase facilitates the introduction to new material and topics, which in the skills training phase are processed with exercises prepared at home and discussed in class to validate the acquired knowledge. The study activity in form of practical applications gives the students the possibility to apply and use the knowledge acquired.

Teacher responsible

Name E-mail Department
Stefan Jänicke stjaenicke@imada.sdu.dk Data Science

Timetable

Administrative Unit

Institut for Matematik og Datalogi (datalogi)

Team at Educational Law & Registration

NAT

Offered in

Odense

Recommended course of study

Profile Education Semester Offer period