
An Analysis of Student Representation, Representative Features and Classification Algorithms to Predict Degree Dropout

Manrique, Ruben; Nunes, Bernardo Pereira; Marino, Olga; Casanova, Marco A.; Nurmikko-Fuller, Terhi


dc.contributor.author: Manrique, Ruben
dc.contributor.author: Nunes, Bernardo Pereira
dc.contributor.author: Marino, Olga
dc.contributor.author: Casanova, Marco A.
dc.contributor.author: Nurmikko-Fuller, Terhi
dc.coverage.spatial: Tempe, United States
dc.date.accessioned: 2019-10-14T03:59:30Z
dc.date.created: March 4-8 2019
dc.identifier.isbn: 9781450362566
dc.identifier.uri: http://hdl.handle.net/1885/175222
dc.description.abstract: Identifying and monitoring students who are likely to drop out is a vital issue for universities. Early detection allows institutions to intervene, addressing problems and retaining students. Prior research into the early detection of at-risk students has opted for the use of predictive models, but a comprehensive assessment of the suitability of different algorithms and approaches is complicated by the large number of variable features that constitute a student's educational experience. Predictive models vary in terms of their amplitude, temporality and the learning algorithms employed. While amplitude refers to the ability of the model to operate on multiple degrees, temporality is often considered due to the natural temporal aspect of the data. In the absence of a comparative framework of learning algorithms, the aim of this paper has been to provide such an analysis, based on a proposed classification of strategies for predicting dropouts in Higher Education Institutions. Three different student representations are implemented (namely Global Feature-Based, Local Feature-Based, and Time Series) in conjunction with the appropriate learning algorithms for each of them. A description of each approach, as well as its implementation process, is presented in this paper as a technical contribution. The experiment uses a dataset of student information from two degrees, namely Business Administration and Architecture, acquired through an automated management system from a university in Brazil. Our findings can be summarized as: (i) of the three proposed student representations, the Local Feature-Based was the most suitable approach for predicting dropout. In addition to providing high quality results, the Local Feature-Based representations are simple to build, and the construction of the model is less expensive when compared to more complex ones; (ii) as a conclusion of the results obtained via Local Feature-Based, dropout can be said to be accurately predicted using grades of a few core courses, so there is no need for a complex feature extraction process; (iii) considering temporal aspects of the data does not seem to contribute to the prediction performance although it increases computational costs as the model complexity increases.
dc.description.sponsorship: This work was partially supported by COLCIENCIAS PhD scholarship (Call 647-2014).
dc.format.mimetype: application/pdf
dc.language.iso: en_AU
dc.publisher: Association for Computing Machinery
dc.relation.ispartofseries: 9th International Conference on Learning Analytics and Knowledge, LAK 2019
dc.rights: © 2019 Copyright held by the owner/author(s). Publication rights licensed to ACM
dc.source: ACM International Conference Proceeding Series
dc.title: An Analysis of Student Representation, Representative Features and Classification Algorithms to Predict Degree Dropout
dc.type: Conference paper
local.description.notes: Imported from ARIES
local.description.refereed: Yes
dc.date.issued: 2019
local.identifier.absfor: 080505 - Web Technologies (excl. Web Search)
local.identifier.ariespublication: u3102795xPUB1174
local.publisher.url: https://dl.acm.org/
local.type.status: Published Version
local.contributor.affiliation: Manrique, Ruben, Universidad de los Andes
local.contributor.affiliation: Nunes, Bernardo Pereira, Pontifical Catholic University of Rio de Janeiro (PUC-Rio)
local.contributor.affiliation: Marino, Olga, Universidad de los Andes
local.contributor.affiliation: Casanova, Marco A., Pontifical Catholic University of Rio de Janeiro
local.contributor.affiliation: Nurmikko-Fuller, Terhi, College of Arts and Social Sciences, ANU
local.description.embargo: 2037-12-31
local.bibliographicCitation.startpage: 401
local.bibliographicCitation.lastpage: 410
local.identifier.doi: 10.1145/3303772.3303800
local.identifier.absseo: 970108 - Expanding Knowledge in the Information and Computing Sciences
dc.date.updated: 2019-04-28T09:22:49Z
local.identifier.scopusID: 2-s2.0-85062769647
Collections: ANU Research Publications

File: 01_Manrique_An_Analysis_of_Student_2019.pdf (947.65 kB, Adobe PDF)


