30 credits – Natural language processing on technical documents

A thesis project at Scania is an excellent way to make contacts for your future working life. Many of our current employees started their careers with a thesis project. This thesis is carried out within the data science team at Scania IT, and you will be working in a field of great strategic value for Scania.


Scania regularly receives a large number of technical documents and complaints from workshops. These short texts are usually written in haste, so they sometimes contain so many mistakes that the language is difficult to process by machine. Because the texts contain many technical terms, ordinary spelling correction fails on them.


The purpose of the thesis project is to find a method of spelling correction that works well on these technical texts.


The goal is to create a spelling correction algorithm that works in the context of the existing environment.
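To illustrate the kind of approach that could be explored, here is a minimal sketch of a spelling corrector whose vocabulary is built from the technical texts themselves, so that domain terms are treated as correct words rather than as errors. It is only an assumption about one possible starting point, not the method used at Scania: the function names, the alphabet and the sample texts are hypothetical, and the technique is a simple single-edit, frequency-based corrector.

```python
import re
from collections import Counter

# Characters considered when generating candidate corrections (an assumption;
# a real system would derive this from the languages in the data).
ALPHABET = "abcdefghijklmnopqrstuvwxyzåäö0123456789"


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-zåäö0-9]+", text.lower())


def build_vocabulary(corpus):
    """Count how often each token occurs in a collection of domain texts."""
    counts = Counter()
    for document in corpus:
        counts.update(tokenize(document))
    return counts


def edits1(word):
    """Return all strings that are one edit away from `word`."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [a + b[1:] for a, b in splits if b]
    transposes = [a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1]
    replaces = [a + c + b[1:] for a, b in splits if b for c in ALPHABET]
    inserts = [a + c + b for a, b in splits for c in ALPHABET]
    return set(deletes + transposes + replaces + inserts)


def correct(word, vocabulary):
    """Return the most frequent known word within one edit, or the word itself."""
    word = word.lower()
    if word in vocabulary:
        return word  # known domain term: leave it untouched
    candidates = [w for w in edits1(word) if w in vocabulary]
    if not candidates:
        return word  # nothing plausible found: do not guess
    # Prefer the most frequent candidate; break ties alphabetically.
    return max(candidates, key=lambda w: (vocabulary[w], w))


# Hypothetical sample texts standing in for workshop reports.
sample_corpus = [
    "turbocharger leaking oil",
    "replaced turbocharger and gasket",
    "egr valve stuck",
]
vocabulary = build_vocabulary(sample_corpus)
print(correct("turbochargr", vocabulary))
```

A sketch like this only handles single-character errors and ignores context, which is part of what the thesis would need to improve on.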


Education or specialisation: master's student in IT, statistics, data science or similar.

Knowledge of the following subjects would be beneficial: big data, Hadoop and related technologies, data mining, machine learning, natural language processing, statistics and programming.

Number of students: 1-2

Start date: January 2019

Estimated time needed: 20 weeks

Contact persons and supervisors:

Isolde Snellman, IXAD, 08-553 71 117

Annette Hultåker, IXAD, 08-553 82 097


Your application should contain a covering letter, CV and transcripts.

Selections will be made throughout the application period.

Publication date from - until

2018-08-24 – 2018-12-02


Send your application to with the subject line Ny Teknik Jobb.
