About DoKS      NL  |  EN Search: Advanced Search
  Part of a word (e.g. tele*)    Exact wordgroup (e.g. "wireless communication")
 
Home
folder Authors
folder Departments
folder Help
folder Years
 
Most popular theses: 2014 2015 2016 2017 2018 2019


1,260 theses on-line.



Doks PHL
Doks XIOS



Open Archives Initiative
Home
ETD
Title : Data gathering and analysis: recognising Personally Identifiable Information
Author :
Vanbrabant, Casper
Abstract : The internship assignment consists of the building and maintenance of a web scraper. The goal of this assignment is to collect Big Data sets from social media websites. The Institute then uses the Big Data sets to perform dialect analyses on them.

There are many web scraping tools available, they often have different features, and sometimes they can be quite costly. At first a web scraping technology has to be selected in according to the needs of the assignment. In this case the tool has to be able to scrap data, filter data and subsequently store it in a database. After the comparison of some web scraping tools, the best one is selected and implemented.

The focus of the research assignment is on Personally Identifiable Information. This type of information can be found almost everywhere on the worldwide web, especially on social media. Most people do not understand the possible danger of having their personal information falling into the wrong hands. A literature study explains the definition of Personally Identifiable Information, the difference with Personal Data defined by the General Data Protection Regulation, and demonstrates how criminals can (ab)use Personally Identifiable Information.
Furthermore, there is a basic principle for training a model that could be used to recognise PII in the data sets that are collected by the web scraper.
Publisher : Hogeschool PXL
Publication date : 2019
Full text :
File Size Type Checksum  
Eindwerk_Vanbrabant_Casper_Definitief.pdf 2 MB PDF MD5 Open file
Appendices :
File Size Type Checksum  
Type : Bachelorproef
Language :
English
Rights : All rights reserved
Degree name : Professionele bachelor in de toegepaste informatica
Degree level : Bachelor
Discipline : Systemen en netwerkbeheer
Grantor of degree : Hogeschool PXL
Department : PXL-Digital
Awards :
Status : PUBLISHED
Persistent Identifier :
Created : 27/08/2019 13:52
Last modified : 13/10/2019 23:42
Collections :
PXL-IT
2019
11302205
Views : 256
Downloads : 230
 
 

Simple view




©2004-2008 - Hogeschool PXL - webmaster - Contact - Disclaimer