Home

Melanie Tosik

31 August 2014

Research internship at Textkernel

In 2014, I was a research intern at Textkernel, where we explored new methods of improving resume parsing for multi-lingual documents.


In order to extract structured information in the form of specific phrases like name or address, we adopted the probabilistic conditional random fields (CRF) framework. In addition, we experimented with a novel approach that integrates continuous vector representations of words as input features for such a model.


Next to my internship report, we also published a paper detailing our work at NAACL-HTL 2015. Textkernel also released an interview about my overall experience.


Til next time,
Melanie

scribble