jeudi 21 juillet 2016

How to extract specific content, like name or DOB, from a document using NLP and python?

I want to extract very specific content like name, address and dob from a document (say for example, a resume). Assuming I have 1000 of such documents, I want to automate it using machine learning and natural language processing. And preferably python. How can I do that? or Where do I start? Update: I am aware of NER but I am looking to extract very specific information from a document which can be loaded into an excel or something. Example: From a project report, I would like to extract the topic, team member names and tenure of the project.

Aucun commentaire:

Enregistrer un commentaire