Project details

Web Crawling and Data Extraction

Published by: P4I

Status: READY FOR PUBLISHING

Category: DATA INTELLIGENCE

Application domain: Information Technology

Budget (EUR): UP TO 15000

Project description

Our company would like to analyze the job posting status in the italian market by extracting data about required skills and digital competences from the recruiting companies web sites.

Project goal

Having a list of predefined websites and a list of predefined fields, the tool is required to perform a web crawling fetching all the relevant URLs and parsing the web pages to extract structured data from the pages.