Goutte is a light weight php web-scraping library that can scrape websites like wikipedia, google, facebook, instagram and many more. In this tutorial, I will be showing you how to setup a simple goutte web scraper that scrapes data from wikipedia in 3 steps. Step Install composer if you don’t already have it installed: for mac run: “Brew install composer” for other operating systems(or if “Brew install composer” doesn’t work) follow the instructions here: https://getcomposer.org/download/ Step Install the goutte library by running this command: composer require fabpot/goutte
The web scraper goes to wikipedia, looks up “web crawler” in the search bar and prints the first paragraph of the wikipedia page about web crawlers.