Web Scraping with NodeJS

My new tutorial – Web Scraping with NodeJS – is up on Nettuts+. I go through basic scraping methods but the real gold is in the second half, where I talk about using PhantomJS to scrape dynamic pages (pages with AJAX content). Check it out!

  1. Saw on tutsplus lots of comments about “No path or port provided”. I had this problem too and found a solution to fix it. As i have no account at Tutsplus i will post it here:

    – Go to the phantomjs version (the node.js version).
    – Remove node_modules/dnode + node_modules/dnode-protocol
    – open package.json
    – change :

    “dependencies”: {
    “dnode-protocol”: “*”,
    “dnode”: “*”,
    “express”: “*”


    “dependencies”: {
    “dnode-protocol”: “~0.2.2”,
    “dnode”: “~0.9.12”,
    “express”: “*”
    – in the phantomjs folder excecute: “npm install phantom”
    – The error is gone

Comments are closed.