|
Description: |
Prototype required as proof of concept.
Crawler to validate the presence of markup fragments.
Main deliverables:
1 - CLI script (cronjob) to recursively scan website for markup, updates results to db.
2 - SQL script to create database and tables for 1.
Secondary deliverables:
Simple 3 page website to demonstrate the functionality of the main deliverables and allows for acceptance testing.
1 - Website listing
2 - Website detail (name, toplevel URL, markup to find)
3 - display scan results from db
Cosmetics unimportant. Functionality is main focus.
Site will be reengineered and made pretty in a separate project.
Refer to the attachment wmv_20081112.zip for details of the requirement.
I look forward to hearing from you! Additional Info (Added 11/18/2008 at 11:43 EST)...Escrow will be used. Additional Info (Added 11/19/2008 at 11:19 EST)...Folks,
please pay attention to this and take this into account when bidding.
The web site is unimportant. The only reason for it is to make acceptance testing simple for us both. Neither of us want to dig through phpmyadmin and SQL to prove that the code works or has bugs. I expect a few simple web pages that show that the bot is doing it's job. Not CSS, javascript and rounded rectangles. If all you do is web design, sorry, but you're in the wrong place. Fancy design and html skills, while always impressive, are NOT relevant here.
Rather show me that you have solid skills in with bots, scraping and parsing.
I consider this to be a fairly standard exercise for anyone who has experience with bots.
Thanks
|