Posted February 09, 2012
orcishgamer: Sure, you can create a hash of each page, or if you want to only check for changes to content use a screen scraper and hash the result (which in practice very well may amount to the same thing, depending on how good the screen scraper is).
BTW, doing all of that in C++ is going to be a real pain as a beginner, even opening a socket is hard in C++. You might consider a language that's a little easier for that kind of thing if it's an option (Java is a good choice, C# should be a good choice as well).
jamyskis: I don't see why it should be that much of a pain in C++. You can use libcurl to fetch the page, create a base hash as you suggest, download the new page using libcurl, hash that and compare the two hashes without ever having to parse the markup. BTW, doing all of that in C++ is going to be a real pain as a beginner, even opening a socket is hard in C++. You might consider a language that's a little easier for that kind of thing if it's an option (Java is a good choice, C# should be a good choice as well).
Simples.
Using even a well-designed and well-documented library like libcurl can be intimidating for a beginner, though it's definitely the best way to get the job done.