I was asked to build a control-system for a Ebay-like Finnish auction-site huuto.net.
The system would reopen closed auctions by a specific rules. It would be completely external from the main site, running at an external website.
The site is however unwilling to release its API and Schema. I know no way to build such a system without knowing its API.
How do you build an internet site without its API and Schema?
4
Answers
It might be possible to get the data you need by screen scraping the site. You could perform the operations you want to do by POSTing data into their forms or using a WebClient type API to make your program act like a web browser but that’s likely to be an extremely brittle solution.
Honestly though, without an API, there really is no good solution.
You could try some form of automatic browsing: mechanize
Edit:
Examples here.
you either need access to the database or an API, otherwise no point in even trying.
I think you’re asking about building a site that interacts with another site without using a well-defined API. Is that right?
You can interact with an external site without using an official API – in order to do so, you need to imitate a normal site visitor and send your requests to the site frontend (in much the same way as a web crawler does). Tools like hpricot, mechanize and curl can help you parse the content of pages and send requests, but in doing so your system may be quite brittle. Any change to the target site might mean you have to rewrite portions of your system.