
This topic is 7874 days old. Please don't post here. Open a new topic instead.


Posted

IMDB being the Internet Movie Database :)

I don't even really wanna know how it's done (yet), but we use the IMDB to get data on the films in our database (Year, Length, Actors, Director, Cinematographer, etc.), and we've been doing it manually for all these years, copying, pasting and formatting. It takes quite a lot of time, as you can guess.

So, could this be automated? As I said, I don't expect a full solution or anything, just a pointer in the right direction maybe...

many thanks!

  • 2 weeks later...
Posted

I won't address the legality of what you are trying to do, but make sure you do. :)

This is similar to a number of questions that have been posted about importing and parsing raw text files. You need to save the page source and then open it in a simple text editor to examine the formatting that surrounds the data you want to extract. Once you find the text that immediately precedes and follows the data (usually HTML tags), you can design a calculation formula to extract the part you want. To find the start of the data:

Start = Position(PageSource,PreambleText,1,1)+Length(PreambleText)

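Similarly, the end of the data, searching from Start so that an earlier occurrence of the postamble text is not matched by mistake:

End = Position(PageSource,PostambleText,Start,1)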

Then to extract the data itself:

Data = Middle(PageSource, Start,End-Start)

You will have to repeat these calculations for each item of data that you need to extract from the page source.
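For anyone who wants to try the logic outside FileMaker first, here is a rough sketch of the same preamble/postamble idea in Python (not FileMaker calculation syntax). The function name extract_between and the sample HTML fragment are made up for illustration; a real page will have different markup, so the surrounding text has to be read off the actual page source.

```python
def extract_between(page_source, preamble, postamble):
    """Return the text between preamble and postamble, or None if either is missing."""
    start = page_source.find(preamble)
    if start == -1:
        return None
    # Move past the preamble so start points at the first character of the data.
    start += len(preamble)
    # Look for the postamble only after the data starts.
    end = page_source.find(postamble, start)
    if end == -1:
        return None
    return page_source[start:end]


# Hypothetical fragment, NOT real IMDB markup.
sample = "<td>Year:</td><td>1999</td><td>Runtime:</td><td>136 min</td>"

# One call per item of data, each with its own preamble text.
year = extract_between(sample, "<td>Year:</td><td>", "</td>")
runtime = extract_between(sample, "<td>Runtime:</td><td>", "</td>")
print(year, runtime)
```

Returning None when a marker is missing is just one choice for this sketch; it makes it obvious when the page layout has changed and the preamble text no longer matches.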

Posted

IMDB used to have an XML interface to its data, which served as a reference implementation in many XML tutorials. Unfortunately, it seems to have been taken down as of May 2003. It would have been just perfect for XML import.

Afterthought: it has probably just moved to the pro services section, so give it a try.

Google search: imdb xml xsl

