Jump to content

Working on a new review site


RadioRob
This topic is 1041 days old and is no longer open for new replies.  Replies are automatically disabled after two years of inactivity.  Please create a new topic instead of posting here.  

Recommended Posts

17 minutes ago, RadioRob said:

Reviews would not automatically import over unfortunately. There are too many differences that prevent me from creating an actual converter.

Just how bizarre is it? Is it a jungle of HTML pages, or is it a database? If the latter, I've used Apple's Filemaker Pro database for doing all manner of obnoxious database conversions over the years. If I can read the data in, maybe I can munch it and spit out something you can use. 

Link to comment
Share on other sites

25 minutes ago, Orin said:

Just how bizarre is it? Is it a jungle of HTML pages, or is it a database? If the latter, I've used Apple's Filemaker Pro database for doing all manner of obnoxious database conversions over the years. If I can read the data in, maybe I can munch it and spit out something you can use. 

I might ping you for some help/advice. I’m a system administrator by trade, not a DBA or a programmer. So my speciality is getting systems up and running. Not transferring data. Haha

The raw reviews are a jungle of HTML pages, but the list of escorts themselves are in a database. 

Part of the challenge is that each listing combined the escort data itself along with the review. In a true relational database this is split so it will be dividing data up among multiple databases with proper keys and indexes. I will need to handle importing some static HTML sources and some SQL sources  across multiple databases to properly import everything.

The other challenge I ran into is Daddy changed the raw html template used in the reviews over years. So when I tried to import data from raw HTML by screen scraping files, it still failed randomally because there was not regular consistently with how each field was labeled and was it’s names were. For example, a name field was renamed at some point to instead of have one field named “name” it was later split to “first name” and “last name”. Other fields were added or removed or renamed over the years. So it’s going to be super messy. 
 

Link to comment
Share on other sites

39 minutes ago, RadioRob said:

The other challenge I ran into is Daddy changed the raw html template used in the reviews over years. So when I tried to import data from raw HTML by screen scraping files, it still failed randomally because there was not regular consistently with how each field was labeled and was it’s names were. For example, a name field was renamed at some point to instead of have one field named “name” it was later split to “first name” and “last name”. Other fields were added or removed or renamed over the years. So it’s going to be super messy. 

I don't know how you would be storing the HTML part of this, but Filemaker lets you store an HTML page in a container field. Let's give it try.

Link to comment
Share on other sites

  • 2 weeks later...
  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...