Author |
Message |
Registered: March 14, 2007 | Reputation: | Posts: 6,747 |
| Posted: | | | | Quoting Corma: Quote: Looks great! Can't wait for it, thanks! But can you pause it? I guess I won't do the crew if it takes so long and can't be paused. I'll do my best, Ms. Sophie! | | | Karsten DVD Collectors Online
|
|
Registered: March 13, 2007 | Posts: 681 |
| Posted: | | | | Quoting DJ Doena: Quote: Little sneak preview for the next version:
I've built a scanner that scans your local cast and crew file and checks whether IMDb has merged two actors that are considered different in your database.
But fair warning: This will take a while. My 90,000 cast members took 13 hours to scan. That is because IMDb has to be called once for each actor to check whether it re-routes the URL to a different IMDb ID.
The result will look something like this (this is test data taken from my own collection):
Thanks in advance! This sounds like a great tool. | | | Mika I hate people who love me, and they hate me. (Bender Bending Rodriguez) |
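The check described in the quoted post (one request per person, then see whether the URL was re-routed to a different IMDb ID) could be sketched roughly as below. This is not the tool's actual code: the function names, the `https://www.imdb.com/name/<id>/` URL pattern, and the User-Agent header are assumptions for illustration, and IMDb may reject or throttle automated requests.

```python
import re
import urllib.request
from typing import Callable, Optional

# Assumed URL pattern for a person page; not taken from the tool itself.
NAME_URL = "https://www.imdb.com/name/{imdb_id}/"
ID_PATTERN = re.compile(r"/name/(nm\d+)")


def fetch_final_url(url: str) -> str:
    """Request the URL, follow redirects, and return the URL finally served."""
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.geturl()


def find_merge_target(
    imdb_id: str, fetch: Callable[[str], str] = fetch_final_url
) -> Optional[str]:
    """Return the ID that imdb_id is re-routed to, or None if it is not re-routed."""
    final_url = fetch(NAME_URL.format(imdb_id=imdb_id))
    match = ID_PATTERN.search(final_url)
    if match is None or match.group(1) == imdb_id:
        return None
    return match.group(1)
```

The `fetch` parameter is only there so the logic can be tried without network access, for example by passing a function that rewrites one ID into another.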
|
Registered: July 29, 2007 | Posts: 183 |
| Posted: | | | | Looks great. Thank you very much. But 157814 People in Cast.xml - 1 day and 28 min remaining - my electricity supplier should thank you more than I do |
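For a rough idea of how the scan time scales, here is a back-of-the-envelope estimate based only on the figure quoted earlier in the thread (90,000 people in 13 hours, about half a second per person). It assumes the time grows linearly with the number of people, which is a simplification; the function name and defaults are made up for this example.

```python
def estimate_scan_hours(
    people: int, ref_people: int = 90_000, ref_hours: float = 13.0
) -> float:
    """Scale the reference run (90,000 people in 13 hours) linearly."""
    return people * ref_hours / ref_people


hours = estimate_scan_hours(157_814)
print(f"{hours:.1f} hours")  # about 22.8 hours
```

That lands in the same range as the "1 day and 28 min" the tool reported for 157,814 people, so the per-request delay clearly dominates.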
|
Registered: March 14, 2007 | Reputation: | Posts: 6,747 |
| Posted: | | | | Well, you requested a pause, you got a pause. So you can have it run whenever your PC is running anyways. | | | Karsten DVD Collectors Online
|
|
Registered: March 29, 2007 | Reputation: | Posts: 4,479 |
| Posted: | | | | Quoting DJ Doena: Quote: ... new FindMergedCastCrew tool. Very useful tool. Works fine. I found 13 persons to fix; for some I had no idea they could be the same person (Gozie Agbo = Joe Russo, for example). Many thanks. | | | Images from movies |
|
Registered: March 14, 2007 | Reputation: | Posts: 6,747 |
| Posted: | | | | Quoting surfeur51: Quote: I found 13 persons to fix, some I had no idea they could be the same person (Gozie Agbo= Joe Russo, for example)
Yeah, I had some weird examples as well, e.g.
nm0914321: James Watkins
nm0160589: Julian Christopher
nm4029524: Kristen Stewart
nm2286991: Tara Cardinal
nm0676508: Jocelyne Peters
nm0500376: Sheila Leighton
nm0613790: James Murdock
nm0048371: David Baker
nm2004153: Steve McGowan
nm0569707: Richard Plantagenet
| | | Karsten DVD Collectors Online
|
|
Registered: July 29, 2007 | Posts: 183 |
| Posted: | | | | Great tool as usual, DJ Doena. But the usual problems, porno people in mainstream productions and women changing their names like I do my underpants, are ruining my day again. Almost all my 'yellow' results (and there are a LOT) look like this:
These people are considered identical by IMDb. Any call to the first person will lead to the second person. Please adapt your DVD Profiler data accordingly.
nm0817381: Randy Spears (1961)
nm0665856: Gregory Patrick (5856)
The problem is that often the only correct part of the second entry is the URL. The common name and BY of the first one are usually correct. Any suggestions on how best to resolve this, and in which order? I guess I first have to replace Randy Spears (1961) with Gregory Patrick (5856) and then parse a profile with him as usual, including the BY check, so that CCE2 notices the changed name and BY? |
|
| T!M | Profiling since Dec. 2000 |
Registered: March 13, 2007 | Reputation: | Posts: 8,739 |
| Posted: | | | | Quoting DJ Doena: Quote: I've built a scanner that scans your local cast and crew file and checks if IMDb has merged two actors that are considered different in your database. Sounds great, but from what I gather it only looks at the cast/crew data that was previously mined from IMDb using the Cast/Crew Edit tool. It would be even better if it could do the same check on *ALL* cast/crew in my database, not just those of the titles I previously used the Cast/Crew Edit 2 tool for. Couldn't it use DVD Profiler's XML export to do so? Or am I missing something and is that functionality present already? |
|
Registered: March 14, 2007 | Reputation: | Posts: 6,747 |
| Posted: | | | | All the people I found I had never heard of anyway so I took the result as is, corrected the data to the second entry (whatever that was) and if I ever come across this person again, it will hopefully show me during the normal scan that common name and/or birth year has changed again. | | | Karsten DVD Collectors Online
|
|
Registered: July 29, 2007 | Posts: 183 |
| Posted: | | | | Quoting DJ Doena: Quote: All the people I found I had never heard of anyway... Lucky you. I've just had to fix Robert Taylor. He is kinda famous I guess. He plays Sheriff Longmire in 'Longmire' for example. Do you always scan for changed BYs? Maybe that would help me avoid such messes? |
|
Registered: March 14, 2007 | Reputation: | Posts: 6,747 |
| Posted: | | | | I scan for changed BYs when I only have a Fake BY. Not when I already have an actual BY. | | | Karsten DVD Collectors Online
|
|
Registered: July 29, 2007 | Posts: 183 |
| Posted: | | | | Ok. I've finished editing my DB with the problems found by FindMergedCastCrew.exe. Then I've replaced my original cast.xml with the one from FindMergedCastCrew.
Then I've checked all four Birthyear options of CCE2 and parsed a new purchase / profile (The Hateful Eight). CCE2 split up Michael Madsen. He probably got a new BY on IMDb (1957 was my old entry, the new one is 1958).
This isn't supposed to happen without a warning by CCE2, right? Is there something wrong with my cast.xml, are my settings wrong, or is this maybe a new CCE2 problem?
PS: Just noticed the people in local cache count of CCE2 dropped from around 450k to 307k.
PPS: It was my cast.xml (it was very small, just a few kb). I've tried with a slightly older backup (the last one pre FindMergedCastCrew) and it worked. But since I've now 'lost' the cast.xml created by FindMergedCastCrew, what are the differences between its input and output file? Anything mandatory, or am I just getting warnings again which I've already fixed?
My best guess is this was caused by the Win10 update feature which booted while I was away but had CCE2 running this morning. | | | Last edited: by Corma |
|
Registered: March 14, 2007 | Reputation: | Posts: 6,747 |
| Posted: | | | | FindMergedCastCrew basically removes all the upper entries of the yellow log notes as they aren't necessary anymore. CCE2 itself now checks any link immediately to see if it leads to a newer URL (that's why it's gotten a bit slower). | | | Karsten DVD Collectors Online
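To illustrate what "removing the upper entries" amounts to, here is a minimal sketch. It does not reflect the real cast.xml layout or the tool's code: the cache is modelled as a plain dictionary from IMDb ID to name, and the function name is invented for this example.

```python
from typing import Dict, Iterable, Tuple


def drop_merged_entries(
    cache: Dict[str, str], merges: Iterable[Tuple[str, str]]
) -> Dict[str, str]:
    """Return a copy of the cache without the IDs that IMDb re-routes elsewhere.

    Each merge is an (old_id, new_id) pair, i.e. the upper and lower line
    of one yellow log note. Only the old_id entries are dropped.
    """
    obsolete = {old_id for old_id, _new_id in merges}
    return {
        imdb_id: name for imdb_id, name in cache.items() if imdb_id not in obsolete
    }


# Example using the pair posted earlier in this thread.
cache = {
    "nm0817381": "Randy Spears (1961)",
    "nm0665856": "Gregory Patrick (5856)",
}
cleaned = drop_merged_entries(cache, [("nm0817381", "nm0665856")])
```

After this step only the `nm0665856` entry is left, which matches the description that the upper entry of each note is no longer needed.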
|
|