[mythtv-users] Duplicate recordings because of bad SD data?

f-myth-users at media.mit.edu f-myth-users at media.mit.edu
Mon Aug 9 02:35:54 UTC 2010


More bad data, in three different ways.  The really irritating part of
this is not that I wind up with repeats I don't want (though that's
annoying).  It's not knowing if TMS/SD will figure out their error
and -reuse- programids for truly-new content (instead of claimed-to-
be-new-but-actually-repeated content), thus causing me to miss things
because Myth already thought it recorded them.  Is it possible to get
a straight answer on this from TMS?  Will they -ever- reuse a programid,
even if they realize that they emitted it for bogus reasons in the past?

[Manually nuking the programid data out of the entries in oldrecorded
is one way to not lose if they reuse the id for something else, but of
course it'll cause more repeats of the same bogus thing to be recorded
if they -don't- fix their error...]
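
For concreteness, the "nuke the programid" workaround might look like the sketch below. It runs against an in-memory SQLite table that stands in for oldrecorded; the three-column schema and the helper name clear_programids are illustrative assumptions, not the real MythTV layout (which lives in MySQL and has many more columns). Back up the database before trying anything like this on a real system.

```python
import sqlite3

def clear_programids(conn, programids):
    """Blank the programid on matching oldrecorded rows; return rows changed."""
    programids = list(programids)
    if not programids:
        return 0
    marks = ",".join("?" for _ in programids)
    cur = conn.execute(
        f"UPDATE oldrecorded SET programid = '' WHERE programid IN ({marks})",
        programids,
    )
    conn.commit()
    return cur.rowcount

# Stand-in table: an assumed, heavily reduced subset of columns.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE oldrecorded (title TEXT, subtitle TEXT, programid TEXT)")
conn.executemany(
    "INSERT INTO oldrecorded VALUES (?, ?, ?)",
    [
        ("Ideal", "", "EP007206980039"),
        ("Ideal", "The Backpacker", "EP007206980005"),
    ],
)
changed = clear_programids(conn, ["EP007206980039"])
```

Only the rows carrying the suspect programid are touched, so the older, presumably correct, entries keep their ids.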

Item #1:

This is in the format of my automation, so the first field is a
similarity metric based on CC data, and then we have pairs of
(starttime_endtime_title::subtitle programid).

84 SIM 3212_20100808115800_20100808123500_Ideal:: EP007206980039 3212_20090705232800_20090706000500_Ideal::The_Backpacker EP007206980005
86 SIM 3212_20100808122800_20100808130500_Ideal:: EP007206980040 3212_20090712232800_20090713000500_Ideal::Party EP007206980006
89 SIM 3212_20100808125800_20100808133500_Ideal:: EP007206980041 3212_20090719232800_20090720000500_Ideal::Pregnancy EP007206980008
86 SIM 3212_20100808132800_20100808140500_Ideal:: EP007206980042 3212_20090726232800_20090727000500_Ideal::Corpse EP007206980009
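
A minimal sketch of pulling such a line apart, in case anyone wants to post-process output like this. The layout is inferred from the four lines above only; the meaning of the leading numeric field (3212 here) is not stated, so it is kept as an opaque "prefix". The names parse_rec and parse_line are made up for this example.

```python
import re
from datetime import datetime

# Assumed layout of one recording field:
#   <prefix>_<start>_<end>_<title>::<subtitle>, with "_" standing in for spaces.
REC = re.compile(r"(\d+)_(\d{14})_(\d{14})_(.*?)::(.*)")

def parse_rec(rec, programid):
    m = REC.fullmatch(rec)
    if m is None:
        raise ValueError(f"unrecognized record: {rec!r}")
    prefix, start, end, title, subtitle = m.groups()
    return {
        "prefix": prefix,
        "start": datetime.strptime(start, "%Y%m%d%H%M%S"),
        "end": datetime.strptime(end, "%Y%m%d%H%M%S"),
        "title": title.replace("_", " "),
        "subtitle": subtitle.replace("_", " "),  # empty when none was supplied
        "programid": programid,
    }

def parse_line(line):
    """Split '<score> SIM <rec> <programid> <rec> <programid>' into its parts."""
    score, tag, rec_a, id_a, rec_b, id_b = line.split()
    if tag != "SIM":
        raise ValueError(f"unexpected tag: {tag!r}")
    return int(score), parse_rec(rec_a, id_a), parse_rec(rec_b, id_b)
```

Feeding it the first line above gives a score of 84, an empty subtitle for the 2010 airing, and "The Backpacker" for the 2009 one.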

So these four were aired with no subtitle and new programid's, but they
were identical to four aired a while ago.  (The similarity metrics
don't look high because there is substantial pre- and post-roll
padding on all of these recordings, and that's different, of course.
I compared the actual CC data from start to end and it's identical
except for a couple dropped bits here and there---no content
differences, just the occasional transmission error.)
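
To illustrate why padding drags the score down, here is a sketch using Python's difflib. This is not the metric used in the automation above (that one is not described here); it is a stand-in that compares two caption word lists and also recovers the longest run they share. The function names are assumptions for this example.

```python
from difflib import SequenceMatcher

def cc_similarity(a_words, b_words):
    """Whole-recording similarity of two caption word lists, as a percentage."""
    sm = SequenceMatcher(None, a_words, b_words, autojunk=False)
    return round(100 * sm.ratio())

def shared_core(a_words, b_words):
    """Longest contiguous run of words common to both recordings."""
    sm = SequenceMatcher(None, a_words, b_words, autojunk=False)
    m = sm.find_longest_match(0, len(a_words), 0, len(b_words))
    return a_words[m.a:m.a + m.size]

# Hypothetical data: the same 50-word program wrapped in different padding.
core = [f"w{i}" for i in range(50)]
rec_a = ["adA1", "adA2", "adA3", "adA4", "adA5"] + core + ["adA6", "adA7", "adA8", "adA9", "adA10"]
rec_b = ["adB1", "adB2", "adB3", "adB4", "adB5"] + core + ["adB6", "adB7", "adB8", "adB9", "adB10"]
```

Here cc_similarity(rec_a, rec_b) comes out below 100 even though shared_core(rec_a, rec_b) returns the entire program, which is the same effect as scores in the 80s for identical episodes.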

XML for all of the ones aired recently (I don't feel like digging
around for the ancient XML unless it's important):

<program id='EP007206980039'>
<title>Ideal</title>
<showType>Series</showType>
<series>EP00720698</series>
<syndicatedEpisodeNumber>105</syndicatedEpisodeNumber>
<originalAirDate>2010-08-04</originalAirDate>
</program>
<program id='EP007206980040'>
<title>Ideal</title>
<showType>Series</showType>
<series>EP00720698</series>
<syndicatedEpisodeNumber>106</syndicatedEpisodeNumber>
<originalAirDate>2010-08-05</originalAirDate>
</program>
<program id='EP007206980041'>
<title>Ideal</title>
<showType>Series</showType>
<series>EP00720698</series>
<syndicatedEpisodeNumber>107</syndicatedEpisodeNumber>
<originalAirDate>2010-08-06</originalAirDate>
</program>
<program id='EP007206980042'>
<title>Ideal</title>
<showType>Series</showType>
<series>EP00720698</series>
<syndicatedEpisodeNumber>108</syndicatedEpisodeNumber>
<originalAirDate>2010-08-08</originalAirDate>
</program>

<schedule program='EP007206980039' station='14873' time='2010-08-08T16:00:00Z' duration='PT00H30M'/>
<schedule program='EP007206980040' station='14873' time='2010-08-08T16:30:00Z' duration='PT00H30M'/>
<schedule program='EP007206980041' station='14873' time='2010-08-08T17:00:00Z' duration='PT00H30M'/>
<schedule program='EP007206980042' station='14873' time='2010-08-08T17:30:00Z' duration='PT00H30M'/>
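
The padding on these recordings can be read off by comparing a schedule entry with the recording's start and end times. The sketch below assumes the recording times in the list above are local time at UTC-4; that offset is inferred from the data, not stated anywhere. The helper names are made up for this example, and the duration parser only handles the PTxxHxxM form seen here.

```python
import re
import xml.etree.ElementTree as ET
from datetime import datetime, timedelta, timezone

DURATION = re.compile(r"PT(\d+)H(\d+)M")

def scheduled_slot(schedule_xml):
    """Return (start, end) in UTC for one <schedule .../> element."""
    el = ET.fromstring(schedule_xml)
    start = datetime.strptime(el.get("time"), "%Y-%m-%dT%H:%M:%SZ")
    start = start.replace(tzinfo=timezone.utc)
    m = DURATION.fullmatch(el.get("duration"))
    if m is None:
        raise ValueError("unsupported duration: %r" % el.get("duration"))
    hours, minutes = map(int, m.groups())
    return start, start + timedelta(hours=hours, minutes=minutes)

def padding(rec_start, rec_end, slot_start, slot_end):
    """Return (pre_roll, post_roll) as timedeltas."""
    return slot_start - rec_start, rec_end - slot_end

local = timezone(timedelta(hours=-4))  # assumption: UTC-4 local time
slot = scheduled_slot(
    "<schedule program='EP007206980039' station='14873' "
    "time='2010-08-08T16:00:00Z' duration='PT00H30M'/>"
)
pre, post = padding(
    datetime(2010, 8, 8, 11, 58, tzinfo=local),
    datetime(2010, 8, 8, 12, 35, tzinfo=local),
    *slot,
)
```

Under that assumption the first recording has 2 minutes of pre-roll and 5 minutes of post-roll around its 30-minute slot.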

Item #2:

Slightly different format, but you get the idea:

3227    2010-08-07 00:58:00     2010-08-07 01:32:00     EP010453880010  How Do They Do It?      Money; Shoes    How they print banknotes; turning hides into brogues.
3227    2010-08-08 04:58:00     2010-08-08 05:32:00     EP008159770189  How Do They Do It?      Money; shoes    Money; shoes.

In this case, both descriptions are bogus---they appear to have been
swapped.  And the second subtitle should probably have been "moving
money; fishing rods" or maybe "printing banknotes; fishing rods;
brogues" or something like that.

They're about different things, so the differing programid's are fine,
though it's curious just -how- different they are.

First few words of each program, respectively:

>>> COMING UP ON "HOW DO THEY DO
IT?"--
HOW DO THEY MAKE THE MONEY IN
YOUR WALLET ALMOST IMPOSSIBLE TO
FORGE?
AND HOW DO THEY HAND CRAFT THE
FINEST SHOES FIT FOR "A" LIST
CELEBRITIES?

and

>> Narrator: COMING UP ON
"HOW DO THEY DO IT?"...
HOW DO THEY MOVE MILLIONS OF
DOLLARS IN COLD, HARD CASH?
HOW DID THEY MAKE FISHING RODS
TO CATCH BIG-GAME FISH?
AND HOW DO THEY MAKE THE
HAUNTING SOUND OF SCOTTISH
BAGPIPES?

[The word "scottish" does not occur in the CC data for the first one.
 The word "shoes" does not occur in the CC data for the second one.]

<program id='EP008159770189'>
<title>How Do They Do It?</title>
<subtitle>Money; shoes</subtitle>
<description>Money; shoes.</description>
<showType>Series</showType>
<series>EP00815977</series>
<originalAirDate>2010-08-21</originalAirDate>
</program>

<schedule program='EP008159770189' station='16616' time='2010-08-08T09:00:00Z' duration='PT00H30M' tvRating='TV-G' stereo='true'/>

Item #3:

3227    2009-10-09 20:58:00     2009-10-09 21:32:00     EP004154020346  How It's Made           Western revolver replicas; arc trainers; used-oil furnaces; vegetable peelers; pizza cutters.
3227    2010-08-04 03:28:00     2010-08-04 04:02:00     EP004154020382  How It's Made           Western revolver replicas; arc trainers; used-oil furnaces; vegetable peelers; pizza cutters.

Identical actual contents.  Identical descriptions, which is good.
Brand-new different programid on the 8/4/10 airing, which is bad.
Interestingly, both programid's are airing in the same 30-hour
window:

<program id='EP004154020346'>
<title>How It&apos;s Made</title>
<description>Western revolver replicas; arc trainers; used-oil furnaces; vegetable peelers; pizza cutters.</description>
<showType>Series</showType>
<series>EP00415402</series>
<originalAirDate>2009-10-09</originalAirDate>
</program>
<program id='EP004154020382'>
<title>How It&apos;s Made</title>
<description>Western revolver replicas; arc trainers; used-oil furnaces; vegetable peelers; pizza cutters.</description>
<showType>Series</showType>
<series>EP00415402</series>
<originalAirDate>2010-01-12</originalAirDate>
</program>

<schedule program='EP004154020346' station='16616' time='2010-08-03T00:30:00Z' duration='PT00H30M' tvRating='TV-G' stereo='true' closeCaptioned='true'/>
<schedule program='EP004154020346' station='16616' time='2010-08-03T03:30:00Z' duration='PT00H30M' tvRating='TV-G' stereo='true' closeCaptioned='true'/>
<schedule program='EP004154020382' station='16616' time='2010-08-04T07:30:00Z' duration='PT00H30M' tvRating='TV-G' stereo='true' closeCaptioned='true'/>
<schedule program='EP004154020382' station='57390' time='2010-08-04T07:30:00Z' duration='PT00H30M' tvRating='TV-G' stereo='true' hdtv='true' closeCaptioned='true'/>
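
A quick way to spot this pattern in a chunk of listings is to group the program elements by title, subtitle, and description, then report any group carrying more than one programid. This is a sketch under the assumption that the fragments are pasted without a root element, as above; the function name is made up, and generic descriptions shared by many episodes would produce false positives.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

def duplicate_programids(xml_fragment):
    """Map (title, subtitle, description) to programids when more than one shares it."""
    # The pasted fragments have no single root, so wrap them in one.
    root = ET.fromstring(f"<root>{xml_fragment}</root>")
    groups = defaultdict(set)
    for prog in root.iter("program"):
        desc = prog.findtext("description")
        if not desc:
            continue  # nothing to compare on, e.g. the Ideal entries
        key = (prog.findtext("title"), prog.findtext("subtitle") or "", desc)
        groups[key].add(prog.get("id"))
    return {k: sorted(v) for k, v in groups.items() if len(v) > 1}
```

Run over the two program elements above, it reports EP004154020346 and EP004154020382 under the same "How It's Made" description.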

