[mythtv-users] Re: Character encoding problem in UK Radio Times XML data

Nick knowledgejunkie at gmail.com
Sun Jun 19 16:16:57 UTC 2005


On 6/19/05, Paul <mythtv at dsl.pipex.com> wrote:
> >I'm noticing that I keep getting 'Æ' characters in Myth where I guess
> >RT is supplying 'proper' quote characters. I'm also guessing that this
> >is an encoding issue; my FC3 is running UTF-8, and I suppose RT is
> >using 'Doze Latin1.
> 
> >Is everyone else using RT XML getting this, or have I got something I
> >can fix? Or can I work around it anyway?

Getting this too - it's in the source file from the Beeb.

> The strange thing is the apostrophe in "Britain's" is shown correctly but
> the
> same character in "who's" is not shown correctly. Is there anybody using
> the Radio Times grabber not seeing this problem?

Checking the BBC1 file at http://xmltv.radiotimes.com/xmltv/92.dat,
the first apostrophe is indeed an apostrophe in the source file. The
second in "who's" is the AE ligature in the source file. This is the
case when viewing in Firefox with 8859-1 char encoding. It also seems
to be the only AE character in the BBC1 file currently.

> Personally I think that if this is a common problem it should be fixed in
> the tv_grab_uk_rt grabber.

And/or at the BBC level, as this seems to be a non-uniform error. 

I hadn't noticed the problem at all until it was mentioned a couple of
days ago. Have other users seen this before, or is it a new 'addition'
to the data?

Nick


More information about the mythtv-users mailing list