Bug 770161

Summary: [abrt] python-mutagen-1.20-2.fc15: utf_8.py:16:decode:UnicodeDecodeError: 'utf8' codec can't decode byte 0xf3 in position 4: invalid continuation byte
Product: [Fedora] Fedora
Reporter: Alberto Segura <asgsb09>
Component: python-mutagen
Assignee: Silas Sewell <silas>
Status: CLOSED NOTABUG
QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: unspecified
Docs Contact:
Priority: unspecified
Version: 16
CC: rnovacek, silas
Target Milestone: ---
Target Release: ---
Hardware: x86_64
OS: Unspecified
Whiteboard: abrt_hash:a16d527e34c1c4a96464bc16f543f980ef8fbcc8
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2012-01-03 12:57:14 UTC
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:

Description Alberto Segura 2011-12-23 18:30:49 UTC
libreport version: 2.0.8
abrt_version:   2.0.7
cmdline:        /usr/bin/python /usr/bin/mid3v2 -T 31 -A 'Sism\xf3grafo' -t 'Sism\xf3grafo - Clem Leek - 19/12/11' --WOAF http://www.rtve.es/resources/TE_SSISMOG/podcast/mp3/7/7/1324276463877.mp3 --WOAS http://www.rtve.es/podcast/radio-3/sismografo/SSISMOG.xml '/home/alberto/audio/podcasts/Sism\xf3grafo/1324276463877.mp3'
executable:     /usr/bin/mid3v2
kernel:         3.1.5-6.fc16.x86_64
reason:         utf_8.py:16:decode:UnicodeDecodeError: 'utf8' codec can't decode byte 0xf3 in position 4: invalid continuation byte
time:           Fri 23 Dec 2011 19:20:27 CET
uid:            1000
username:       alberto

backtrace:
:utf_8.py:16:decode:UnicodeDecodeError: 'utf8' codec can't decode byte 0xf3 in position 4: invalid continuation byte
:
:Traceback (most recent call last):
:  File "/usr/bin/mid3v2", line 261, in <module>
:    main(sys.argv)
:  File "/usr/bin/mid3v2", line 246, in main
:    write_files(parser.edits, args)
:  File "/usr/bin/mid3v2", line 92, in write_files
:    edits = [(frame[2:], value.decode(enc)) for (frame, value) in edits]
:  File "/usr/lib64/python2.7/encodings/utf_8.py", line 16, in decode
:    return codecs.utf_8_decode(input, errors, True)
:UnicodeDecodeError: 'utf8' codec can't decode byte 0xf3 in position 4: invalid continuation byte
:
:Local variables in innermost frame:
:input: 'Sism\xf3grafo'
:errors: 'strict'
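
The failure in the innermost frame can be reproduced outside mid3v2. The following is a minimal sketch in Python 3 syntax (the traceback above comes from Python 2.7, where `str.decode` plays the role of `bytes.decode`). The helper name `try_decode` is hypothetical and not part of mutagen; the byte string is the `input` value shown above.

```python
# Bytes exactly as shown in the "input" local variable of the innermost frame.
raw = b"Sism\xf3grafo"


def try_decode(data, encoding):
    """Hypothetical helper: return (text, None) on success, (None, message) on failure."""
    try:
        # Default error handling is 'strict', matching the "errors" local above.
        return data.decode(encoding), None
    except UnicodeDecodeError as exc:
        return None, str(exc)


# Strict UTF-8 decoding fails: 0xf3 announces a multi-byte sequence,
# but the next byte ('g') is not a continuation byte.
utf8_text, utf8_error = try_decode(raw, "utf-8")

# The same bytes decode cleanly as a single-byte Western European encoding.
latin1_text, latin1_error = try_decode(raw, "latin-1")

print("utf-8  :", utf8_text, "|", utf8_error)
print("latin-1:", latin1_text, "|", latin1_error)
```

This suggests the argument bytes were produced in a single-byte encoding rather than in UTF-8.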

comment:
:	I use "hpodder" (https://github.com/jgoerzen/hpodder/wiki) for retrieving podcasts. "python-mutagen" is one of its dependencies, and one of its files (/usr/bin/mid3v2) gives the error.
:
:	The error appears when hpodder is fetching podcasts, with the commands "hpodder update && hpodder download".
:	
:	The accent marks of the Spanish language (ó, for example) are probably at fault ("Sismógrafo" is the name of one of the downloaded podcasts).

smolt_data:
:
:
:General
:=================================
:UUID: e46a0280-a38f-49c1-a55f-c1071f06f271
:OS: Fedora release 16 (Verne)
:Default run level: Unknown
:Language: es_ES.UTF-8
:Platform: x86_64
:BogoMIPS: 5851.51
:CPU Vendor: GenuineIntel
:CPU Model: Intel(R) Core(TM)2 Duo CPU     E7500  @ 2.93GHz
:CPU Stepping: 10
:CPU Family: 6
:CPU Model Num: 23
:Number of CPUs: 2
:CPU Speed: 2933
:System Memory: 7858
:System Swap: 7999
:Vendor: Hewlett-Packard
:System: HP Compaq 8000 Elite SFF PC
:Form factor: Low Profile Desktop
:Kernel: 3.1.5-6.fc16.x86_64
:SELinux Enabled: 1
:SELinux Policy: targeted
:SELinux Enforce: Enforcing
:MythTV Remote: Unknown
:MythTV Role: Unknown
:MythTV Theme: Unknown
:MythTV Plugin: 
:MythTV Tuner: -1
:
:
:Devices
:=================================
:(32902:14950:4156:13894) pci, uhci_hcd, USB, 82801JD/DO (ICH10 Family) USB UHCI Controller #3
:(32902:14868:4156:13894) pci, None, PCI/ISA, 82801JDO (ICH10DO) LPC Interface Controller
:(32902:11794:4156:13894) pci, i915, VIDEO, 4 Series Chipset Integrated Graphics Controller
:(32902:11795:4156:13894) pci, None, VIDEO, 4 Series Chipset Integrated Graphics Controller
:(32902:14960:4156:13894) pci, pcieport, PCI/PCI, 82801JD/DO (ICH10 Family) PCI Express Port 1
:(32902:14962:4156:13894) pci, pcieport, PCI/PCI, 82801JD/DO (ICH10 Family) PCI Express Port 2
:(32902:11796:4156:13894) pci, None, SIMPLE, 4 Series Chipset HECI Controller
:(32902:11799:4156:13894) pci, serial, 16550_SERIAL, 4 Series Chipset Serial KT Controller
:(32902:11798:4156:13894) pci, ata_generic, STORAGE, 4 Series Chipset PT IDER Controller
:(32902:4318:4156:13894) pci, e1000e, ETHERNET, 82567LM-3 Gigabit Network Connection
:(32902:14949:4156:13894) pci, uhci_hcd, USB, 82801JD/DO (ICH10 Family) USB UHCI Controller #2
:(32902:9294:4156:13894) pci, None, PCI/PCI, 82801 PCI Bridge
:(32902:14948:4156:13894) pci, uhci_hcd, USB, 82801JD/DO (ICH10 Family) USB UHCI Controller #1
:(32902:14951:4156:13894) pci, uhci_hcd, USB, 82801JD/DO (ICH10 Family) USB UHCI Controller #4
:(32902:14952:4156:13894) pci, uhci_hcd, USB, 82801JD/DO (ICH10 Family) USB UHCI Controller #5
:(32902:11792:4156:13894) pci, agpgart-intel, HOST/PCI, 4 Series Chipset DRAM Controller
:(32902:14958:4156:13894) pci, snd_hda_intel, MULTIMEDIA, 82801JD/DO (ICH10 Family) HD Audio Controller
:(32902:14850:4156:13894) pci, ahci, STORAGE, 82801JD/DO (ICH10 Family) SATA AHCI Controller
:(32902:14953:4156:13894) pci, uhci_hcd, USB, 82801JD/DO (ICH10 Family) USB UHCI Controller #6
:(32902:14954:4156:13894) pci, ehci_hcd, USB, 82801JD/DO (ICH10 Family) USB2 EHCI Controller #1
:(32902:14956:4156:13894) pci, ehci_hcd, USB, 82801JD/DO (ICH10 Family) USB2 EHCI Controller #2
:
:
:Filesystem Information
:=================================
:device mtpt type bsize frsize blocks bfree bavail file ffree favail
:-------------------------------------------------------------------
:/dev/sdb9 / btrfs 4096 4096 38400000 17529375 16685483 0 0 0
:/dev/sda5 /usr/local btrfs 4096 4096 164880128 90446574 89836336 0 0 0
:/dev/sdb5 /opt ext4 4096 4096 26083686 10747151 9442563 102016 42214 42214
:/dev/sdb7 WITHHELD btrfs 4096 4096 25600000 24824243 24229057 0 0 0
:

Comment 1 Fedora Update System 2012-01-02 12:34:33 UTC
spyder-2.1.6-1.fc16 has been submitted as an update for Fedora 16.
https://admin.fedoraproject.org/updates/spyder-2.1.6-1.fc16

Comment 2 Radek Novacek 2012-01-02 12:36:22 UTC
Sorry, the comment above belongs to a different bug.

Comment 3 Radek Novacek 2012-01-03 12:57:14 UTC
I think this is not a bug in python-mutagen. Your locale uses UTF-8 encoding, but the arguments given to mid3v2 are not UTF-8 strings. E.g. 'Sism\xf3grafo' is in cp-1252, where the byte 0xf3 is 'ó'. hpodder should encode the strings in the proper encoding based on your settings (your locale is es_ES.UTF-8).

So closing as NOT-A-BUG.
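
To illustrate the point about encodings, here is a minimal sketch in Python 3 of what the caller would need to do before passing the argument. The helper `recode_argument` and its defaults are assumptions made for this example (cp1252 input, UTF-8 locale, both taken from this report); it is not code from hpodder or mutagen.

```python
def recode_argument(raw, source_encoding="cp1252", target_encoding="utf-8"):
    """Re-encode a byte string from source_encoding to target_encoding.

    Hypothetical helper: the defaults are assumptions based on this report
    (cp1252 input, es_ES.UTF-8 locale), not on hpodder or mutagen internals.
    """
    return raw.decode(source_encoding).encode(target_encoding)


bad_arg = b"Sism\xf3grafo"            # bytes that mid3v2 received
good_arg = recode_argument(bad_arg)   # bytes a UTF-8 locale expects

# The re-encoded argument now decodes under strict UTF-8 rules.
decoded = good_arg.decode("utf-8")

print(repr(bad_arg), "->", repr(good_arg), "->", decoded)
```

In the UTF-8 form, 'ó' occupies two bytes (0xc3 0xb3) instead of the single byte 0xf3, which is why the strict UTF-8 decoder accepts it.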