Bug 1104471

Summary: deja-dup/duplicity fails with UnicodeDecodeError when restoring backup with non UTF-8 encoding
Product: [Fedora] Fedora Reporter: Vincent Rubiolo <vincent.libre>
Component: deja-dupAssignee: Rahul Sundaram <metherid>
Status: CLOSED DUPLICATE QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: high Docs Contact:
Priority: unspecified    
Version: 20CC: fedora, jitesh.1337, kengert, metherid, mike
Target Milestone: ---   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2014-06-04 13:00:32 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
Duplicity backtrace
none
Non-UTF-8 glyphs display for deja-dup UI none

Description Vincent Rubiolo 2014-06-04 05:49:26 UTC
Created attachment 901971 [details]
Duplicity backtrace

Description of problem:
I had taken backups from Ubuntu using deja-dup before moving to Fedora 20. Some of the directories are inherited from older disks and it seems they use a different encoding (this did not show up on Ubuntu). When I try to use deja-dup to restore missing files using my backup, I can see some glyphs in their names are not properly displayed in the UI. When I proceed further, the backup starts but eventually fails with a Python backtrace about a UnicodeError (cf attached capture).

Version-Release number of selected component (if applicable):
$ duplicity --version
duplicity 0.6.24

$ deja-dup --version
deja-dup 30.0

How reproducible:
Always

Steps to Reproduce:
1. Plugin my external USB hard drive where I have the backup
2. Open nautilus and navigate to the 'Documents' folder.
3. Click on the gear icon (top-right) and select 'Restore missing files'. This opens the deja-dup UI and starts a scan of the missing files in the folder.
4. The list comes up and some folders have their glyphs replaced by a crossed box (the place where a French accent glyph would be).
5. Proceed with the restoration. It starts and goes on for some time, then eventually fails with a UnicodeDecodeError.

Actual results:
Fails to restore the backup

Expected results:
Backup restore is successful.

Additional info:
I also get Pango-related glyph errors when the UI is displayed for these folders. Excerpt from journalctl:
Jun 03 22:30:47 localhost.localdomain gnome-session[2525]: (deja-dup:7118): Pango-WARNING **: Invalid UTF-8 string passed to pango_layout_set_text()

Comment 1 Rahul Sundaram 2014-06-04 13:00:32 UTC

*** This bug has been marked as a duplicate of bug 1101727 ***

Comment 2 Vincent Rubiolo 2014-06-04 15:37:26 UTC
Created attachment 902219 [details]
Non-UTF-8 glyphs display for deja-dup UI

Thanks! Attaching UI capture for such problematic directories for the record.