This bugtracker is kept to provide history on old issues.
|Anonymous | Login | Signup for a new account||2017.12.14 18:23 CET|
|My View | View Issues | Change Log | Roadmap|
|View Issue Details|
|ID||Project||Category||View Status||Date Submitted||Last Update|
|0003218||Piwigo||synchronization||public||2015.04.09 16:51||2015.04.09 16:51|
|Target Version||Fixed in Version|
|Summary||0003218: Normalize UTF-8 filenames before synchronization|
|Description||Filenames scanned during synchronization should be UTF-8 normalized to the precomposed format using Normalizer. At least when default encoding is utf-8 and PHP 5.3 is available.|
|Additional Information||UTF-8 allows two different ways to encode umlaut special characters like the german ä, ü, ä. For technical background on that see e.g. the following:|
I upload images from my Mac to my Linux server via SSH, and some of them contain Umlaut characters. For those files to be imported, I already changed the regex to the following:
$conf['sync_chars_regex'] = '/^[\s,.\'\pL0-9-_.]+$/u';
Which works fine for files with Umlaut I create directly on the server.
On the Mac, the Umlauts ä are encoded in the decomposed format 0x61cc88. When I upload them, they keep that filename. This is no problem for Linux, the names are displayed just fine. However, PHPs regex parser does not match those decomposed characters with the \pL character group (which it should). So it would be great if those filenames would be normalized before matching.
|Tags||No tags attached.|
|Database engine and version|
|Web server||Apache 2.2.x|
|2015.04.09 16:51||mindhaq||New Issue|
|Copyright © 2000 - 2017 MantisBT Team Contact|