Bug report #6327
non-ASCII character corruption in shapefile
|Affected QGIS version:||master||Regression?:||No|
|Operating System:||Easy fix?:||No|
|Pull Request or Patch supplied:||No||Resolution:||fixed|
|Crashes QGIS or corrupts data:||No||Copied to github as #:||15614|
- Create a shapefile, choosing UTF8 and adding a text attribute
- Create a feature and fill the text attribute with non-ASCII text (for example 汉字)
- Save the shapefile.
The text of the attribute is replaced by "??"
This does not happen when loading an existing UTF8 shapefile, which can even be modified and saved without corruption. "Save as", however, is also affected.
I use QGIS master and the latest GDAL build from SVN. I am using Arch Linux with an en_US.UTF-8 locale.
I am aware of the previous issues between QGIS and GDAL, but Alexander Bruy told me they were mostly solved now and this problem looks different, hence this new bug report.
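For illustration only, here is a minimal Python sketch of one way a two-character string such as 汉字 can end up as "??". It assumes that the text is converted to a single-byte codec somewhere on the write path, with unrepresentable characters replaced. ISO-8859-1 is used as a stand-in codec. Nothing in this report confirms that this is what QGIS or GDAL actually does, and the snippet uses no QGIS or GDAL code.

```python
# Illustration only: this is not QGIS or GDAL code.
text = "汉字"

# Encoding to UTF-8 keeps every character, so the round trip is lossless.
utf8_bytes = text.encode("utf-8")
round_trip = utf8_bytes.decode("utf-8")

# Assumption: the string is converted to a single-byte codec before being
# written (ISO-8859-1 is a stand-in here). Each character the codec cannot
# represent is replaced by a single "?".
lossy_bytes = text.encode("iso-8859-1", errors="replace")
lossy_text = lossy_bytes.decode("iso-8859-1")

print(round_trip)
print(lossy_text)
```

If the symptom has this kind of cause, the original characters are gone from the written file and cannot be recovered by reopening it with a different encoding.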
#3 Updated by Leyan Ouyang about 10 years ago
This is the "Ignore shapefile encoding" option I mentioned in the first comment, and it does seem to solve the issue. However, in my opinion the bug is not fixed, as the default setting will lead to loss of data. This option should be checked by default until the issues with GDAL are sorted out.