Issue Number | 4903 |
---|---|
Summary | Glossary Terms with diacritics need to be sorted properly |
Created | 2020-09-24 22:29:19 |
Issue Type | Bug |
Submitted By | Englisch, Volker (NIH/NCI) [C] |
Assigned To | Kline, Bob (NIH/NCI) [C] |
Status | Closed |
Resolved | 2020-09-25 09:27:53 |
Resolution | Fixed |
Path | /home/bkline/backups/jira/ocecdr/issue.275536 |
Currently, the A-Z list for the dictionary of cancer terms sorts every term starting with one of the characters A-Z under the appropriate letter, and every term that does not start with an ASCII letter under "#". That group should be reserved for terms beginning with a number or a special character such as an opening parenthesis or opening bracket.
This especially affects the Spanish terms, many of which start with an accented character: they are sorted into the "#" group rather than their respective letter group.
This isn't really a bug, as the requirements have changed. The
original requirements explicitly stated that every term whose name did
not begin with a character which, when lowercased, matched the regular
expression `[a-z]` was to be displayed under `#`. The current
implementation matches those requirements. The new requirements call
for mapping letters with diacritics to ASCII letters before matching
the regular expression.
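As an illustration only, the new rule could be sketched in Python roughly as follows. This is not the code that was deployed. The function name `first_letter` is hypothetical (it borrows the name of the database value mentioned later in this ticket), and the use of Unicode NFD decomposition to strip diacritics is an assumption about one reasonable way to do the mapping.

```python
import re
import unicodedata

# The original requirement: only a lowercased ASCII letter gets its own group.
ASCII_LETTER = re.compile(r"[a-z]")


def first_letter(term_name: str) -> str:
    """Return the A-Z group for a term name, or "#" if no letter applies.

    Hypothetical helper, not the production implementation.
    """
    if not term_name:
        return "#"
    # NFD splits a precomposed character such as U+00E1 into its base
    # letter "a" plus U+0301 (combining acute accent).
    decomposed = unicodedata.normalize("NFD", term_name[0])
    # Drop the combining marks so only the base character remains.
    base = "".join(c for c in decomposed if not unicodedata.combining(c))
    letter = base[:1].lower()
    # Apply the original regular expression to the mapped character.
    return letter if ASCII_LETTER.fullmatch(letter) else "#"
```

With this sketch, a Spanish term beginning with "á" or "Ñ" lands in the "a" or "n" group, while a term beginning with a digit or an opening parenthesis still lands under "#". Letters that have no canonical decomposition (for example "ø") would also stay under "#" and would need an explicit mapping table if they ever occur.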
Changed requirements implemented on DEV.
I can see the correct "first_letter" character in the database for the Spanish terms starting with a diacritic. The first letter entry was specified as "#" before but is now the corresponding ASCII character.
~volker can this be moved to TASK REVIEWED (DEV)?
Yes, looks good on DEV.
The terms starting with diacritics (Spanish terms) are correctly sorted on QA.
Confirmed on PROD.