CDR Tickets

Issue Number 5324
Summary Automate alternate glossary term stage names
Created 2024-04-08 15:27:51
Issue Type Task
Submitted By Kline, Bob (NIH/NCI) [C]
Assigned To Kline, Bob (NIH/NCI) [C]
Status Closed
Resolved 2024-06-27 14:24:37
Resolution Won't Fix
Path /home/bkline/backups/jira/ocecdr/issue.430898
Description

This is a followup for report issue OCECDR-5318. Create software to automate as much of the work as possible for modifying the CDR glossary's stage terms to support finding stages using the alternate Arabic numerals for the stage names.

Please post your notes to this ticket. I'm going to need a place to post my own comments, and I didn't want to hold off any longer, lest I risk losing some of the information. 😉

Comment entered 2024-04-08 16:12:27 by Kline, Bob (NIH/NCI) [C]

Here are the test run results for a first cut at a global change for a two-phase approach.

https://cdr-dev.cancer.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2024-04-08_15-19-33

The first phase is this global change to plug in placeholders in the DefinitionText elements of the concept documents and insert the corresponding ReplacementText elements into the existing name documents.

The second phase would be a script which creates the new name documents for the Arabic number alternate stage names, cloned from the original Roman-numeral versions. I modified the original specifications, which called for finding all of the terms whose names begin with "stage" except for those which begin with "stage 0" because when I used the original logic I ended up with stages which had already been created with Arabic numbers. Basically I'm getting all of the glossary terms which start with "Stage I" or "Stage V" for the English name.

I posted a question via email last week asking if any consideration had been given to just creating the new Arabic-version name documents without altering the definitions, on the theory that this would achieve the original goal (as I understand it) of supporting searches looking for stages with the Arabic numbers rather than the standard Roman numerals, but I haven't gotten any responses yet..

Comment entered 2024-04-11 10:20:48 by Osei-Poku, William (NIH/NCI) [C]

We have a few recommendations for cloning the glossary term names. They are nice to have, so if they cannot be done or if LOE is high, please ignore them.

Please exclude the following elements/blocks from the cloned documents if possible:

  1. Comments

  2. Date Last Modified

  3. Existing Processing Statuses

Please add the following block if possible:

  1. Processing Statuses with a Processing Status Value of “ Ready for publishing”

 

If possible, use system date at the time of cloning for:

  1. Term Name Status Date

  2. Translated Name Status Date

Comment entered 2024-04-11 10:46:23 by Kline, Bob (NIH/NCI) [C]

None of these should be difficult.

Comment entered 2024-06-27 14:24:37 by Juthe, Robin (NIH/NCI) [E]

We have decided not to add these alternate stage names. Google is correctly serving up the Roman numeral content even when an Arabic term is used in the search query, so this is not needed for SEO reasons.

Elapsed: 0:00:00.001943