CDR Tickets

Issue Number 5277
Summary Changes To This Summary Global
Created 2023-09-07 10:50:07
Issue Type Improvement
Submitted By Osei-Poku, William (NIH/NCI) [C]
Assigned To Kline, Bob (NIH/NCI) [C]
Status Closed
Resolved 2023-09-11 15:56:56
Resolution Fixed
Path /home/bkline/backups/jira/ocecdr/issue.357492
Description

Per Volker's assessment in OCECDR-5276 it looks like we will need a global to change the text of the section title from "Changes to This Summary" section to "Latest Updates to this Summary" for 

  1. HP Summaries

  2. Patient Summaries (even though this is not published to cancer.gov, it will be good to have some consistency.) 

  3. If the new text will be translated, then we have to include Spanish summaries as well. 

  4. Exclude blocked summaries but include non-publishable documents. 

I am creating this ticket to discuss the requirements for the global this afternoon in the CDR/EBMS meeting. 

As usual, it will be good to group the global results by

  1. Language

  2. Audience

  3. Then by Summary Type (not required - you can skip this one if it will get complicated)

Comment entered 2023-09-08 13:23:40 by Osei-Poku, William (NIH/NCI) [C]

This is the translated text  -   

Actualizaciones más recientes a este resumen del PDQ

Comment entered 2023-09-08 15:22:03 by Osei-Poku, William (NIH/NCI) [C]

I have just edited the translation.

Comment entered 2023-09-08 16:28:41 by Osei-Poku, William (NIH/NCI) [C]

I have revised the new wording for English per guidance from Robin. I have asked Linda to revise the Spanish translation as well.

Comment entered 2023-09-09 09:46:32 by Kline, Bob (NIH/NCI) [C]

So we're keeping "PDQ" in the Spanish title but dropping it from the English title?

Comment entered 2023-09-11 09:27:38 by Osei-Poku, William (NIH/NCI) [C]

This is the text for the Spanish translation:

Actualizaciones más recientes a este resumen

Comment entered 2023-09-11 09:29:41 by Osei-Poku, William (NIH/NCI) [C]

As I stated in my previous comment, I asked Linda to revise the Spanish translation.

Comment entered 2023-09-11 09:47:42 by Kline, Bob (NIH/NCI) [C]

Is it safe to assume that I can use the SectionType attribute value to determine which sections need to be modified?

Comment entered 2023-09-11 10:02:28 by Kline, Bob (NIH/NCI) [C]

As usual, it will be good to group the global results by ...

I'm not sure what you mean by "usual." In all the years of global change jobs, I can only think of two cases where we split the global into separate runs, and in one of those cases the two separate runs were each run for a different ticket.

As usual, the best way to get approval for non-trivial increases in the level of effort for tasks is to present the business case to the client and request that approval explicitly.

Comment entered 2023-09-11 10:06:22 by Osei-Poku, William (NIH/NCI) [C]

I believe so. From what   said, that is what he is using in the filters to identify the section.

Comment entered 2023-09-11 10:19:15 by Osei-Poku, William (NIH/NCI) [C]

I meant what we did in the last two cases. Let me know if you want me to open a separate ticket for the Spanish global.

Comment entered 2023-09-11 12:01:53 by Kline, Bob (NIH/NCI) [C]

We run the global change jobs in test mode on multiple tiers. Frequently that happens more than once on individual tiers for a given global change request, as problems are uncovered in the implementation and/or the requirements. It would not be unreasonable to request that we double the number of jobs in order to separate out English and Spanish, though it would be disingenuous to say that this practice was "usual." Exploding the number of jobs I'd need to create by a factor of 4 (to get all the combinations of language and audience) is something that to my knowledge we've never done (so "usual" would be even more of a stretch), and would cross the boundary into territory where it would be reasonable to expect that the client should be involved in the decision as to whether the additional level of effort is justified for a given global change request, based on the arguments presented in your business case. And a business case would have to be very compelling to justify multiplying the number of test runs by dozens (as would be required if we need to have a separate run for each combination of language, audience, and summary type).

It would be possible to modify the global change harness to build in support for splitting out separate runs based on summary criteria. This would require several days of development and testing, and again, you would need to present justification explaining why this work (and the additional complexity that would need to be maintained in the software) would be cost-effective. It would even be possible to extend this enhancement to make the criteria on which the runs would be split out able to cover other variables than summary type, language, and audience, and possibly other document types.

I'm perfectly happy to do any of these things. I believe I would be able to accomplish even the most extensive enhancements to the global change harness described above in under a couple of weeks' of work. All I'm asking is that you get explicit approval from the client. 😃

Comment entered 2023-09-11 13:01:43 by Osei-Poku, William (NIH/NCI) [C]

I think we should proceed with running the global change the best way possible at the moment even if it means combining all the documents in one link (result) like we did in the past. It is tedious to review the XML that way but that is OK. It will only take a longer time to review them. Also, this is just one step in the process of reviewing the documents so I don't think we should proceed with making further changes to the software. I think we have gotten used to it by now 🙂. 

When we run in live mode, it is easier to identify the different groups/types of summaries to review. 

By the way, we've discussed this issue in the past when I asked you to change the global change software (results interface) to separate the results by Language, Audience, and Summary Type and you mentioned that it will be too expensive and that it will be better to run multiple globals and that was what we did in the last two global changes, I believe (even though I believe the results were not separated by summary type). Maybe we can discuss this issue again for future global changes but at this time, I think it is OK to proceed with the global the best and less expensive approach.

Comment entered 2023-09-11 15:56:56 by Kline, Bob (NIH/NCI) [C]
Comment entered 2023-09-11 16:44:20 by Osei-Poku, William (NIH/NCI) [C]

Thanks! It looks like the dates are not being preserved. Please preserve the dates (they are in parentheses) but don't do another test run yet until I have reviewed the error and fixed the major ones.

Comment entered 2023-09-11 16:58:47 by Kline, Bob (NIH/NCI) [C]

OK. I got the impression from the original ticket that we were to replace the titles with the new strings. Would have been good to have had this twist spelled out in at least one of the tickets. I'll hold off on another run as requested until you've finished your review.

Comment entered 2023-09-12 11:15:22 by Osei-Poku, William (NIH/NCI) [C]

You're right. I should have at least mentioned that as part of the requirements. Sorry about that.

Comment entered 2023-09-12 11:26:55 by Osei-Poku, William (NIH/NCI) [C]

Could you please capitalize the first letter of "this" so it reads - "Latest Updates to This Summary". It looks like while revising it to remove PDQ from it, I accidentally left out the capitalization. Please continue to hold off on a second test run.

Comment entered 2023-09-13 13:05:58 by Osei-Poku, William (NIH/NCI) [C]

Please do another test run on DEV which should now include the corrections.  Thanks!

Comment entered 2023-09-13 16:14:25 by Kline, Bob (NIH/NCI) [C]

https://cdr-dev.cancer.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2023-09-13_14-12-36 (I hope I typed that in correctly—JIRA isn't working on the VPN, and I have to be on the VPN to do the CDR work, so I'm working on two separate laptops.) 🙁

Comment entered 2023-09-13 16:56:33 by Osei-Poku, William (NIH/NCI) [C]

Thanks!

This summary CDR0000258001 had both deletion and insertion markups in the dates, and this is the effect after the global:

-Changes to This Summary (

+ Latest Updates to This Summary (09/02/202202/23/2023)

 

This is another example with markup: Summary CDR0000257995, and the effect of the global is

-Changes to This Summary

+ Latest Updates to This Summary (10/06/2022)(04/06/2023)

Comment entered 2023-09-13 17:22:04 by Kline, Bob (NIH/NCI) [C]

I recommend that you fix those by hand. It would be virtually impossible to anticipate all the possible combinations of nested revision markup.

Comment entered 2023-09-14 07:39:46 by Kline, Bob (NIH/NCI) [C]

If you know for sure what the exact original text is that you want me to replace, it's possible I may be able to come up with a solution which will work around the problems introduced by the revision markup (as long as that text itself isn't inside revision markup).

Comment entered 2023-09-14 08:24:35 by Osei-Poku, William (NIH/NCI) [C]

Sure. I think we can fix these manually since we will have to go into the records and remove the markup at some point. Would it be possible to get an ad hoc query that identifies all summaries with markup in the Changes Section?

Comment entered 2023-09-14 09:25:29 by Kline, Bob (NIH/NCI) [C]

The query_term table doesn't support such a query. You'll need a separate ticket for a one-off report which will parse all of the summary documents to find which summaries have revision markup in the changes section.

Comment entered 2023-09-14 10:07:01 by Osei-Poku, William (NIH/NCI) [C]

Sure. I will create a new ticket for the one-off report.  Is it possible for the global to skip summaries that have markup in the Changes Section so that users will update it themselves when accepting/rejecting the markup?

Comment entered 2023-09-14 13:13:31 by Kline, Bob (NIH/NCI) [C]

We can run the one-off report and then feed the results into the global change script so it can skip the summaries identified by the report.

Comment entered 2023-09-18 16:28:47 by Kline, Bob (NIH/NCI) [C]
Comment entered 2023-09-19 16:23:31 by Osei-Poku, William (NIH/NCI) [C]

Looks good. Please run in live mode on DEV. Thanks!

Comment entered 2023-09-20 11:43:37 by Kline, Bob (NIH/NCI) [C]

The live run on CDR DEV has completed.

Comment entered 2023-09-22 09:23:21 by Osei-Poku, William (NIH/NCI) [C]

Looks good on DEV. Please run in test mode on QA. Thanks!

Comment entered 2023-09-22 18:52:09 by Kline, Bob (NIH/NCI) [C]

(They've just upgraded JIRA, so I'm going to give threading another try to see if it's been fixed.)

https://cdr-qa.cancer.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2023-09-22_13-20-54

Comment entered 2023-09-25 13:53:56 by Osei-Poku, William (NIH/NCI) [C]

Please run in live mode on QA.

Comment entered 2023-09-25 16:28:08 by Kline, Bob (NIH/NCI) [C]

Done.

Comment entered 2023-09-27 08:22:59 by Osei-Poku, William (NIH/NCI) [C]

Please run in test mode on PROD. Thanks!

Comment entered 2023-09-27 11:11:37 by Kline, Bob (NIH/NCI) [C]
Comment entered 2023-10-11 08:48:34 by Kline, Bob (NIH/NCI) [C]

I'm holding off on the live-mode production run until instructed to proceed.

Comment entered 2023-10-13 16:41:21 by Osei-Poku, William (NIH/NCI) [C]

Please proceed to run the global.

Comment entered 2023-10-17 18:04:15 by Osei-Poku, William (NIH/NCI) [C]

I think we can close this ticket, but I am going to leave it open until after tomorrow's meeting just in case Robin and or Victoria have any feedback.

Comment entered 2023-10-17 20:48:38 by Kline, Bob (NIH/NCI) [C]

I assume you're referring to Thursday's status meeting, not tomorrow's PDQ standup (at which neither Robin nor Victoria would be present).

Comment entered 2023-10-19 16:18:37 by Osei-Poku, William (NIH/NCI) [C]

We agreed in today's CDR meeting to close this ticket. Thank you!

Attachments
File Name Posted User
ocecdr-5277-prod.log 2023-10-13 20:56:18 Kline, Bob (NIH/NCI) [C]

Elapsed: 0:00:00.001954