CDR Tickets

Issue Number 5037
Summary [Glossary] More information links global
Created 2021-09-30 10:07:48
Issue Type Improvement
Submitted By Osei-Poku, William (NIH/NCI) [C]
Assigned To Kline, Bob (NIH/NCI) [C]
Status Closed
Resolved 2022-01-07 13:12:11
Resolution Fixed
Path /home/bkline/backups/jira/ocecdr/issue.299716
Description

I wanted to create this ticket to discuss some solutions for appending text (name of institute or name of web site) to the More Information Links in the GTCs.

We'll be adding text (in parentheses, which will either be the name of the institute or name of the web site) after Related External Ref links in the GTC document, that link to other external government web sites such as GARD and MedlinePlus (list below with examples). This will serve as a notice to users that they will be leaving the Cancer.gov site when they click on the link.

The glossary team is currently working on providing a spreadsheet with all the URLS and the corresponding web site names, in the related external ref links in the CDR that will be used to perform a global to add the website names in parenthesis in the CDR. Here is the list of web sites we currently link to. This list could be updated in the future, but it doesn't happen often.

 

 

  • Example: Chromosome (National Human Genome Research Institute)

 

Currently we can manually add the text in parenthesis in the CDR as part of the text in the related external ref elements. However, if we do that using the existing element, the text in parenthesis (names of the institute or web sites) will also be created as part of the links on Cancer.gov, which is not desirable. Ideally keeping the text in parenthesis in plain text that is not linkable is desired.

Here are a few proposed solutions we have considered.

1. Modify the publishing filters to recognize text in parenthesis, included in the related external ref element, so that they are not created as part of the links on Cancer.gov.

2. Add a new optional attribute to the related external ref element with a list of values. The value selected for each link will indicate which web site name from the list above is displayed in the CDR and on cancer.gov.

3. Have the publishing filters determine which web site name to display depending on the URL in the cdr:xref attribute of the related external ref elements.

Option 3 offers the most benefit to users as there will be no maintenance of the links/web site names and there will not be the need of a global. However, users would also want to see the names of the web sites in the CDR (option 2) so, it may not be ideal to go with just option 3 alone.

Comment entered 2021-09-30 10:50:14 by Kline, Bob (NIH/NCI) [C]

One possibility is to use an "external link" icon. 

Comment entered 2021-09-30 13:09:54 by Englisch, Volker (NIH/NCI) [C]

I thought the external link icon gets created automatically by C.gov for all non-.gov sites.

Comment entered 2021-09-30 14:41:23 by Kline, Bob (NIH/NCI) [C]

We need to find out what changes need to be made to the ElasticSearch structures for the glossary.

Comment entered 2021-09-30 14:46:30 by Kline, Bob (NIH/NCI) [C]

We will hold off on deployment to production until all of the sites have been identified and mapped.

 

William will provide a list of the sites, with identifying URL portion, attribute token, and display text for each site.

Comment entered 2021-10-01 18:24:11 by Englisch, Volker (NIH/NCI) [C]

It looks like this is the list of domains currently used:

Comment entered 2021-10-04 17:23:35 by Englisch, Volker (NIH/NCI) [C]

I created a sample of - what Margaret calls - a good solution (as opposed to the perfect solution) which would not need any changes besides the global to add the extra text to the external links:

Comment entered 2021-10-07 13:14:52 by Osei-Poku, William (NIH/NCI) [C]

Thanks for the list Volker! We will remove some of the URLs like hhs.gov and ghr.nlm.nih.gov (which is really an older URL) because they are not commonly used. There may be one or more terms linked to them so we can handle them manually. I will update on what finally decide on those.

Comment entered 2021-10-21 13:18:24 by Osei-Poku, William (NIH/NCI) [C]

More Information Links.xlsx

I have attached the draft spreadsheet containing the URLs and descriptions. I added two columns that contain sample links to the content we are linking to. The sample links contain more than just the base URL. I also added another column (UseWith) which is the attribute to indicate the language the more information link is created for in the concept document.  

We may have to use a combination of the information from the SourceKey and UseWith to come up with the list of values so that each one will be unique.

Comment entered 2021-11-30 09:51:25 by Kline, Bob (NIH/NCI) [C]

Were you going to capture the details of the requirements for the interim solution you decided to have implemented, ?

Comment entered 2021-12-09 12:11:20 by Osei-Poku, William (NIH/NCI) [C]

I think we need to discuss this again in the CDR meeting so that we are all on the same page on what we need to do next as I am not sure if we decided to hold off on implementing the desired solution of having the text in parenthesis not being linkable (just plain text) or in the interim, having the text linkable until later when we can implement the plain text in parenthesis solution.

Comment entered 2021-12-16 10:11:52 by Osei-Poku, William (NIH/NCI) [C]

It looks like we decided to go with Option 2:


2. Add a new optional attribute to the related external ref element with a list of values. The value selected for each link will indicate which web site name from the list above is displayed in the CDR and on cancer.gov.

I will create a new ticket for the schema change so that we can track only the global change in this ticket. I will provide additional information for the global.

Comment entered 2021-12-16 11:17:37 by Osei-Poku, William (NIH/NCI) [C]

More Information Links_DEC16_2021.xlsx

 

I have attached an updated spreadsheet (added a new column - BASE URL). So, for the global I will suggest that the program looks for links in the GTCs that match the provided links in the BASE URL column. If they match up to the base URL, then assign the corresponding SourceKey from the SourceKey (values) column and also append the corresponding description (in parentheses)  from the DESCRIPTION column to the information in the Related External Ref element.

Comment entered 2021-12-16 14:44:18 by Kline, Bob (NIH/NCI) [C]

I guess I was confused looking at your last two comments, because option #2 only talks about adding the new attribute and having the global change populate it, not about also having the global change also add the parenthetical description to the element's text as well. If the global is going to actually do the modification of the text content stored in the element (rather than having that happen in the publishing filter) what is the purpose of the value in the new attribute?

Comment entered 2021-12-16 15:41:32 by Osei-Poku, William (NIH/NCI) [C]

It is for the reason that users want to see the descriptions in the CDR as stated in the initial request. Also, as I suggested in the initial request, perhaps a combination of having the global modify the text content as well as having the filters use the new values to send the descriptions to cancer.gov is the way to proceed. It will be redundant and it is possible that the information in the text content may not match exactly what is on the website  but I don't think that is a problem since the most accurate information will be available on the website. Alternatively instead of modifying the text content, we could have CSS (??) display a label of the description when a SourceKey value is selected. This way, the information on the website will always match what is in the CDR.

Comment entered 2021-12-16 16:13:36 by Kline, Bob (NIH/NCI) [C]

Well, let's think for a moment about what you want to happen when we reach the real solution, beyond this interim workaround. Once that happens we'll be sending this extra information to the ElasticSearch server for the dictionary in a separate field, instead of including it the text which will be wrapped with the link markup, so that it can be displayed outside of the link markup, right? Wouldn't it be preferable to keep the information separate now, having the filter add it when we export the document, so that when the web site is ready for the solution you really want we won't have to run another global change to pull the extra information back out of the element's text content, using parsing which tries to identify that extra information by finding parentheses (and hoping it doesn't stumble on parentheses which are there for a different reason)? If we have the current global just do what option #2 originally said it was going to do, we wouldn't need a second global change at all.

Comment entered 2021-12-16 16:42:42 by Osei-Poku, William (NIH/NCI) [C]

I do not have any problems with keeping the information (description) separate. I assume that will be in new element, in which case we will need a schema change.

Comment entered 2021-12-16 17:05:27 by Kline, Bob (NIH/NCI) [C]

No, if we do what the original second option said, we wouldn't need to do anything else to the CDR documents. The interim filter change adds the parenthesized information to the link's text. The permanent solution removes that change from the filter and has the loader add the field to the ElasticSearch records, mapping the attribute values to the exported values, using the same mapping the interim filter will use. For both the interim and permanent solutions, only one global change is necessary, and when we transition to the solution you really want we don't need to do anything to the CDR documents.

I'm pretty sure Volker will be able to modify the display in XMetaL (if this would be helpful) so users can see the attribute's value without needing to bring up the attribute inspector.

Comment entered 2022-01-04 10:20:55 by Kline, Bob (NIH/NCI) [C]

: Please confirm that you understand my latest explanation, and that you agree (and if you want the enhancement to the XMetaL CSS I mentioned, please add a ticket for  to take care of that as well).

Comment entered 2022-01-05 10:44:07 by Osei-Poku, William (NIH/NCI) [C]

Hi  Yes, I understand the latest explanation and will create a ticket for  to update the XMetal CSS.

Comment entered 2022-01-06 14:10:17 by Kline, Bob (NIH/NCI) [C]

William will check with Amy about whether this ticket should be in Ohm (to accommodate the need for an additional CSS ticket).

Comment entered 2022-01-07 13:12:11 by Kline, Bob (NIH/NCI) [C]

Global change run on DEV.

https://cdr-dev.cancer.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2022-01-07_12-50-36

Have you created the ticket for  to modify the relevant filters, ?

Comment entered 2022-01-10 09:43:36 by Osei-Poku, William (NIH/NCI) [C]

I have consulted with Amy and Linda and they are OK with moving the global forward now. The other changes can then wait for the Ohm release. I assume then that OCECDR-5076 and OCECDR-5090 will need to be moved to Ohm, right?

Comment entered 2022-01-10 10:12:11 by Kline, Bob (NIH/NCI) [C]

I assume you understand that if I run the global change on production before OCECDR-5076 gets promoted, then all of the documents that I change will be invalid. So I'm guessing that by "moving the global forward now" you mean "move it forward into Ohm now," right?

Comment entered 2022-01-10 10:40:12 by Osei-Poku, William (NIH/NCI) [C]

That was what I was confused about. So, it looks like what we can implement now without a release is only OCECDR-5090. The others will need to be moved into Ohm.

Comment entered 2022-01-10 11:10:41 by Kline, Bob (NIH/NCI) [C]

I think you're still confused.

You're the one who said OCECDR-5076 will need to be moved to Ohm. If OCECDR-5076 is done in Ohm then this ticket has to be done in (or after) Ohm. Do you understand why?

Comment entered 2022-01-10 12:14:20 by Osei-Poku, William (NIH/NCI) [C]

My confusion lies in knowing what can be done in a release and what cannot. Will we be able to add the new attribute OCECDR-5076  without a release? I assumed that OCECDR-5076  cannot be done without a release because it is a schema change, and that is why the global (the current ticket (OCECDR-5037) which will add the new attribute and the XMetal CSS changes OCECDR-5084 , which will display the descriptions in XMetal will need to be done in Ohm. That leaves  the filter change  OCECDR-5090 which can be done outside of a release.  This is my understanding of the changes. Please let me know if I am missing anything.

Comment entered 2022-01-10 12:38:40 by Kline, Bob (NIH/NCI) [C]

As I reminded you last week, we can update a schema without a release (we built a web admin tool to do that a few years ago). You have decided in most (but not all) cases not to take advantage of that, but to wait instead for a release so that the change can be accompanied by enhancements to the XMetaL CSS (which do require a release) corresponding to the schema change. Ring any bells?

Comment entered 2022-01-10 14:25:51 by Osei-Poku, William (NIH/NCI) [C]

In that case we can proceed with OCECDR-5076 and the global for this ticket while the XML CSS ticket waits for a release.  OCECDR-5090  can also proceed with the filter change. I was aware that you could update the schema without a release but I thought it was not in every single case. Now I know 😃.

Comment entered 2022-01-18 18:32:37 by Osei-Poku, William (NIH/NCI) [C]

Please proceed to run the global in live mode on DEV. Thanks!

Comment entered 2022-01-25 16:11:15 by Osei-Poku, William (NIH/NCI) [C]


Have you created the ticket for  to modify the relevant filters, ?

Yes.  OCECDR-5090

Comment entered 2022-01-25 16:12:50 by Osei-Poku, William (NIH/NCI) [C]

  I do not see a comment about the live run on DEV but I think it is done. Could you please confirm?

Comment entered 2022-01-25 16:28:29 by Kline, Bob (NIH/NCI) [C]

Yes, it did finish. Sorry about that. 😛

Comment entered 2022-01-25 17:03:01 by Osei-Poku, William (NIH/NCI) [C]

No problem. Please run in test mode on QA. I assume  OCECDR-5076 will be copied to QA first before the global is ran.

Comment entered 2022-01-25 17:23:21 by Kline, Bob (NIH/NCI) [C]

I assume  OCECDR-5076 will be copied ...

Well, sure, when you add a note to that ticket indicating that it's been approved on DEV and asking for promotion to QA. The most recent "progress" comment in the ticket is my "installed on DEV" note.

Comment entered 2022-02-09 13:38:06 by Kline, Bob (NIH/NCI) [C]

Run completed.
{{   Docs examined    = 667}}
{{   Docs changed     = 0}}
{{   Versions changed = 606}}
{{   Could not lock   = 0}}
{{   Errors           = 0}}
{{   Time             = 0:56:01.039002}}

https://cdr-dev.cancer.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2022-02-09_12-37-58

Comment entered 2022-02-09 16:07:19 by Osei-Poku, William (NIH/NCI) [C]

Verified. Please run in live mode on QA.

Comment entered 2022-02-09 18:03:05 by Kline, Bob (NIH/NCI) [C]

2022-02-09 17:36:41.141 [INFO] Run completed.
{{   Docs examined    = 667}}
{{   Docs changed     = 202}}
{{   Versions changed = 420}}
{{   Could not lock   = 0}}
{{   Errors           = 0}}
{{   Time             = 1:02:26.064663}}
Specific versions saved:
{{  new cwd = 1}}
{{  new pub = 202}}
{{  new ver = 16}}
{{  old cwd = 10}}

Comment entered 2022-02-11 12:29:19 by Osei-Poku, William (NIH/NCI) [C]

Looks good on QA. Please run in test mode on PROD.

Comment entered 2022-02-14 12:54:36 by Kline, Bob (NIH/NCI) [C]

https://cdr-dev.cancer.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2022-02-14_11-43-53

 

2022-02-14 12:39:18.608 [INFO] Run completed.
{{   Docs examined    = 744}}
{{   Docs changed     = 0}}
{{   Versions changed = 612}}
{{   Could not lock   = 0}}
{{   Errors           = 0}}
{{   Time             = 0:55:24.879568}}

Comment entered 2022-02-15 14:51:22 by Osei-Poku, William (NIH/NCI) [C]

Test data verified. Please run in live mode on PROD.

Comment entered 2022-02-15 16:34:15 by Kline, Bob (NIH/NCI) [C]

2022-02-15 16:28:04.449 [INFO] Run completed.
{{   Docs examined    = 746}}
{{   Docs changed     = 204}}
{{   Versions changed = 418}}
{{   Could not lock   = 0}}
{{   Errors           = 0}}
{{   Time             = 1:04:46.393584}}
Specific versions saved:
{{  new cwd = 1}}
{{  new pub = 204}}
{{  new ver = 10}}
{{  old cwd = 11}}

Comment entered 2022-02-17 10:28:26 by Osei-Poku, William (NIH/NCI) [C]

Verified on PROD. Thanks!

Attachments
File Name Posted User
launch--blue-60v.svg 2021-09-30 10:48:40 Kline, Bob (NIH/NCI) [C]
More Information Links_DEC16_2021.xlsx 2021-12-16 11:02:29 Osei-Poku, William (NIH/NCI) [C]
More Information Links.xlsx 2021-10-21 12:47:25 Osei-Poku, William (NIH/NCI) [C]
Screen Shot 2021-10-04 at 17.08.02.png 2021-10-04 17:21:27 Englisch, Volker (NIH/NCI) [C]

Elapsed: 0:00:00.002047