CDR Tickets

Issue Number 4547
Summary Modify URL Check report to identify section containing the URL
Created 2018-11-05 16:30:44
Issue Type Improvement
Submitted By Osei-Poku, William (NIH/NCI) [C]
Assigned To Kline, Bob (NIH/NCI) [C]
Status Closed
Resolved 2019-07-02 18:15:08
Resolution Fixed
Path /home/bkline/backups/jira/ocecdr/issue.235564
Description

We would like to explore the possibility of having the URL check report display the Summary Section containing the URLs that are retrieved in the results. If possible, displaying part of the text before and after the URL will also be helpful.

Comment entered 2019-06-20 18:05:25 by Kline, Bob (NIH/NCI) [C]

Just Cancer Information Summaries or Drug Information Summaries, too?

Comment entered 2019-06-20 18:40:01 by Kline, Bob (NIH/NCI) [C]

Also, the Broken URLs report or the Page Title Mismatches report or both?

I can see that this is a much more extensive task than was originally estimated. In fact, it will involve a separate new report (or two, if the answer to the question immediately above is "both"). The existing report is able to get everything it needs from the query_term table, but the new requirement will have to load and parse each of the summary documents, which will dramatically increase the time to generate the report. It's possible I might be able to avoid parsing the summary documents if I only need the section title and not the surrounding text (and you didn't say whether you need it to be the top-level section title or the immediately enclosing section title). In that case I might be able to keep the LOE number down to a 20 or even possibly a 13.
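
To illustrate the difference between the top-level section title and the immediately enclosing section title, here is a rough sketch of what parsing a summary document for this purpose could look like. It is not the code behind the report. The sample document is made up, and the element and attribute names (SummarySection, Title, ExternalRef, xref) as well as the function name are assumptions chosen for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified summary document. The element and
# attribute names are assumptions for illustration only.
SAMPLE = """\
<Summary>
  <SummarySection>
    <Title>Treatment Options</Title>
    <Para>See <ExternalRef xref="https://example.org/a">site A</ExternalRef>.</Para>
    <SummarySection>
      <Title>Surgery</Title>
      <Para>See <ExternalRef xref="https://example.org/b">site B</ExternalRef>.</Para>
    </SummarySection>
  </SummarySection>
</Summary>
"""


def section_titles_for_urls(xml_text):
    """Return (url, top_level_title, enclosing_title) tuples in document order."""
    results = []

    def walk(node, titles):
        # `titles` holds the section titles from outermost to innermost
        # for the sections enclosing the current node.
        if node.tag == "SummarySection":
            title = node.findtext("Title", default="").strip()
            titles = titles + [title]
        if node.tag == "ExternalRef":
            top = titles[0] if titles else None
            enclosing = titles[-1] if titles else None
            results.append((node.get("xref"), top, enclosing))
        for child in node:
            walk(child, titles)

    walk(ET.fromstring(xml_text), [])
    return results


for url, top, enclosing in section_titles_for_urls(SAMPLE):
    print(url, "| top-level:", top, "| enclosing:", enclosing)
```

For a link in a nested section the two titles differ (here "Treatment Options" versus "Surgery"), which is why the choice matters for the report.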

Comment entered 2019-06-20 18:57:21 by Kline, Bob (NIH/NCI) [C]

Assigned to William for requirements clarification.

Comment entered 2019-06-20 21:12:59 by Osei-Poku, William (NIH/NCI) [C]

Cancer Information Summaries alone should be fine. The DIS(s) are not as long as the CIS(s), and the request did come specifically for the CIS(s).

Comment entered 2019-06-20 21:33:53 by Osei-Poku, William (NIH/NCI) [C]

 Also, the Broken URLs report or the Page Title Mismatches report or both?

The Broken URLs report only. We use the Title Mismatches report to check which page titles have changed.

Comment entered 2019-06-20 21:34:38 by Osei-Poku, William (NIH/NCI) [C]

re-assigned back to

Comment entered 2019-06-27 10:55:31 by Kline, Bob (NIH/NCI) [C]

You only answered one of my questions, .

Comment entered 2019-06-27 11:10:40 by Osei-Poku, William (NIH/NCI) [C]

I assume this is the question you're referring to:

 

(and you didn't say whether you need it to be the top-level section title or the immediately enclosing section title) 

 

It should be the immediately enclosing section title.

Comment entered 2019-06-27 11:25:58 by Kline, Bob (NIH/NCI) [C]

OK, I'm going to assume you don't need the report to display the surrounding text.

Comment entered 2019-06-27 11:28:22 by Osei-Poku, William (NIH/NCI) [C]

That is right. We just need the section title.  Thanks!

Comment entered 2019-07-02 18:15:08 by Kline, Bob (NIH/NCI) [C]

Enhancement implemented on DEV.

https://github.com/NCIOCPL/cdr-lib/commit/17f4a16

Comment entered 2019-07-03 12:23:10 by Osei-Poku, William (NIH/NCI) [C]

Verified on DEV. Thanks!

Comment entered 2019-08-05 18:41:42 by Osei-Poku, William (NIH/NCI) [C]

Verified on QA. Thanks!

Comment entered 2019-09-06 15:22:42 by Osei-Poku, William (NIH/NCI) [C]

Verified on PROD. Thanks!
