CDR Tickets

Issue Number 3992
Summary [Media] Questions about the Updated Media Documents Automated Email Report
Created 2015-10-22 16:12:58
Issue Type Improvement
Submitted By Juthe, Robin (NIH/NCI) [E]
Assigned To Englisch, Volker (NIH/NCI) [C]
Status Closed
Resolved 2015-11-04 13:32:25
Resolution Fixed
Path /home/bkline/backups/jira/ocecdr/issue.172525
Description

The Updated Media Documents report is sent weekly via e-mail and used to identify which images need to be added/updated in VOL.

We want to clarify the selection criteria for documents on this report. Does it only show documents that have had a new image file added and published? We are thinking this may be the case, because metadata changes (e.g., the addition of a creator, change to a caption/content description, addition of a blocked from VOL attribute) don't appear to be enough to land the document on this weekly report. Since this report is to be used to identify all images that should be added to OR updated in VOL, we would like it to include all media documents (for images) with a publishable version created in the time frame specified by the report.

As an example, the media document for the Reed-Sternberg cell image (CDR 576466) was updated to include a creator and a publishable version was created on 10/7/15. However, the weekly report that was sent on 10/9/15 for the dates 10/3/15-10/9/15 did not include this image.

We also want to clarify what the "First Pub Date" refers to. Is this is the image was first published to Cancer.gov as a part of another document (dictionary term, summary)? Or something else?

Thank you!

Comment entered 2015-10-22 16:17:08 by Juthe, Robin (NIH/NCI) [E]

Adding Margaret.

Comment entered 2015-10-22 17:22:15 by Englisch, Volker (NIH/NCI) [C]

OMG! I am feeling really embarrassed now because the problem is a result of a simple, silly mistake of mine.

It is possible I was thinking at some point we're only listing new images on the report and it was not necessary to include older ones but it's more likely I added some code to limit the output to newer media documents during testing. In fact, the selection criteria excludes everything with a CDR-ID less than 750,000.
Removing this restriction would have included the missing Reed-Sternberg cell image. This is the modified output of the select statement:
415522
537558
576466
755983
775981
775982
775983
775984
775985
775986
775987
775988
775989
775990

Aside from this error I believe the selection criteria is correct. We're looking at the date of the latest publishable version of the Media document regardless if it is links to another document and regardless if it's published to Cancer.gov or not.
We could modify a media document on DEV and run the modified report.

Comment entered 2015-11-03 18:19:28 by Englisch, Volker (NIH/NCI) [C]

I've modified the program to remove the restriction on the CDR-ID and requested CBIIT to replace the single file on PROD to correct the report. This change has been made in trunk

  • R13513: Notify_VOL.py

Comment entered 2015-11-06 12:11:30 by Englisch, Volker (NIH/NCI) [C]

The WEBTEAM ticket has been finished. We want to double-check the next few reports to confirm we're getting better results now.

Comment entered 2015-12-03 17:24:16 by Juthe, Robin (NIH/NCI) [E]

The reports look good. We're seeing what we expect to now. Thank you!

Elapsed: 0:00:00.001535