PDQ Issues

Issue Number	3606
Summary	[Glossary] Interface to generate spreadsheet of terms needing audio pronunciations
Created	2013-05-09 17:17:56
Issue Type	Improvement
Submitted By	Juthe, Robin (NIH/NCI) [E]
Assigned To	Kline, Bob (NIH/NCI) [C]
Status	Closed
Resolved	2015-01-14 08:56:31
Resolution	Fixed
Path	/home/bkline/backups/jira/ocecdr/issue.107934

Description

BZISSUE::5305
BZDATETIME::2013-05-09 17:17:56
BZCREATOR::Robin Juthe
BZASSIGNEE::Bob Kline
BZQACONTACT::William Osei-Poku

We would like CIAT to be able to generate the spreadsheet of terms that need audio pronunciations that Bob currently prepares each time we are ready to send Vanessa another batch. This is related to OCECDR-3373 (automating the media doc creation) and OCECDR-3237 (the original bug for audio pronunciations) but we thought it was worth splitting this up into separate issues. Made this P5 after discussing the LOE with Bob.

Comment entered 2014-11-03 15:14:25 by Kline, Bob (NIH/NCI) [C]

I'm looking at implementing this while the clinical trials search redesign project is on hold. The approach I have in mind is to replace the sweep through the spreadsheets which were generated in the past with a check of a new table which records when a glossary term was added to a spreadsheet sent to Vanessa. That table would have columns for:

the glossary term name document's CDR ID
the name of the spreadsheet it was included in
the date the spreadsheet was created

I don't see any holes in this approach, as far as the production system goes, though I can imagine some edge cases where testing could be confusing on the lower tiers. Imagine, for example, that you create a new glossary term name document on a lower tier for testing, and by chance the CDR ID assigned to that new document just happens to match the ID assigned earlier on the production system for a glossary term name document which was included on a spreadsheet which actually got sent to Vanessa. In that case, the tester could be puzzled by the fact that the new glossary term name document created on the lower tier wasn't included on the spreadsheet generated on that tier for testing. As long as we're aware of that (very unlikely) possibility, we should be OK.

Steps for implementation:

Write the SQL for creating the new table
Run the script on DEV
Write the script to populate the table from the spreadsheets already sent to Vanessa
Run that script on DEV
Write the CGI script to generate a fresh spreadsheet and add the corresponding rows to the new table
Have CIAT test the new script
Install and populate the new table on QA
Install the CGI script on QA
Have CIAT test the new script
Deploy data and script to PROD (as part of a future release)

See any flaws in this approach?

Comment entered 2014-11-03 15:20:40 by Juthe, Robin (NIH/NCI) [E]

Adding William to this issue. William, could you please go over this and/or discuss with Amy and see if the above looks reasonable? Thanks.

Comment entered 2014-11-03 15:25:39 by Osei-Poku, William (NIH/NCI) [C]

Sure. Will do.

Comment entered 2014-11-04 15:06:56 by Osei-Poku, William (NIH/NCI) [C]

Your approach looks good to us. We didn't find any problems with it.

Comment entered 2014-11-05 06:44:32 by Kline, Bob (NIH/NCI) [C]

Made some good progress on this task:

✔ Write the SQL for creating the new table
✔ Run the script on DEV
✔ Write the script to populate the table
✔ Run that script on DEV
✔ Write the CGI script to generate a fresh spreadsheet

So now we're ready for the next steps:

☐ Have CIAT test the new script on DEV
☐ Install and populate the new table on QA
☐ Install the CGI script on QA
☐ Have CIAT test the new script
☐ Deploy data and script to PROD (as part of a future release)

The tool has been plugged into the CIAT admin menu. In order to test again after running it the first time, you'll need to add more glossary term name documents with pronunciations.

/*
 * Record of having asked for pronunciation audio clips for the names
 * found in a particular GlossaryTermName document.
 *
 *      cdr_id   primary key for the GlossaryTermName document
 * spreadsheet   name of the spreadsheet in which the request was included
 *    requeted   the date the request was generated
 */
CREATE TABLE glossary_term_audio_request
     (cdr_id INTEGER      NOT NULL PRIMARY KEY REFERENCES all_docs,
 spreadsheet VARCHAR(256) NOT NULL,
   requested DATE         NOT NULL)

Comment entered 2014-11-05 07:17:32 by Kline, Bob (NIH/NCI) [C]

You must be in a group with REVIEW TERM AUDIO permission to generate the spreadsheet (right now that includes users in the Developers and Glossary Maintainers groups, at least on DEV).

Comment entered 2014-11-05 13:28:16 by Osei-Poku, William (NIH/NCI) [C]

I get a python script error when I click on the link. It is probably related to OCECDR-3823.

A problem occurred in a Python script.

D:\cdr\Log\tmpkxszsy.html contains the description of this error.

Comment entered 2014-11-05 14:11:40 by Kline, Bob (NIH/NCI) [C]

Please try again. I had omitted the module which is needed for checking the permission mentioned in my previous comment.

Comment entered 2014-11-05 14:20:08 by Osei-Poku, William (NIH/NCI) [C]

I am getting the same error message even after logging out of XMetal and logging back in.

A problem occurred in a Python script.

D:\cdr\Log\tmpeuiuy6.html contains the description of this error

Comment entered 2014-11-05 14:29:38 by Kline, Bob (NIH/NCI) [C]

Not quite the same error, but in the same general area. Please try again. No need to log out of XMetaL.

Comment entered 2014-11-05 14:34:35 by Osei-Poku, William (NIH/NCI) [C]

It worked. Thanks!

Comment entered 2014-11-13 11:41:53 by Osei-Poku, William (NIH/NCI) [C]

I am no longer able to generate new spreadsheets. I get "You are not authorized to generate this spreadsheet" error message.

Comment entered 2014-11-14 08:30:53 by Kline, Bob (NIH/NCI) [C]

You need to be a member of a group which has the AUDIO IMPORT permission. Doesn't look like any of the groups have that permission.

NCIWS-D141-V-M:D:\Inetpub\wwwroot\cgi-bin\cdr>cdrsql
SQL> SELECT g.name
   >   FROM grp g
   >   JOIN grp_action ga
   >     ON ga.grp = g.id
   >   JOIN action a
   >     ON ga.action = a.id
   >  WHERE a.name = 'AUDIO IMPORT';
name
-------------------------------------------
SQL>

Comment entered 2014-11-14 12:01:56 by Osei-Poku, William (NIH/NCI) [C]

Thanks! I fixed the error and I am now able to run the report.

Comment entered 2014-11-18 15:37:49 by Osei-Poku, William (NIH/NCI) [C]

The report is running but it is no longer retrieving new data. I've added a few new terms with the pronunciation keys but they are not being picked up by the report.

Comment entered 2014-11-18 15:52:06 by Kline, Bob (NIH/NCI) [C]

The software only picks up terms that have been published. You may need to wait until that has happened (or add pronunciation keys to documents which didn't have them, and don't have pronunciation Media links, but have been published).

Comment entered 2014-11-20 11:29:11 by Osei-Poku, William (NIH/NCI) [C]

Verified on DEV.

Comment entered 2015-01-14 08:56:31 by Kline, Bob (NIH/NCI) [C]

This is ready for testing on QA.

Comment entered 2015-01-21 14:42:36 by Osei-Poku, William (NIH/NCI) [C]

Verified on QA

Comment entered 2015-02-16 09:26:55 by Osei-Poku, William (NIH/NCI) [C]

I am getting a Python script error on Stage:

A problem occurred in a Python script.

D:\cdr\Log\tmpzn_7u6.html contains the description of this error.

Comment entered 2015-02-16 10:25:33 by Kline, Bob (NIH/NCI) [C]

The deployment to stage didn't include the steps identified in the November 5 comment.

I will put in a ticket for the database team to add and populate the new table. Can you tell me if we can hold off on generating any new spreadsheets for Vanessa until this tool is in production? If so, I will ask them to add and populate the table on prod now as well.

Comment entered 2015-02-16 10:31:21 by Osei-Poku, William (NIH/NCI) [C]

Yes, we can hold off on generating spreadsheets for Vanessa until this is in Production.

Comment entered 2015-02-16 10:49:47 by Kline, Bob (NIH/NCI) [C]

https://tracker.nci.nih.gov/browse/DBATEAM-1590 submitted. I'll let you know when the table is ready.

Comment entered 2015-02-17 14:46:28 by Kline, Bob (NIH/NCI) [C]

The table has been created and populated on STAGE. You should be able to test now.

Comment entered 2015-02-17 15:25:51 by Kline, Bob (NIH/NCI) [C]

The table has also been created and populated on PROD, so we'll be all set once tonight's deployment rolls out.

Comment entered 2015-02-17 15:39:06 by Osei-Poku, William (NIH/NCI) [C]

Verified on STAGE. Thank you!

Comment entered 2015-04-02 13:11:57 by Osei-Poku, William (NIH/NCI) [C]

Verified on PROD.

Elapsed: 0:00:00.000756

CDR Tickets