CDR Tickets

Issue Number 3237
Summary [Media] Adding Audio Pronunciations to Glossary Documents
Created 2010-10-04 16:45:21
Issue Type Improvement
Submitted By Juthe, Robin (NIH/NCI) [E]
Assigned To Kline, Bob (NIH/NCI) [C]
Status Closed
Resolved 2013-09-20 13:43:44
Resolution Fixed
Path /home/bkline/backups/jira/ocecdr/issue.107565
Description

BZISSUE::4926
BZDATETIME::2010-10-04 16:45:21
BZCREATOR::Robin Juthe
BZASSIGNEE::Bob Kline
BZQACONTACT::Margaret Beckwith

This is an umbrella issue for the project to add audio pronunciations to Glossary Term Name documents for both English and Spanish glossary term name strings. This will be a place to discuss ongoing work on this and record decisions, but it is expected that there will be several smaller and more specific issues to address various components of this project.

The pronunciation audio files will be received as .wav files, and there will be a separate file for each English term and for each Spanish term. There will be roughly 14,000 audio files (7,000 each in English and Spanish), so these files will need to be programmatically added to the CDR (likely stored in new media documents for each term) and then linked to the appropriate Glossary Term Name documents.

One of the first steps will be to create a spreadsheet of all of the glossary term names currently in the CDR with their phonetic pronunciations and a place to store information about the audio pronunciation recordings for each term. This will be addressed in a separate issue.

I will post a couple sample audio pronunciation files for our glossary terms once we have them available.

Comment entered 2010-10-04 17:33:53 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2010-10-04 17:33:53
BZCOMMENTOR::Robin Juthe
BZCOMMENT::1

Adding link to related issue about creating a spreadsheet of term names and pronunciations

Comment entered 2010-10-07 09:14:50 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2010-10-07 09:14:50
BZCOMMENTOR::Bob Kline
BZCOMMENT::2

If we're really going to use WAV files instead of MP3s we'll need to enhance the DLL because the code which populates the duration element currently only works for MP3 files.

Comment entered 2010-10-07 09:26:19 by Beckwith, Margaret (NIH/NCI) [E]

BZDATETIME::2010-10-07 09:26:19
BZCOMMENTOR::Margaret Beckwith
BZCOMMENT::3

I think we need to talk to the person who will be doing the recordings, which hasn't been completely decided. I am waiting to hear some samples of terms from our dictionary before deciding. I will ask Debbie to find out if she could provide the audio files as MP3.

Comment entered 2010-10-15 14:23:39 by Beckwith, Margaret (NIH/NCI) [E]

BZDATETIME::2010-10-15 14:23:39
BZCOMMENTOR::Margaret Beckwith
BZCOMMENT::4

Trying to attach recordings of dictionary terms in three different file formats using link that vendor provided.

Comment entered 2010-10-15 14:53:24 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2010-10-15 14:53:24
BZCOMMENTOR::Bob Kline
BZCOMMENT::5

The attachment was just the URL, which will be easier to follow if I just enter it in the comment here:

http://www.vrvoice.com/VR_Pronunciations.zip

Comment entered 2010-10-15 14:57:07 by Beckwith, Margaret (NIH/NCI) [E]

BZDATETIME::2010-10-15 14:57:07
BZCOMMENTOR::Margaret Beckwith
BZCOMMENT::6

That makes sense. Thanks.

Comment entered 2010-10-15 14:59:04 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2010-10-15 14:59:04
BZCOMMENTOR::Bob Kline
BZCOMMENT::7

I can't detect any difference in quality among the three formats.

Comment entered 2010-10-15 16:35:09 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2010-10-15 16:35:09
BZCOMMENTOR::Volker Englisch
BZCOMMENT::8

(In reply to comment #7)
> I can't detect any difference in quality among the three formats.

I agree with that statement.

Comment entered 2010-10-28 12:19:12 by Beckwith, Margaret (NIH/NCI) [E]

BZDATETIME::2010-10-28 12:19:12
BZCOMMENTOR::Margaret Beckwith
BZCOMMENT::9

Just thought I would add an update here. CIAT is working on getting the subcontracting agreement with VRVoice in place to do these recordings. Then we will have a meeting with VR to discuss details in terms of format, the spreadsheet, and timing.

In terms of the three formats she sent, I was not able to get the OGG format to work, I just got a bunch of crazy characters that looked like what you get with a corrupted file. May just be my computer or something weird that I am doing with the file. The other two sounded fine.

Comment entered 2010-11-22 16:59:26 by Beckwith, Margaret (NIH/NCI) [E]

BZDATETIME::2010-11-22 16:59:26
BZCOMMENTOR::Margaret Beckwith
BZCOMMENT::10

Notes from our last CDR meeting:

1. We will use an MP3 file format for the audio files.
2. We will ask Vanessa to put the path to the file in the spreadsheet, using relative paths.
3. We will ask her to load a zipped file to the ftp server.
4. We will ask her to do a small subset (~50 terms) to test the process.
5. We would like to do sets of terms English and corresponding Spanish rather than all of the English first and then all of the Spanish.
6. For ongoing maintenance, CIAT will need to work with Vanessa to get the audio files and create the media docs in the CDR; there may be a way to automate some of this process.

There is a meeting with Vanessa scheduled for Tues. Nov. 23 from 1-2.

Comment entered 2010-11-22 17:02:46 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2010-11-22 17:02:46
BZCOMMENTOR::Bob Kline
BZCOMMENT::11

One more thing we discussed: we wanted to make sure that the sample of 50 files included files in more than one directory, unless she plans to store all of the files in the same directory for the final deliverable (probably not a good idea).

Comment entered 2010-12-07 14:44:52 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2010-12-07 14:44:52
BZCOMMENTOR::Bob Kline
BZCOMMENT::12

By good fortune, the random sample picked up some interesting cases. For one thing, two of the 50 terms had no Spanish name at all (CDR491302: diarylsulfonylurea, and CDR44012: conditioned stimulus). Also, it appears that the approach taken for "breast cancer" (the example Linda gave in the meeting today of a term for which an English term name is associated with more than one Spanish name) was not used for CDR446564 ("cáncer de seno (mama) en estadio temprano") where the two term names are bundled into the same Spanish name element. Finally, although it's usually possible to tell the brand names from the common names by the lowercasing of the latter, for some reason the Spanish "Ablación" is capped (CDR335061).

I pulled these from Bach, even though when we test the loading procedure we'll be using Mahler (or possibly Franck), which means that some might not load (because the corresponding name documents aren't there, or don't match. I thought this was better than giving Vanessa information which was out of sync with the production data, possibly introducing unnecessary confusion.

Comment entered 2010-12-07 14:44:52 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926.xls has been added with description: Random sample of 50 terms for testing the process

Comment entered 2010-12-09 13:59:20 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2010-12-09 13:59:20
BZCOMMENTOR::Bob Kline
BZCOMMENT::13

Need to create a new sample spreadsheet using only published term names which have at least one Spanish name.

Comment entered 2010-12-14 09:41:01 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2010-12-14 09:41:01
BZCOMMENTOR::Bob Kline
BZCOMMENT::14

Comment entered 2010-12-14 09:41:01 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926.xls has been added with description: Sample using published terms with at least one Spanish name

Comment entered 2010-12-15 16:54:45 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2010-12-15 16:54:45
BZCOMMENTOR::Robin Juthe
BZCOMMENT::15

This spreadsheet looks fine to me for the trial set. Thanks.

I just thought I would comment here that if all goes well with the trial batch, then the next task will be to create a spreadsheet of all English terms that have pronunciation keys with their corresponding Spanish translations, and a report of everything else. The first report will be given to Vanessa to work from (she estimated she would get through ~500 terms/week) and the second report will be used by CIAT to identify which terms do not have pronunciation keys such that they can be added if appropriate (not all terms will need them, e.g., most acronyms). Batches of terms with newly acquired pronunciation keys will be sent to Vanessa thereafter.

Comment entered 2010-12-16 13:28:44 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2010-12-16 13:28:44
BZCOMMENTOR::Bob Kline
BZCOMMENT::16

This issue covers the loading of the initial set of audio files as media documents linked from the glossary name docs (probably in multiple batches). It has not been decided whether any automated support will be implemented for updates to the pronunciation documents, or for creating audio files for new names in the future. At least at first new audio documents (and replacements for existing documents after the initial loading is complete) will be done in XMetaL as is true for other audio Media documents (e.g., meeting minutes). If automation for this process is desired at some point in the future, a separate issue will be created.

Comment entered 2010-12-30 10:01:38 by Beckwith, Margaret (NIH/NCI) [E]

BZDATETIME::2010-12-30 10:01:38
BZCOMMENTOR::Margaret Beckwith
BZCOMMENT::17

Just wondering if the test spreadsheet got sent to Vanessa?

Comment entered 2010-12-30 11:04:39 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2010-12-30 11:04:39
BZCOMMENTOR::Bob Kline
BZCOMMENT::18

(In reply to comment #17)
> Just wondering if the test spreadsheet got sent to Vanessa?

Not by me. It seems likely that Robin sent it after she reviewed it.

Comment entered 2011-01-03 09:11:08 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-01-03 09:11:08
BZCOMMENTOR::Robin Juthe
BZCOMMENT::19

Margaret, I have not sent the test spreadsheet to Vanessa. Let me know if you'd like me to.

Comment entered 2011-01-03 12:07:21 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-01-03 12:07:21
BZCOMMENTOR::Robin Juthe
BZCOMMENT::20

I sorted the spreadsheet by CDR ID and sent it to Vanessa.

Comment entered 2011-01-05 16:18:22 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-01-05 16:18:22
BZCOMMENTOR::Robin Juthe
BZCOMMENT::21

Vanessa has uploaded the test files and spreadsheet to the FTP site. I have pasted her email explaining the file naming conventions below. I have not had a chance to review the files yet.

Begin email....

I have just uploaded the Zip'd file named: "NCI_Terms_010511.zip" to the Audio folder.
Inside this folder are the 100 terms, saved individually, as well as the spreadsheet with the individual filenames listed.

The procedure for naming the folders is: NCi_Terms_ followed by the date uploaded (in this case 010511)

The procedure for naming the files is:
CDRid#
followed by an underscore: _
then either "En" or "SP" to verify whether English or Spanish
followed by the date uploaded (month, day, year - always will be 6 numbers)

If there is a revision, it will be saved the same way, (with final 6 numbers telling which was done last) and if there are ever additional Spanish words for the same term, the date will set it apart and, if it is noted that it is an additional word, the letters a (or b or c - depending on how many additional words added) will be added to the end of the CDRid#.

Don't hesitate to let me know if you have any questions or if you need anything!

Thanks again,
Vanessa

...End email

Comment entered 2011-01-05 16:49:07 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-01-05 16:49:07
BZCOMMENTOR::Volker Englisch
BZCOMMENT::22

(In reply to comment #21)
> The procedure for naming the files is:
> CDRid# followed by an underscore: _
> then either "En" or "SP" to verify whether English or Spanish

Should we be using the 2 letter ISO language codes instead since we're using those everywhere else in the CDR: EN, ES?

Comment entered 2011-01-13 11:04:13 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-01-13 11:04:13
BZCOMMENTOR::Robin Juthe
BZCOMMENT::23

I'm posting the body of an email sent to Vanessa today with requests for another upload of the files taking into consideration the issues discussed in our last CDR meeting:

"Thanks again for uploading the audio files. We have reviewed them and they sound great! We do have a few technical requests from our programmers though in order to simplify the automation of future uploads:

1. You are listing the file names in the spreadsheet without the file extension *.mp3. Could you please list the filename exactly as they are stored?

2. For the distinction between the English and Spanish version you've used the letters 'En' and 'Sp'. Within our systems we are using standard ISO language codes and we would like you to use those for the filenames as well: English - en, Spanish – es

3. You have chosen to list all of the audio files in a flat file structure, meaning that all of the files are sitting in the same directory as your spreadsheet. Although this is perfectly acceptable, it might be difficult for you to keep track of the individual files once you are dealing with several thousand of those in the future. It might also be more challenging to create ZIP files for incremental updates because when creating a ZIP file you would need to identify the files to include individually instead of selecting just one folder holding many files.
When we were talking about this at our meeting we had envisioned a file structure similar to this:

\my_project\my_spreadsheet
\my_project\my_spreadsheet\A\44488_en_010511.mp3
\my_project\my_spreadsheet\A\44488_es_010511.mp3
...
\my_project\my_spreadsheet\F\45419_en_010511.mp3
\my_project\my_spreadsheet\F\45419_es_010511.mp3
...
\my_project\my_spreadsheet\E\45676_en_010511.mp3
\my_project\my_spreadsheet\E\45676_es_010511.mp3

Instead of using the alphabet for splitting the big pile of documents into smaller portions you could also pick other criteria like the CDR-ID grouped by thousands, the last digit of the CDR-ID, the file creation date, language, etc.

For a directory structure given in the sample above, the filename entries in the spreadsheet would then become:

A\44488_en_010511.mp3
A\44488_es_010511.mp3
...
F\45419_en_010511.mp3
F\45419_es_010511.mp3
...
E\45676_en_010511.mp3
E\45676_es_010511.mp3

Could you please create a new ZIP file for us taken the above points into consideration and upload to our FTP server again? We’d like to make sure we have this process down before we proceed with the other thousands of files!"

PS. Thanks to Volker for writing up these issues!!

Comment entered 2011-01-18 14:57:16 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-01-18 14:57:16
BZCOMMENTOR::Volker Englisch
BZCOMMENT::24

I talked to Vanessa today to discuss the questions she had. She is planning to create another ZIP file and upload it to the FTP server by the end of tomorrow.

Comment entered 2011-01-19 15:07:32 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-01-19 15:07:32
BZCOMMENTOR::Volker Englisch
BZCOMMENT::25

I've copied the new ZIP file that Vanessa had send to the group folder
L:\OCE_Cross\CDR

The file and content of the spreadsheet looks good to me. The only thing I would change is probably to use two digits to specify the week (04 instead of 4).
Note: The current week would actually be Week 03 because Week 01 starts
with Jan 3rd. I incorrectly told Vanessa we're in Week 04 right now.

The time stamp that's part of the file name (week_4/44488_en_010511.mp3) is most likely a leftover from the original set of files Vanessa had send which would not be necessary if the files are stored in folders by week. The file names would then be listed in the spreadsheet as
Week_04/NNNNNN_en.mp3
or even
W04/NNNNNN_en.mp3

Comment entered 2011-01-26 15:26:32 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-01-26 15:26:32
BZCOMMENTOR::Bob Kline
BZCOMMENT::26

Is this what you had in mind for the Media documents? Notice that I've used "to be specified" as the category. That's a valid value in the schema for that element, but you might want to consider adding a value that actually matches the nature of the MP3 files, like maybe "glossary pronunciation." Please check over the sample below. Is everything accurate (e.g., audience)? Have I left out any optional elements or attributes you want included?

<Media>
<MediaTitle>Pronunciation of 'vaccinia CEA vaccine'</MediaTitle>
<PhysicalMedia>
<SoundData>
<SoundType>Speech</SoundType>
<SoundEncoding>MP3</SoundEncoding>
<RunSeconds>2</RunSeconds>
</SoundData>
</PhysicalMedia>
<MediaSource>
<OriginalSource>
<Creator>Vanessa Richardson, VR Voice</Creator>
<DateCreated>2011-01-05</DateCreated>
<SourceFilename>week_4/46034_en_010511.mp3</SourceFilename>
</OriginalSource>
</MediaSource>
<MediaContent>
<Categories>
<Category>to be specified</Category>
</Categories>
<ContentDescriptions>
<ContentDescription audience='Patient' language='en'>Pronunciation of dictionary term 'vaccinia CEA vaccine'</ContentDescription>
</ContentDescriptions>
</MediaContent>
</Media>

Comment entered 2011-01-26 15:52:46 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-01-26 15:52:46
BZCOMMENTOR::Volker Englisch
BZCOMMENT::27

(In reply to comment #26)
> <Media>
> <MediaTitle>Pronunciation of 'vaccinia CEA vaccine'</MediaTitle>

I'm wondering if we really want the MediaTitle to be "Pronunciation of ..." for all of the documents? We don't have our images listed as "Image of abdominal ultrasound" and it would be more convenient for the users if they were searching for a string and can find both, the image and the audio file at the same time.
Maybe we should just use the name for the title, change our title-filter and add another indicator like the ImageType/SoundType or Encoding to distinguish an image from a pronunciation file:

  • colonoscopy;JPEG

  • colonoscopy;MP3
    or something similar.

Comment entered 2011-01-27 12:05:04 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-01-27 12:05:04
BZCOMMENTOR::Robin Juthe
BZCOMMENT::28

(In reply to comments #26 and #27)

I think the schema looks good. I agree - we need to add a category for these documents. Would just "pronunciation" be sufficient?

I also think it could be helpful to use the processing status elements for these documents. We would have to add more appropriate values to the picklist, but it could be used to indicate whether the pronunciation has been reviewed by CIAT and/or revised. We don't necessarily have to add a processing status at the time of importing the file, but it could be added manually as the documents are reviewed/revised later.

I agree with Volker's comment about the title of the document. Maybe the title filter could be changed to also show the category too. Then, althought tangential to this issue, we also wouldn't have to repeat "Meeting Recording" in the title of audio files of Board meetings. For example:

  • colonoscopy; pronunciation; mp3

  • colonoscopy; test or procedure; jpeg
    [- Adult treatment 04-15-10; meeting recording; mp3]

Comment entered 2011-02-07 11:47:42 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-07 11:47:42
BZCOMMENTOR::Bob Kline
BZCOMMENT::29

(In reply to comment #28)

> I think the schema looks good. I agree - we need to add a category for these
> documents. Would just "pronunciation" be sufficient?

I've added that value to the schema on Franck in preparation for testing.

Comment entered 2011-02-07 14:15:49 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-02-07 14:15:49
BZCOMMENTOR::Robin Juthe
BZCOMMENT::30

Margaret and I reviewed the schema (including this new category) on Franck and it looks fine. We noticed a minor thing - that "Meeting Recording" is capitalized and the other category values are lowercase. Could you make meeting recording begin with two lowercase letters so that we are consistent? Thanks.

We also decided to hold off on adding any processing status values for right now, though we do think it will be helpful to add values specific to this project once we get a bit further in the process. We'll let CIAT determine how the processing status information will be most useful for them and propose values.

Comment entered 2011-02-07 14:37:38 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-07 14:37:38
BZCOMMENTOR::Bob Kline
BZCOMMENT::31

Lowercased 'meeting recording' on Franck (and in version control archives).

Comment entered 2011-02-07 15:03:34 by Beckwith, Margaret (NIH/NCI) [E]

BZDATETIME::2011-02-07 15:03:34
BZCOMMENTOR::Margaret Beckwith
BZCOMMENT::32

Hi Bob. Just wondering how long you think the testing of the spreadsheet will take? We got an email from Vanessa politely wanting to know when she could expect the large spreadsheet so she could plan her recording efforts, and I wasn't sure what to tell her. Do you think it's possible we could be ready to send her something by the end of the week or is that pushing it? Thanks!

Comment entered 2011-02-07 15:07:55 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-07 15:07:55
BZCOMMENTOR::Bob Kline
BZCOMMENT::33

That's perfectly reasonable. I'm loading the documents on Franck right now.

Comment entered 2011-02-07 15:09:10 by Beckwith, Margaret (NIH/NCI) [E]

BZDATETIME::2011-02-07 15:09:10
BZCOMMENTOR::Margaret Beckwith
BZCOMMENT::34

Groovy!

Comment entered 2011-02-08 10:27:48 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-08 10:27:48
BZCOMMENTOR::Bob Kline
BZCOMMENT::35

I've created the Media documents on Franck and run a test global on the
glossary name documents. After CIAT has had a change to check the test
documents I'll run the job live on Franck.

http://franck.nci.nih.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2011-02-08_10-09-47

Vanessa does a very good job with the pronunciations. Odd, though, that
"predominio" comes out sounding as if it were "predomino" in CDR694751.

Comment entered 2011-02-09 16:34:27 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-02-09 16:34:27
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::36

(In reply to comment #35)
> I've created the Media documents on Franck and run a test global on the
> glossary name documents. After CIAT has had a change to check the test
> documents I'll run the job live on Franck.
>
> http://franck.nci.nih.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2011->02-08_10-09-47

I looked at the test results and didn't find any problems. Please run in live mode on Franck.

Comment entered 2011-02-09 16:41:31 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-09 16:41:31
BZCOMMENTOR::Bob Kline
BZCOMMENT::37

Live run on Franck completed.

Comment entered 2011-02-10 09:19:11 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-02-10 09:19:11
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::38

(In reply to comment #35)
> I've created the Media documents on Franck and run a test global on the
> glossary name documents. After CIAT has had a change to check the test
> documents I'll run the job live on Franck.
>
> http://franck.nci.nih.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2011-02-08_10-09-47
>
> Vanessa does a very good job with the pronunciations. Odd, though, that
> "predominio" comes out sounding as if it were "predomino" in CDR694751.

I asked Linda and Amy to listen to some of the pronunciations. Even though I asked Linda specifically to listen to the term "predominio", I did not tell her about what Bob thought about the pronunciation of the term. This is what she had to say:

"I have listened to the Spanish pronunciation and two words are not pronounced correctly, Hodgkin and predominio. " ..... There is one particular word, Hodgkin, in this term that is used a lot in the dictionary and I want to make sure that it’s pronounce correctly and also go over some other terms."

Comment entered 2011-02-10 10:06:59 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-02-10 10:06:59
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::39

(In reply to comment #37)
> Live run on Franck completed.

I have looked at several of the documents on Franck and they all look good to me.

Comment entered 2011-02-10 10:19:51 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-02-10 10:19:51
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::40

Linda found a few more terms that she thought were not pronounced well. They are yellow-highlighted in the attached spreadsheet.

Comment entered 2011-02-10 10:19:51 by Osei-Poku, William (NIH/NCI) [C]

Attachment Test Spreadsheet 50 Terms for VR.XLS has been added with description: Spanish terms pronunciation

Comment entered 2011-02-10 10:42:05 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-02-10 10:42:05
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::41

When searching for media documents that have same title for both Spanish and English using CDR > Search, they are displayed with no indication of which one is English or Spanish. Examples: "vorinostat" and "edema". Is it possible to differentiate one from the other in the search results? Users have gone around this limitation by usually appending the Spanish media titles with "-Spanish".

Comment entered 2011-02-10 10:45:24 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-10 10:45:24
BZCOMMENTOR::Bob Kline
BZCOMMENT::42

You can always add an issue to have the title filter modified, along the lines of Robin's suggestion in comment #28.

Comment entered 2011-02-18 15:43:45 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-02-18 15:43:45
BZCOMMENTOR::Robin Juthe
BZCOMMENT::43

Maybe this suggestion is coming too late in the game ❓, but I think it would be worthwhile using the "Proposed Use" element (with the "Glossary" child element) in the media document to link to the appropriate Glossary term name document. Right now, there's not an easy way to get from the audio pronunciation to the glossary term name.

Comment entered 2011-02-18 16:44:12 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-18 16:44:12
BZCOMMENTOR::Bob Kline
BZCOMMENT::44

(In reply to comment #43)
> Maybe this suggestion is coming too late in the game ❓, but I think it would
> be worthwhile using the "Proposed Use" element (with the "Glossary" child
> element) in the media document to link to the appropriate Glossary term name
> document. Right now, there's not an easy way to get from the audio
> pronunciation to the glossary term name.

We can do that. The sample Media documents are already created on Franck, so we'd either need to refresh Franck or delete the ones I've created, whichever you prefer.

Comment entered 2011-02-18 17:01:08 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-02-18 17:01:08
BZCOMMENTOR::Robin Juthe
BZCOMMENT::45

(In reply to comment #44)
> The sample Media documents are already created on Franck, so
> we'd either need to refresh Franck or delete the ones I've created, whichever
> you prefer.

Whichever is easiest. We just have to be sure we check with the rest of the team if they need to save anything from Franck before a refresh (which I'm sure you'd do anyway), if that's the easier option. Thanks.

Comment entered 2011-02-18 17:17:06 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-18 17:17:06
BZCOMMENTOR::Bob Kline
BZCOMMENT::46

Before I proceed, are there any other enhancements anyone needs to get into the process?

Comment entered 2011-02-25 11:56:08 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-25 11:56:08
BZCOMMENTOR::Bob Kline
BZCOMMENT::47

(In reply to comment #42)
> You can always add an issue to have the title filter modified, along the lines
> of Robin's suggestion in comment #28.

I'm working on rolling back the test results on Franck so I can repeat with Robin's enhancement folded in (and again when I get the corrected set from Vanessa). What have you decided you're going to do about the titles?

Comment entered 2011-02-25 13:36:09 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-02-25 13:36:09
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::48

(In reply to comment #47)
> (In reply to comment #42)
> > You can always add an issue to have the title filter modified, along the lines
> > of Robin's suggestion in comment #28.
>
> I'm working on rolling back the test results on Franck so I can repeat with
> Robin's enhancement folded in (and again when I get the corrected set from
> Vanessa). What have you decided you're going to do about the titles?

The Spanish media document should retain the English term name with the "-Spanish" string after the term name.

Comment entered 2011-02-25 16:14:40 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-02-25 16:14:40
BZCOMMENTOR::Bob Kline
BZCOMMENT::49

I deleted the Media documents from the previous test, and rolled back the term name documents to the pre-test versions. Then I implemented Robin's enhancement (add links from the Media documents to the term name docs) and modified the MediaTitle element to match William's requested change. Then I ran another test. The results are ready for review (on Franck).

Comment entered 2011-02-28 12:10:35 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-02-28 12:10:35
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::50

I reviewed several of the documents and they all looked good to me.

Comment entered 2011-03-14 08:55:23 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-03-14 08:55:23
BZCOMMENTOR::Robin Juthe
BZCOMMENT::51

I put together a rough step-by-step process for the initial 14,000+ audio pronunciations, based on recent discussions in our status meetings. It would be great if others (Bob, Volker, William?) could take a look at this and amend it as appropriate. Thanks.

Comment entered 2011-03-14 08:55:23 by Juthe, Robin (NIH/NCI) [E]

Attachment Glossary Audio Pronunciations.doc has been added with description: Audio Pronunciation Process for Initial Set

Comment entered 2011-03-14 09:09:06 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-03-14 09:09:06
BZCOMMENTOR::Bob Kline
BZCOMMENT::52

I think we said that the spreadsheet I made for Vanessa would exclude the ones she's already done, right? Also, in the first step, I assume you want separate workbooks, not just separate sheets, right? Have we captured the most recent set of column headers to be put at the top of the generated sheets?

Comment entered 2011-03-14 09:16:10 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-03-14 09:16:10
BZCOMMENTOR::Robin Juthe
BZCOMMENT::53

(In reply to comment #52)
> I think we said that the spreadsheet I made for Vanessa would exclude the ones
> she's already done, right?

Yes.

> Also, in the first step, I assume you want separate
> workbooks, not just separate sheets, right?

Yes - separate workbooks/files.

> Have we captured the most recent
> set of column headers to be put at the top of the generated sheets?

The column headers are as follows:

Term Name
Language
Pronunciation
Filename
Notes (Vanessa)
Approved?
Notes (NCI)

Thanks Bob.

Comment entered 2011-03-15 08:35:04 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-03-15 08:35:04
BZCOMMENTOR::Bob Kline
BZCOMMENT::54

(In reply to comment #53)

> The column headers are as follows:
>
> Term Name
> Language
> Pronunciation
> Filename
> Notes (Vanessa)
> Approved?
> Notes (NCI)

Why are we dropping the CDR ID column? I don't think I can process the audio files correctly without that information.

Comment entered 2011-03-15 08:38:08 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-03-15 08:38:08
BZCOMMENTOR::Robin Juthe
BZCOMMENT::55

(In reply to comment #54)
> Why are we dropping the CDR ID column? I don't think I can process the audio
> files correctly without that information.

Sorry - I didn't intend to drop the CDR ID. That should remain to the left of the Term Name column.

Comment entered 2011-03-15 09:52:37 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-03-15 09:52:37
BZCOMMENTOR::Bob Kline
BZCOMMENT::56

Comment entered 2011-03-15 09:52:37 by Kline, Bob (NIH/NCI) [C]

Attachment A.xls has been added with description: Docs with English pronunciations

Comment entered 2011-03-15 09:53:12 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-03-15 09:53:12
BZCOMMENTOR::Bob Kline
BZCOMMENT::57

Comment entered 2011-03-15 09:53:12 by Kline, Bob (NIH/NCI) [C]

Attachment B.xls has been added with description: Docs lacking English pronunciations

Comment entered 2011-03-15 11:34:09 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-03-15 11:34:09
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::58

I have provided Amy and Linda with the two spreadsheets and they are also aware of all the steps that CIAT needs to do. However, it doesn't look like they are completely satisfied with the 50 test pronunciations yet so it doesn’t look like we are ready for takeoff yet.

Margaret-
Amy and Linda will email you their findings.

Comment entered 2011-03-16 11:07:13 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-03-16 11:07:13
BZCOMMENTOR::Volker Englisch
BZCOMMENT::59

(In reply to comment #51)
> It would be great if others (Bob, Volker, William?) could take a look

I have two remarks:
a) You're saying "Use 'Images Download'-like function to save files on BACH.
Does CIAT have access to the files on BACH or will CIAT pick up the files
directly from CIPSFTP?
If I remember correctly, the Images-Download moves the images on the FTP
server from the upload directory to a directory for CIAT and copies the
files from the FTP server to BACH.
Do we want to use the same process?
b) You mentioned to save the spreadsheet with the revised pronunciations as
"WeekXXRevised". I would probably rather have the file named
"WeekXXRevisionN" with N = 1, 2, 3 just in case we need to have revisions
of the revised audio files.

Bob: Do you want me to copy the files to BACH as ZIP files, copy and unzip
them or are you going to go to the FTP server to retrieve the audio
files yourself?

I don't think step 18 is necessary. I thought we would publish the audio files as they become available (provided Cancer.gov is ready to process them).

Comment entered 2011-03-16 14:33:50 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-03-16 14:33:50
BZCOMMENTOR::Bob Kline
BZCOMMENT::60

(In reply to comment #59)

> Bob: Do you want me to copy the files to BACH as ZIP files, copy and unzip
> them or are you going to go to the FTP server to retrieve the audio
> files yourself?

I plan to pull the audio files and spreadsheet out of the zip archives directly instead of getting them from the file system. I could write the utility to pull the files directly from the FTP server but I'd rather decouple that step from the rest of the processing for a number of reasons. First, we said we wanted to be able to copy the files from the FTP server to the CDR server so that subsequent uploads by the translator would not overwrite earlier versions of the same files without our knowledge/approval. The timing for when we want that capture to happen may not coincide with when we're ready to perform the import (in fact, since the sequence of events will be (a) Vanessa uploads to the FTP server; (b) we copy to the CDR Server; (c) CIAT reviews the pronunciations; (d) lather, rinse, repeat as necessary; (e) import and link to Media files; it's almost a certainty that the timing will not match). Second, I want to implement the utility with options which will allow it to be invoked in such a way that instead of doing the actual importing and linking it reports on what it would import, including which version of a pronunciation was used when a set of multiple archive zipfiles are being processed. We wouldn't want to retrieve the same files from the FTP server repeatedly for performance reasons (in addition to the concerns about the translator overwriting a zipfile on the FTP server). So I'm going to leave it to you to copy the ZIP archives to Bach. Let's come up with a standard place, and I'll let you automate the copy as much as you like (Python's standard library has an excellent ftplib module), including safeguards against inadvertently overwriting an existing archive.

The proposed syntax for invoking the utility will be:

ImportGlossaryPronunciations [-r] uid pwd zip1 [zip2 [... zipN]]

where:
uid is the CDR User ID for the account used for the import
pwd is the password for that CDR user account
the remaining arguments are the paths (relative or absolute) for the
ZIP archives used for the import job, with audio files from archives
named later in the argument list taking precedence over those for
the same glossary name string/language combination in a given
CDR GlossaryTermName document

Let me know if you see any drawbacks to this proposed approach.

Comment entered 2011-03-17 10:09:24 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-03-17 10:09:24
BZCOMMENTOR::Robin Juthe
BZCOMMENT::61

(In reply to comment #59)
> a) You're saying "Use 'Images Download'-like function to save files on BACH.
> Does CIAT have access to the files on BACH or will CIAT pick up the files
> directly from CIPSFTP?
> If I remember correctly, the Images-Download moves the images on the FTP
> server from the upload directory to a directory for CIAT and copies the
> files from the FTP server to BACH.
> Do we want to use the same process?

Let's talk about this one in today's meeting. I think it would be the same process.

> b) You mentioned to save the spreadsheet with the revised pronunciations as
> "WeekXXRevised". I would probably rather have the file named
> "WeekXXRevisionN" with N = 1, 2, 3 just in case we need to have revisions
> of the revised audio files.

Good idea.

> I don't think step 18 is necessary. I thought we would publish the audio files
> as they become available (provided Cancer.gov is ready to process them).

I don't believe this has been decided. If Cancer.gov is ready before we have all of the audio files (which is likely to be the case), we may want to have a given # of them done before they are pushed (e.g., half of the files).

I will update the process steps after today's meeting.

Comment entered 2011-03-23 16:27:22 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-03-23 16:27:22
BZCOMMENTOR::Robin Juthe
BZCOMMENT::62

I updated the notes about process to include comments from Bob and Volker and our meeting last week. Additions are in red.

Comment entered 2011-03-23 16:27:22 by Juthe, Robin (NIH/NCI) [E]

Attachment Glossary Audio Pronunciations_V2.doc has been added with description: Audio Pronunciation Process for Initial Set - version 2

Comment entered 2011-03-24 11:00:23 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-03-24 11:00:23
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::63

Amy and Linda have completed reviewing the test records and have indicated that they are now satisfied with the results.

Comment entered 2011-04-04 15:25:44 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-04-04 15:25:44
BZCOMMENTOR::Bob Kline
BZCOMMENT::64

Since I don't see any objections to the proposed approach in comment #60, I'm planning to implement what I've proposed there. The latest zip-files on the FTP server are Week_09.zip and Week_12.zip. I was expecting the later archive to have the corrected pronunciations for the subset which had been rejected, but it looks like they both have the same number of MP3 files. Am I wandering down the wrong path?

Also, now that we have decided to have Alan build the more sophisticated review tool for the audio files, using the database to record the review decisions, I need to know whether it's expected that the import tool will check the decisions he has recorded, or just assume that the contents of the archives (and the order in which they are specified when the import tool is invoked) are correct.

Comment entered 2011-04-04 16:05:15 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-04-04 16:05:15
BZCOMMENTOR::Volker Englisch
BZCOMMENT::65

(In reply to comment #64)
> Am I wandering down the wrong path?

No. I looked at the zip file myself and noticed the same thing but after looking at the email we've send out to Vanessa we had actually said that we wanted to "start over" and receive the complete test data again.

We are still going to request - for the live updates - to have the revisions submitted in a separate file.

The revised audio files are easily identified by the time stamp in the new test file. Would you want to get a "revisions-only" file for testing? I could just extract and re-zip the revised audio files.

Also, I noticed that one of the newly revised audio files is corrupted. Its not an empty file but the size is much smaller than all of the audio files.
I thought I'd mention it so that you'd know what to expect.

Comment entered 2011-04-06 15:08:27 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-04-06 15:08:27
BZCOMMENTOR::Bob Kline
BZCOMMENT::66

(In reply to comment #60)

> I want to implement the utility with options which will allow it to be
> invoked in such a way that instead of doing the actual importing and
> linking it reports on what it would import, ....

I added another option flag to the syntax for this tool: -h behaves like -r in that it asks for a report instead of an actual import processing job, but the report output is in HTML to make it easier for users to review. Here's the output when running it with the new flag against Week_09.zip and Week_12.zip:

http://bach.nci.nih.gov/audio-import-report.html

Please take a look, noting that there are a couple of problems with the latest set from Vanessa.

The report only notes overrides of pronunciation files from earlier sets when the bytes are different.

Comment entered 2011-04-07 13:14:41 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-04-07 13:14:41
BZCOMMENTOR::Bob Kline
BZCOMMENT::67

[Decided at status meeting:]

Add more checks:

  • same file name in spreadsheet more than once

  • same cdr ID/name/language in sheet more than once

Comment entered 2011-04-11 13:48:32 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-04-11 13:48:32
BZCOMMENTOR::Bob Kline
BZCOMMENT::68

(In reply to comment #67)
> [Decided at status meeting:]
>
> Add more checks:
> * same file name in spreadsheet more than once
> * same cdr ID/name/language in sheet more than once

Done. Vanessa will be asked to submit a zip file which corrects the two most recently reported problems (wrong file name and corrupt .mp3 file). I'll run another report on the combination of the three zip files.

Comment entered 2011-04-15 08:55:12 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-04-15 08:55:12
BZCOMMENTOR::Robin Juthe
BZCOMMENT::69

(In reply to comment #68)
> Vanessa will be asked to submit a zip file which corrects the two most
> recently reported problems (wrong file name and corrupt .mp3 file). I'll run
> another report on the combination of the three zip files.

Just posting an update to say that Vanessa sent these two files, but the spreadsheet contains errors in the filename column. Volker plans to call Vanessa to review the problems and offer up two possible solutions:

  • we leave the file name column blank when we send her a spreadsheet for revised terms, and she fills it in with the new directory information

  • we pre-populate the filename column with the file path we designate (Alan's program could specify this path based on the number of revisions a particular set of terms has gone through)

Last night, she uploaded the first batch of 235 pronunciations and plans to upload weekly from here on out, with the pace quickening after May 20th.

Comment entered 2011-04-25 14:23:28 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-04-25 14:23:28
BZCOMMENTOR::Bob Kline
BZCOMMENT::70

Are we ready for me to test the first batch on Franck? Which server are the zip files on for that batch, and what's the sequence order in which they should be fed to the import tool (later zipfiles overriding earlier ones)?

Comment entered 2011-05-05 10:50:51 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-05-05 10:50:51
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::71

Amy just informed me that the review of Week 16 is complete: Here is her email below for details:

>Hi William,
>Linda and I have completed our QC for week 16, which includes Rev1 and Rev2. >Things are good to go on our end.
>Please let me know if you have any questions.

>Thanks,
>Amy

Comment entered 2011-05-12 12:50:16 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-05-12 12:50:16
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::72

Review of Week 17 has been completed.

>Hi William,

>The audio review for week 17 is now complete; this batch was entirely reviewed >via the new CDR audio tool.

>Please let me know if you have any questions.

>Thanks,
>Amy

Comment entered 2011-05-12 16:18:47 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-12 16:18:47
BZCOMMENTOR::Bob Kline
BZCOMMENT::73

(In reply to comment #70)
> Are we ready for me to test the first batch on Franck? Which server are the
> zip files on for that batch, and what's the sequence order in which they should
> be fed to the import tool (later zipfiles overriding earlier ones)?

Here's Volker's email response from May 3 (when Bugzilla was unavailable):

Bob,

I think I was supposed to give you a set of audio files for you to test the load process on FRANCK.
I've copied the files
Week_015.zip,
Week_015_Rev1.zip, and
Week_015_Rev2.zip
to the Audio_from_CIPSFTP directory.

There are 235 audio files in these file.

Comment entered 2011-05-12 16:33:50 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-05-12 16:33:50
BZCOMMENTOR::Volker Englisch
BZCOMMENT::74

I've copied Week_016 (plus revisions) to FRANCK for you to test with as well.

Comment entered 2011-05-13 15:48:49 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-05-13 15:48:49
BZCOMMENTOR::Robin Juthe
BZCOMMENT::75

I think most of these items are covered above in this issue, but in case not, we wanted to summarize which elements we think should be populated in the media docs when Bob uploads the audio files.

1. MediaTitle- this will be the English term name for all English audio pronunciations, and it will be the English term name followed by “-Spanish” for any Spanish term pronunciations. Examples: “cancer”; “cancer-Spanish”

2. Physical Media/Sound Data (Sound Type, Sound Encoding, Run all will be populated)

3. Media Source/Creator – Vanessa Richardson

4. Media Source/Date – the date the file was created (or the date of the spreadsheet, whichever you can get.)

5. Media Source/Filename – the name of the audio file

6. Media Content/Category – Pronunciation

7. Media Content/Content Description – Pronunciation of dictionary term “X”.

Note for Discussion: We typically place language and audience attributes on the content description for other media documents (images and meeting recordings), so we may want to add them here as well. (Patient and Eng or Spn). Not sure if that would be useful. Should we instead use the Media Language element? This is not generally used for illustrations (to my knowledge).

8. Proposed use/Glossary – will include a link to the glossary term name document.

Comment entered 2011-05-16 14:15:19 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-16 14:15:19
BZCOMMENTOR::Bob Kline
BZCOMMENT::76

(In reply to comment #75)
> I think most of these items are covered above in this issue, but in case not,
> we wanted to summarize which elements we think should be populated in the media
> docs when Bob uploads the audio files.
>
> 1. MediaTitle- this will be the English term name for all English audio
> pronunciations, and it will be the English term name followed by “-Spanish” for
> any Spanish term pronunciations. Examples: “cancer”; “cancer-Spanish”

Right. I wonder if it would be more appropriate to use a dash, which would be the correct way to append such information to multi-word terms (avoiding the implication which the hyphen carries that the appended information pertains only to the preceding word); so "breast cancer--Spanish"?

>
> 2. Physical Media/Sound Data (Sound Type, Sound Encoding, Run all will be
> populated)

Right. The names are slightly different, but I know what you mean.

>
> 3. Media Source/Creator – Vanessa Richardson

I had been using what's in the spreadsheet if available or "Vanessa Richardson, VR Voice" if the cell is empty for that row. Should I just hard-code "Vanessa Richardson" (without her company)?

>
> 4. Media Source/Date – the date the file was created (or the date of the
> spreadsheet, whichever you can get.)

I'm using the date of the file.

>
> 5. Media Source/Filename – the name of the audio file

I'm using the path from the spreadsheet.

>
> 6. Media Content/Category – Pronunciation

Lowercased, matching the schema.

>
> 7. Media Content/Content Description – Pronunciation of dictionary term “X”.
>
> Note for Discussion: We typically place language and audience attributes on the
> content description for other media documents (images and meeting recordings),
> so we may want to add them here as well. (Patient and Eng or Spn). Not sure if
> that would be useful. Should we instead use the Media Language element? This is
> not generally used for illustrations (to my knowledge).

I'm using "Patients" and "en" or "es" but perhaps that's not right (the description is always in English, after all).

>
> 8. Proposed use/Glossary – will include a link to the glossary term name
> document.

Right.

Comment entered 2011-05-16 14:22:41 by alan

BZDATETIME::2011-05-16 14:22:41
BZCOMMENTOR::Alan Meyer
BZCOMMENT::77

(In reply to comment #76)
...
> > 1. MediaTitle- this will be the English term name for all English audio
> > pronunciations, and it will be the English term name followed by “-Spanish” for
> > any Spanish term pronunciations. Examples: “cancer”; “cancer-Spanish”
>
> Right. I wonder if it would be more appropriate to use a dash, which would be
> the correct way to append such information to multi-word terms (avoiding the
> implication which the hyphen carries that the appended information pertains
> only to the preceding word); so "breast cancer--Spanish"?
...

The double hyphen has the added advantage of being clearly
distinguishable from hyphens that appear in the technical terms.

Comment entered 2011-05-16 14:30:01 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-05-16 14:30:01
BZCOMMENTOR::Volker Englisch
BZCOMMENT::78

(In reply to comment #77)
> The double hyphen has the added advantage of being clearly
> distinguishable from hyphens that appear in the technical terms.

I like the "--" or maybe " - " better, too. Unfortunately, we're already using the "-Spanish" appendix for Media documents.
As usual, I'd like to be consistent.

Comment entered 2011-05-17 09:57:10 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-17 09:57:10
BZCOMMENTOR::Bob Kline
BZCOMMENT::79

Even though this was a test run the media documents are actually created, so there will be something for the name documents to link to. Only a couple of anomalies in the log file: one name had changed, and one row in one of the worksheets (other than the header row) had no ID in the first column. Not all of the questions from the most recent comments have been answered, so for example, I still have Vanessa's company in the Creator element. Please examine both the media documents and the modifications to the name documents carefully.

http://franck.nci.nih.gov/cgi-bin/cdr/ShowGlobalChangeTestResults.py?dir=2011-05-17_09-38-51

I assume this is a later set than the original 50, but it was the earliest I found on Franck.

Comment entered 2011-05-17 09:57:10 by Kline, Bob (NIH/NCI) [C]

Attachment Request4926-testrun.log has been added with description: Log from test run on Franck

Comment entered 2011-05-17 11:24:31 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-17 11:24:31
BZCOMMENTOR::Bob Kline
BZCOMMENT::80

(In reply to comment #42)
> You can always add an issue to have the title filter modified, along the lines
> of Robin's suggestion in comment #28.

Did this issue ever get created?

Comment entered 2011-05-17 12:02:00 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-05-17 12:02:00
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::81

(In reply to comment #80)
> (In reply to comment #42)
> > You can always add an issue to have the title filter modified, along the lines
> > of Robin's suggestion in comment #28.
>
> Did this issue ever get created?

Actually, users did not want that anymore. We talked about it in the meeting but the issue was not updated. Users want to continue using the "-Spanish" to designate the difference.

Comment entered 2011-05-17 14:47:00 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-17 14:47:00
BZCOMMENTOR::Bob Kline
BZCOMMENT::82

(In reply to comment #81)
> (In reply to comment #80)
> > (In reply to comment #42)
> > > You can always add an issue to have the title filter modified, along the lines
> > > of Robin's suggestion in comment #28.
> >
> > Did this issue ever get created?
>
> Actually, users did not want that anymore. We talked about it in the meeting
> but the issue was not updated. Users want to continue using the "-Spanish" to
> designate the difference.

In that case I'm going to wipe out the Media documents I just created on Franck and do another test run, with the DocTitle set to just use the contents of the MediaTitle element (as is done by the current Media DocTitle filter).

Comment entered 2011-05-17 14:55:34 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-05-17 14:55:34
BZCOMMENTOR::Robin Juthe
BZCOMMENT::83

(In reply to comment #82)
> (In reply to comment #81)
> > (In reply to comment #80)
> > > (In reply to comment #42)
> > > > You can always add an issue to have the title filter modified, along the lines
> > > > of Robin's suggestion in comment #28.
> > >
> > > Did this issue ever get created?
> >
> > Actually, users did not want that anymore. We talked about it in the meeting
> > but the issue was not updated. Users want to continue using the "-Spanish" to
> > designate the difference.
> In that case I'm going to wipe out the Media documents I just created on Franck
> and do another test run, with the DocTitle set to just use the contents of the
> MediaTitle element (as is done by the current Media DocTitle filter).

Users DO want the doc title to include the category, so I think a title filter change is still warranted. Without this, although the language would be clear, there wouldn't be any way to determine which documents are audio pronunciations and which are images. I havne't looked on Franck yet but I wanted to add this comment before you wipe everything out (hopefully).

Comment entered 2011-05-17 16:31:29 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-17 16:31:29
BZCOMMENTOR::Bob Kline
BZCOMMENT::84

(In reply to comment #31)
> Lowercased 'meeting recording' on Franck (and in version control archives).

You'll probably want to create another issue or two to have the template modified and to globally change the existing documents on Bach. You could piggy-back the global with the one that's going to add the external attribute I think.

Comment entered 2011-05-17 16:32:42 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-17 16:32:42
BZCOMMENTOR::Bob Kline
BZCOMMENT::85

(In reply to comment #83)

> Users DO want the doc title to include the category, so I think a title filter
> change is still warranted. Without this, although the language would be clear,
> there wouldn't be any way to determine which documents are audio pronunciations
> and which are images. I havne't looked on Franck yet but I wanted to add this
> comment before you wipe everything out (hopefully).

OK, I will back out the changes from the software and leave the previous test results in place.

Comment entered 2011-05-18 09:48:18 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-05-18 09:48:18
BZCOMMENTOR::Robin Juthe
BZCOMMENT::86

(In reply to comment #76)

Just wanted to address a few of Bob's questions:

> Right. I wonder if it would be more appropriate to use a dash

We'll keep this as is with the hyphen for the sake of consistency.

> I had been using what's in the spreadsheet if available or "Vanessa Richardson,
> VR Voice" if the cell is empty for that row. Should I just hard-code "Vanessa
> Richardson" (without her company)?

Please continue to use "Vanessa Richardson, VR Voice" - this is fine.

> I'm using "Patients" and "en" or "es" but perhaps that's not right (the
> description is always in English, after all).

Please continue doing what you're doing. This separate might come in handy for reports down the road.

Comment entered 2011-05-18 11:44:25 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-18 11:44:25
BZCOMMENTOR::Bob Kline
BZCOMMENT::87

I'm afraid our process is still broken. The workbook for Week_016_Rev2.zip says the files are in Week_016_Rev2 but they're actually in Weel_016_Rev2, so the software can't find them.

Comment entered 2011-05-18 11:50:34 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-05-18 11:50:34
BZCOMMENTOR::Volker Englisch
BZCOMMENT::88

(In reply to comment #87)
> I'm afraid our process is still broken.

That didn't come out of our process. Week 16 was the last week where Linda and Amy used a manual process to review the data and Vanessa had used a spreadsheet submitted to her by Linda.

I've renamed the Weel to Week on FRANCK.

Comment entered 2011-05-18 14:15:12 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-18 14:15:12
BZCOMMENTOR::Bob Kline
BZCOMMENT::89

(In reply to comment #88)
> (In reply to comment #87)
> > I'm afraid our process is still broken.
>
> That didn't come out of our process.

I was including Vanessa in the "our" part.

Comment entered 2011-05-18 16:29:20 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-05-18 16:29:20
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::90

(In reply to comment #79)
> Created attachment 2117 [details]
> Log from test run on Franck
>
> Even though this was a test run the media documents are actually created, so
> there will be something for the name documents to link to. Only a couple of
> anomalies in the log file: one name had changed, and one row in one of the
> worksheets (other than the header row) had no ID in the first column. Not all
Is this the

CDR0000043971 had no diff. I suppose it is the one affected by the row without an ID?

Comment entered 2011-05-18 17:47:04 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-18 17:47:04
BZCOMMENTOR::Bob Kline
BZCOMMENT::91

(In reply to comment #90)
> (In reply to comment #79)
> > Created attachment 2117 [details]
> > Log from test run on Franck
> >
> > Even though this was a test run the media documents are actually created, so
> > there will be something for the name documents to link to. Only a couple of
> > anomalies in the log file: one name had changed, and one row in one of the
> > worksheets (other than the header row) had no ID in the first column. Not all
> Is this the
>
> CDR0000043971 had no diff. I suppose it is the one affected by the row without
> an ID?

You should review the log file I attached. That's the one for which the name was changed.

Comment entered 2011-05-19 08:42:07 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-19 08:42:07
BZCOMMENTOR::Bob Kline
BZCOMMENT::92

I have modified the software to add Usage="External" to the Media element. Are there any other changes which need to be made? Should I run another import job on Franck? Live or test mode? Same set or a different set? Are we going to test with the original set of 50?

Comment entered 2011-05-19 09:06:44 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-05-19 09:06:44
BZCOMMENTOR::Volker Englisch
BZCOMMENT::93

(In reply to comment #92)
> Should I run another import job
> on Franck? Live or test mode? Same set or a different set? Are we going to
> test with the original set of 50?

Not sure what the answers will be to these questions but I've copied all files from BACH to FRANCK.

  • Week_012.zip, which includes the 50/50 test files

  • Week_015 - Week_018

Please note that the name of the spreadsheet for Week_012 does not follow our naming convention and we may need to rename it if you're looking for a particular file name (but I think you're only looking for an *.xls file).

Comment entered 2011-05-19 09:10:08 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-05-19 09:10:08
BZCOMMENTOR::Volker Englisch
BZCOMMENT::94

Would it be possible to load a set of audio files to MAHLER?
It would allow me to start testing the publishing changes and push some documents to Gatekeeper.

Comment entered 2011-05-19 09:29:26 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-19 09:29:26
BZCOMMENTOR::Bob Kline
BZCOMMENT::95

(In reply to comment #94)
> Would it be possible to load a set of audio files to MAHLER?
> It would allow me to start testing the publishing changes and push some
> documents to Gatekeeper.

Done; log file attached.

Comment entered 2011-05-19 09:29:26 by Kline, Bob (NIH/NCI) [C]

Attachment Request4926-20110519092239.log has been added with description: Log from live run on Mahler

Comment entered 2011-05-19 10:20:06 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-05-19 10:20:06
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::96

(In reply to comment #91)

> You should review the log file I attached. That's the one for which the name
> was changed.

Yes. I have been reviewing the log file and the test results. I just needed help interpreting the error message in the log file.

Comment entered 2011-05-20 12:17:29 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-05-20 12:17:29
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::97

I did not find any problems with the live run on Mahler. All the ones I checked looked good. But I have 2 questions about the test run on Franck.

1. The test results doesn't appear to include the "-Spanish" for the Spanish Media titles (on Mahler it displays that correctly).

2. 44135 added the Media link for only the English and not the Spanish. I wondered if it is the one you mentioned that the name had changed but I am still not sure even after looking at the log files.

3. Would it be possible to run in live mode on Franck (I know you created the media docs during the test run)? If you do, could you also post the spreadsheet you use or let me know which one and I can access it from the ftp site.

Comment entered 2011-05-20 15:48:35 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-05-20 15:48:35
BZCOMMENTOR::Bob Kline
BZCOMMENT::98

(In reply to comment #97)

> 1. The test results doesn't appear to include the "-Spanish" for the Spanish
> Media titles (on Mahler it displays that correctly).

That shouldn't matter, since what's put in the text node of a linking element is overwritten by what comes out of the DocTitle filter, but I've modified the software to add "-Spanish" to the links to Spanish pronunciations.

> 2. 44135 added the Media link for only the English and not the Spanish. I
> wondered if it is the one you mentioned that the name had changed but I am
> still not sure even after looking at the log files.

We didn't get a Spanish pronunciation file for that document.

> 3. Would it be possible to run in live mode on Franck (I know you
> created the media docs during the test run)? If you do, could you
> also post the spreadsheet you use or let me know which one and I
> can access it from the ftp site.

Done. Logs attached. The logs tell you, for each Media document I created, which zip file the sound file was taken from. Each zip file has only a single Excel workbook.

Comment entered 2011-05-20 15:48:35 by Kline, Bob (NIH/NCI) [C]

Attachment Request4926-201105201543-franck.log has been added with description: Log from live run on Franck

Comment entered 2011-05-23 14:48:48 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-05-23 14:48:48
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::99

Audio review of Week 018 completed:

>Hi William
>The audio pronunciation QC for Week 018 (initial batch and Rev1 and Rev2 >batches) have been approved and completed. These are good to go now.
>Thanks,
>Amy

Comment entered 2011-05-23 18:21:50 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-05-23 18:21:50
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::100

(In reply to comment #98)
> Created attachment 2120 [details]
> Log from live run on Franck
>

Review on Franck completed. They look good. I think we are ready to go live on Bach after the publishing/vendor filter issues have been done and tested.

Comment entered 2011-05-27 08:35:03 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-05-27 08:35:03
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::101

Review of Weeks 19 & 20 Completed.

From: Saucedo, Linda
Sent: Friday, May 27, 2011 7:59 AM
To: Osei-Poku, William
Cc: Beckwith, Margaret (NIH/NCI) [E]; Harrison, Robin (NIH/NCI) [E]
Subject: Week 19 and Week 20 Audio Revisions

Hi William,

Maryam and I have finished listening to the revisions from Week 19 & 20 and they have been approved and completed.

Let me know if you have any questions.

Thank you,
Linda

Comment entered 2011-06-17 08:13:55 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-06-17 08:13:55
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::102

Week 021 review completed.

From: Saucedo, Linda
Sent: Friday, June 17, 2011 8:05 AM
To: Osei-Poku, William
Cc: 'Beckwith, Margaret (NIH/NCI) [E]'; Harrison, Robin (NIH/NCI) [E]
Subject: Week 21 Audio Revisions

Hi William,

Amy and I have finished listening to the revisions from Week 21 and they have been approved and completed.

Let me know if you have any questions.

Thank you,
Linda

--------------------

Comment entered 2011-06-17 08:19:29 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-06-17 08:19:29
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::103

So far, here are the weeks I have received confirmation from Linda and Amy as completed and I have in turn reported in this issue.

[X] Week 016 (Manual Process)
[X] Week 017
[X] Week 018
[X] Week 019
[X] Week 020
[X] Week 021

Comment entered 2011-06-20 15:28:31 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-06-20 15:28:31
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::104

Review of week 22 completed.

[X] Week 016 (Manual Process)
[X] Week 017
[X] Week 018
[X] Week 019
[X] Week 020
[X] Week 021
[X] Week 022

Comment entered 2011-07-08 16:31:43 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-07-08 16:31:43
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::105

Review of weeks 23 & 26 completed.

[X] Week 016 - (Manual Process)
[X] Week 017
[X] Week 018
[X] Week 019
[X] Week 020
[X] Week 021
[X] Week 022
[X] Week 023
[ ] Week 024
[ ] Week 025 - (No audios for review - Vanessa on Vacation.)
[X] Week 026

Comment entered 2011-07-12 15:57:39 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-07-12 15:57:39
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::106

Review of week 24 completed.

[X] Week 016 - (Manual Process)
[X] Week 017
[X] Week 018
[X] Week 019
[X] Week 020
[X] Week 021
[X] Week 022
[X] Week 023
[X] Week 024
[ ] Week 025 - (No audios for review - Vanessa on Vacation.)
[X] Week 026

Comment entered 2011-07-13 17:23:16 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-07-13 17:23:16
BZCOMMENTOR::Volker Englisch
BZCOMMENT::107

The Cancer.gov team is getting ready to roll out the new enhancements around Monday/Tuesday of next week. We should start thinking about loading the audio files into production.

Comment entered 2011-07-14 13:42:01 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-07-14 13:42:01
BZCOMMENTOR::Bob Kline
BZCOMMENT::108

Unless Emile convinces Bryan to agree to accepting a smaller initial import, Bob will process the following zipfiles on Monday morning, in this order:

Week_012.zip
Week_012_Rev1.zip
Week_015.zip
Week_015_Rev1.zip
Week_015_Rev2.zip
Week_016.zip
Week_016_Rev1.zip
Week_016_Rev2.zip
Week_017.zip
Week_017_Rev1.zip
Week_018.zip
Week_018_Rev1.zip
Week_018_Rev2.zip
Week_019.zip
Week_019_Rev1.zip
Week_020.zip
Week_020_Rev1.zip
Week_021.zip
Week_021_Rev1.zip
Week_022.zip
Week_022_Rev1.zip
Week_022_Rev2.zip
Week_022_Rev3.zip
Week_022_Rev4.zip
Week_023.zip
Week_023_Rev1.zip
Week_023_Rev2.zip
Week_024.zip
Week_024_Rev1.zip
Week_024_Rev2.zip
Week_026.zip
Week_026_Rev1.zip

Comment entered 2011-07-15 14:09:01 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-07-15 14:09:01
BZCOMMENTOR::Volker Englisch
BZCOMMENT::109

According to Lakshmi we're supposed to load half of the audio files for the initial load.

Comment entered 2011-07-19 23:22:15 by alan

BZDATETIME::2011-07-19 23:22:15
BZCOMMENTOR::Alan Meyer
BZCOMMENT::110

I've added some custom queries to the CDR Query Interface on Bach.

The queries are:

Glossary Term audio review zipfiles completed
Shows zipfiles that are fully reviewed.

Glossary Term audio review zipfiles NOT completed
Shows zipfiles that still have some unreviewed mp3 files.

Glossary Term audio review mp3 still rejected
Shows mp3 files that are waiting for re-recording.

The third query has problems. For the Week 16 and 17 zipfiles, some of the review dates are out of sequence in the database. The approved mp3 in the Rev file is showing as earlier than the rejected mp3 in the original file.

I think this was probably caused by the manipulations to the data that were done to cope with startup problems. I'm tempted to just ignore those two weeks in the query, but I left them in and put comments in the query to, hopefully, avoid further confusion.

Comment entered 2011-07-20 13:30:54 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-07-20 13:30:54
BZCOMMENTOR::Robin Juthe
BZCOMMENT::111

Since William is on vacation, I'm pasting Amy's email below about week 27 being ready for upload. I've asked Amy to provide the file names in the order that they should be uploaded from now on, as discussed in last week's status meeting.

Here’s the files (in order) for week 27:

Week_027.zip
Week_027_Rev1.zip
Week_027_Rev2.zip

From: Kuhlmann, Amy aKuhlmann@icfi.com
Sent: Wednesday, July 20, 2011 12:33 PM
To: Harrison, Robin (NIH/NCI) [E]
Cc: Beckwith, Margaret (NIH/NCI) [E]; Saucedo, Linda; Osei-Poku, William
Subject: Week 27 Audios Completed

Hi Robin,

Just wanted to let you know that Linda and I have finished our review of the audios for Week 27, so these are good to go. We’re currently working on revisions for Week 28 and will notify you when those are finished.

Thanks,
Amy

Comment entered 2011-07-20 14:02:05 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-07-20 14:02:05
BZCOMMENTOR::Volker Englisch
BZCOMMENT::112

As part of the upgrade to Cancer.gov 6.2 we loaded the weeks 12 and 15-24. There were some files that did not get loaded from those weeks due to changes in the term name for instance, but we have 2229 audio files loaded to Cancer.gov at this point.

These are now the completed weeks/files that are ready to be loaded.
Week_024_Rev1.zip
Week_024_Rev2.zip
Week_026.zip
Week_026_Rev1.zip
Week_027.zip
Week_027_Rev1.zip
Week_027_Rev2.zip

The Cancer.gov team has asked to hold off with the load for this week, so we're planning to load these next week.

Comment entered 2011-07-21 08:11:41 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2011-07-21 08:11:41
BZCOMMENTOR::Robin Juthe
BZCOMMENT::113

(In reply to comment #112)
> These are now the completed weeks/files that are ready to be loaded.

I think the original week 24 zip file should be on this list too, so the files ready to be loaded are:

Week_024.zip
Week_024_Rev1.zip
Week_024_Rev2.zip
Week_026.zip
Week_026_Rev1.zip
Week_027.zip
Week_027_Rev1.zip
Week_027_Rev2.zip

Comment entered 2011-07-22 14:14:39 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-07-22 14:14:39
BZCOMMENTOR::Bob Kline
BZCOMMENT::114

The program to add the Media documents and link to them from the glossary documents is on Bach:

d:\cdr\Utilities\Request4926.py

Invoke with command-line arguments identifying zip files containing
media files to be added and accompanying spreadsheet telling how to
link the new media documents from the appropriate GlossaryTermName
documents. The zip files must be named in the proper order, as
media files found in zip files named later in the list of command
line arguments will supercede those found in zip files named earlier.
e.g.:

SET AUDIO=d:/cdr/Audio_from_CIPSFTP
Request4926.py rmk pw %AUDIO%/Week_026.zip %AUDIO%/Week_026_Rev1.zip

or:
d:
cd \cdr\Audio_from_CIPSFTP
d:\cdr\Utilities\Request4926.py Week_026.zip Week_026_Rev1.zip

Comment entered 2011-08-03 18:58:23 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-08-03 18:58:23
BZCOMMENTOR::Volker Englisch
BZCOMMENT::115

The following files have been loaded and were published to Gatekeeper. The terms will not show up until Friday's load because GlossaryTerms are not part of the nightly publishing job.
Week_024.zip
Week_024_Rev1.zip
Week_024_Rev2.zip

Bob, the next set of audio files ready to be loaded are the following. Could you please load these?
Week_026.zip
Week_026_Rev1.zip
Week_027.zip
Week_027_Rev1.zip
Week_027_Rev2.zip

Comment entered 2011-08-04 08:25:37 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-08-04 08:25:37
BZCOMMENTOR::Bob Kline
BZCOMMENT::116

> Bob, the next set of audio files ready to be loaded are the following. Could
> you please load these?
> Week_026.zip
> Week_026_Rev1.zip
> Week_027.zip
> Week_027_Rev1.zip
> Week_027_Rev2.zip

Done. I have attached the error messages from the jobs which have been run so far on Bach, just so CIAT will know what's failing.

Comment entered 2011-08-04 08:25:37 by Kline, Bob (NIH/NCI) [C]

Attachment Request4926.err has been added with description: Errors from live runs on Bach

Comment entered 2011-08-04 11:08:06 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-08-04 11:08:06
BZCOMMENTOR::Volker Englisch
BZCOMMENT::117

I've published Week 26 + Week 27. The audio files are currently processing on Gatekeeper.

William, what's the last completed weekly batch that needs to be loaded?
Are Week 28 and Week 29 ready to go?

Comment entered 2011-08-04 11:15:18 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-08-04 11:15:18
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::118

(In reply to comment #117)
> I've published Week 26 + Week 27. The audio files are currently processing on
> Gatekeeper.
>
> William, what's the last completed weekly batch that needs to be loaded?
> Are Week 28 and Week 29 ready to go?

I believe the last completed week is 27. I have not been notified that weeks 28 and 29 have been done yet.

Comment entered 2011-08-10 13:28:28 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-08-10 13:28:28
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::119

The following files (Weeks 28 & 29) have been completed and can be loaded into the CDR:

Week_028.zip
Week_028_Rev1.zip
Week_028_Rev2.zip

Week_029.zip
Week_029_Rev1.zip
Week_029_Rev2.zip

Comment entered 2011-08-10 18:38:41 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-08-10 18:38:41
BZCOMMENTOR::Volker Englisch
BZCOMMENT::120

Bob, could you please load the audio files for Week 28 and Week 29 tomorrow morning?

Comment entered 2011-08-11 08:16:26 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-08-11 08:16:26
BZCOMMENTOR::Bob Kline
BZCOMMENT::121

Done. Only a single failure:

2011-08-11 08:13:41: CDR46552: unable to find name node for u'exploraci\xf3n conradion\xfaclido' in CDR46552

Comment entered 2011-08-23 15:58:42 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-08-23 15:58:42
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::122

The following files (Weeks 30 & 31) have been completed and can be loaded into
the CDR:

Week_030.zip
Week_030_Rev1.zip

Week_031.zip
Week_031_Rev1.zip

Comment entered 2011-08-23 18:13:24 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-08-23 18:13:24
BZCOMMENTOR::Bob Kline
BZCOMMENT::123

(In reply to comment #122)
> The following files (Weeks 30 & 31) have been completed and can be loaded into
> the CDR:
>
> Week_030.zip
> Week_030_Rev1.zip
>
> Week_031.zip
> Week_031_Rev1.zip

Done.

Comment entered 2011-09-08 11:34:47 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-09-08 11:34:47
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::124

Amy and Linda have completed reviewing the following files:

Week_032.zip
Week_032_Rev1.zip
Week_032_Rev2.zip
Week_032_Rev3.zip
Week_032_Rev4.zip

Week_033.zip
Week_033_Rev1.zip
Week_033_Rev2.zip

Week_034.zip
Week_034_Rev1.zip
Week_034_Rev2.zip

They are ready to be loaded onto Bach.

Comment entered 2011-09-08 14:13:14 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-09-08 14:13:14
BZCOMMENTOR::Volker Englisch
BZCOMMENT::125

Bob, could you please load the audio files for the weeks 32, 33, and 34 sometime tonight (after 7pm) or tomorrow morning? There are about 600 audio files in this batch and it will take at least 4-5 hours to load those on Gatekeeper.

Comment entered 2011-09-09 08:41:00 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-09-09 08:41:00
BZCOMMENTOR::Bob Kline
BZCOMMENT::126

Weeks 32-34 have been loaded. List of errors from the loading of weeks 30 through 34 are attached.

Comment entered 2011-09-09 08:41:00 by Kline, Bob (NIH/NCI) [C]

Attachment 4926-20110909.log has been added with description: Errors from the past couple of jobs

Comment entered 2011-09-09 15:03:08 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-09-09 15:03:08
BZCOMMENTOR::Volker Englisch
BZCOMMENT::127

(In reply to comment #126)
> Weeks 32-34 have been loaded.

The audio files have been loaded to Gatekeeper and will be available on Cancer.gov after tonight's weekly publishing job.

Comment entered 2011-09-21 14:23:08 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-09-21 14:23:08
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::128

(In reply to comment #126)
> Created attachment 2155 [details]
> Errors from the past couple of jobs
>
> Weeks 32-34 have been loaded. List of errors from the loading of weeks 30
> through 34 are attached.

The errors have been fixed.

Comment entered 2011-09-27 13:07:44 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-09-27 13:07:44
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::129

The following files for weeks 35 and 36 have been completed and are ready to be loaded onto Bach.

Week_035.zip
Week_035_Rev1.zip
Week_035_Rev2.zip

Week_036.zip
Week_036_Rev1.zip

Please see note below from the reviewers:

"FYI: For the Week 35 batch, the Spanish term, “parathormona”, (CDR 403137) has been deleted from the GTN to avoid having a duplicate translation. (The translation is the same as the Spanish term, paratirina, [CDR 403138], which is also included on this spreadsheet.) “Parathormona” has been marked as “Approved” on the Week 35 spreadsheet, however, we won’t need a Media doc created for this term. If the Media doc ends up being programatically created for “parathormona”, then we will need to block it."

I suspect this will be reported as an error and then we will manually fix it.

Comment entered 2011-09-28 12:36:24 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-09-28 12:36:24
BZCOMMENTOR::Volker Englisch
BZCOMMENT::130

(In reply to comment #129)
> The following files for weeks 35 and 36 have been completed and are ready
> to be loaded onto Bach.

Bob, I don't know how you are feeling today but if you do come to work tomorrow could you please load the audio files for these two weeks during the morning? There are too many files to have them still loaded this afternoon. The load might not finish before the nightly publishing starts.

Comment entered 2011-09-28 13:14:50 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-09-28 13:14:50
BZCOMMENTOR::Bob Kline
BZCOMMENT::131

(In reply to comment #130)

> could you please load the audio files for these two weeks during the morning?

I'll do it first thing tomorrow.

Comment entered 2011-09-29 07:53:22 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-09-29 07:53:22
BZCOMMENTOR::Bob Kline
BZCOMMENT::132

Errors from this morning's import job:

!5912 Thu Sep 29 07:42:15 2011: CDR410605 u'menstrual period' (English) row 246 in Week_035.zip: There is no item named u'Week_035/410605_en.mp3' in the archive
!5912 Thu Sep 29 07:42:15 2011: CDR410605 u'per\xedodo menstrual' (Spanish) row 247 in Week_035.zip: There is no item named u'Week_035/410605_es.mp3' in the archive
!5912 Thu Sep 29 07:42:16 2011: CDR411379 u'monoclonal gammopathy of undetermined significance' (English) row 11 in Week_035_Rev1.zip: There is no item named u'Week_035_Rev1/411379_en.mp3' in the archive
!5912 Thu Sep 29 07:42:16 2011: CDR416109 u'stage IV AIDS-related lymphoma' (English) row 137 in Week_036.zip: There is no item named u'Week_036/416109_en.mp3' in the archive
!5912 Thu Sep 29 07:42:16 2011: CDR416109 u'linfoma relacionado con el SIDA en estadio IV' (Spanish) row 138 in Week_036.zip: There is no item named u'Week_036/416109_es.mp3' in the archive
2011-09-29 07:45:54: CDR403137: unable to find name node for u'parathormona' in CDR403137
2011-09-29 07:47:23: CDR415317: unable to find name node for u'met\xe1stasis distante' in CDR415317
2011-09-29 07:47:41: CDR423043: unable to find name node for u'factor estimulador de colonias de granulocitos y macrof\xe1gos' in CDR423043

Comment entered 2011-09-29 16:50:01 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-09-29 16:50:01
BZCOMMENTOR::Volker Englisch
BZCOMMENT::133

The audio files for the Weeks 35 and 36 have been loaded to Gatekeeper and will be available on Cancer.gov after tomorrow's weekly publishing job.

Comment entered 2011-09-29 18:34:26 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-09-29 18:34:26
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::134

Week 37 audio review is completed. The following files are ready to be uploaded to Bach.

Week_037.zip
Week_037_Rev1.zip
Week_037_Rev2.zip

Comment entered 2011-09-29 22:02:58 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-09-29 22:02:58
BZCOMMENTOR::Volker Englisch
BZCOMMENT::135

Bob, if you could load this batch in the morning I could include these for the weekly publishing along with the other two weeks we just loaded.

Comment entered 2011-09-30 08:50:40 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-09-30 08:50:40
BZCOMMENTOR::Bob Kline
BZCOMMENT::136

(In reply to comment #135)
> Bob, if you could load this batch in the morning I could include these for the
> weekly publishing along with the other two weeks we just loaded.

Done. Only one error from this batch:

!6732 Fri Sep 30 08:45:57 2011: CDR430478 u'cl\xe9rigo' (Spanish) row 14 in Week_037.zip: There is no item named 'Week_037/430478_es.mp3' in the archive

Comment entered 2011-09-30 10:02:21 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-09-30 10:02:21
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::137

Week 38 audio review is completed. The following files are ready to be uploaded
to Bach.

Week_038.zip
Week_038_Rev1.zip
Week_038_Rev2.zip

Comment entered 2011-09-30 10:08:10 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-09-30 10:08:10
BZCOMMENTOR::Volker Englisch
BZCOMMENT::138

Bob, Week 38 are only about 100 documents. If you can load this batch before lunch I'll include them in the weekly update. Otherwise we should wait until Monday.

Comment entered 2011-09-30 11:03:23 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-09-30 11:03:23
BZCOMMENTOR::Bob Kline
BZCOMMENT::139

(In reply to comment #138)
> Bob, Week 38 are only about 100 documents. If you can load this batch before
> lunch I'll include them in the weekly update. Otherwise we should wait until
> Monday.

Done. There was one error:

!5100 Fri Sep 30 10:58:15 2011: CDR439416 u'portal hypertension' (English) row 1 in Week_038_Rev1.zip: There is no item named u'Week_037_Rev2/430486_es.mp3' in the archive

Comment entered 2011-10-05 11:27:35 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-10-05 11:27:35
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::140

(In reply to comment #139)
> Done. There was one error:
>
> !5100 Fri Sep 30 10:58:15 2011: CDR439416 u'portal hypertension' (English) row
> 1 in Week_038_Rev1.zip: There is no item named u'Week_037_Rev2/430486_es.mp3'
> in the archive

This error has been fixed.

Comment entered 2011-10-11 10:53:28 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-10-11 10:53:28
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::141

Week 39 audio review is completed. The following files are ready to be uploaded
to Bach.

Week_039.zip
Week_039_Rev1.zip
Week_039_Rev2.zip

Comment entered 2011-10-14 13:45:59 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-10-14 13:45:59
BZCOMMENTOR::Volker Englisch
BZCOMMENT::142

Bob, could you please load Week 39? I forgot to send an email earlier today and if you're not able to do this within the next half an hour we should probably better wait until Monday but I would like to include these with the weekly update if possible.

Comment entered 2011-10-14 14:36:54 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-10-14 14:36:54
BZCOMMENTOR::Bob Kline
BZCOMMENT::143

I can do it right now. Soon enough?

Comment entered 2011-10-18 09:40:48 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-10-18 09:40:48
BZCOMMENTOR::Bob Kline
BZCOMMENT::144

Week 39 files loaded. Three errors:

!4584 Tue Oct 18 09:35:29 2011: CDR446197 u'stage IV uterine sarcoma' (English) row 177 in Week_039.zip: There is no item named u'Week_039/446197_en.mp3' in the archive

!4584 Tue Oct 18 09:35:29 2011: CDR446197 u'sarcoma uterino en estadio IV' (Spanish) row 178 in Week_039.zip: There is no item named u'Week_039/446197_es.mp3' in the archive

2011-10-18 09:38:40: CDR446420: unable to find name node for u'tumor de la gl\xe1ndula pituitaria' in CDR446420

Comment entered 2011-10-18 13:58:25 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-10-18 13:58:25
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::145

Review of Week 40 audio files is done. The following files are ready to be loaded to Bach.

Week_040.zip
Week_040_Rev1.zip
Week_040_Rev2.zip

Comment entered 2011-10-19 10:49:23 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-10-19 10:49:23
BZCOMMENTOR::Volker Englisch
BZCOMMENT::146

(In reply to comment #145)
> Review of Week 40 audio files is done.

There are about 150 terms in this file. Let's load these today, if possible.

Comment entered 2011-10-19 10:51:26 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-10-19 10:51:26
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::147

(In reply to comment #144)
> Week 39 files loaded. Three errors:
>
Two of the errors have been fixed. The third pronunciation needs to be re-recorded. Linda is contacting Vanessa about it.

Comment entered 2011-10-19 11:09:00 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-10-19 11:09:00
BZCOMMENTOR::Bob Kline
BZCOMMENT::148

(In reply to comment #146)
> (In reply to comment #145)
> > Review of Week 40 audio files is done.
>
> There are about 150 terms in this file. Let's load these today, if possible.

Done. Here are the errors from the run:

!5540 Wed Oct 19 11:00:45 2011: CDR450136 u'carcinoma de la cavidad nasal y el seno etmoidal in situ, estadio 0' (Spanish) row 126 in Week_040.zip: There is no item named u'Week_040/450136_es.mp3' in the archive
2011-10-19 11:03:02: CDR450130: unable to find name node for u'c\xe1ncer del seno maxilar en estadio I' in CDR450130
2011-10-19 11:03:14: CDR454697: unable to find name node for u'trastorno de adaptaci\xf3n' in CDR454697
2011-10-19 11:03:52: CDR454843: unable to find name node for u'localized malignant mesothelioma' in CDR454843
2011-10-19 11:03:52: Doc CDR0000454843 made invalid by change - will NOT store it
New pub version would become invalid
No publishable document found for link target: MediaID=CDR0000715191
No publishable document found for link target: MediaID=CDR0000715190
2011-10-19 11:03:52: Warning for CDR0000454843: No publishable document found for link target: MediaID=CDR0000715191
2011-10-19 11:03:52: Warning for CDR0000454843: No publishable document found for link target: MediaID=CDR0000715190
2011-10-19 11:03:52: Warning for CDR0000454843: Non-publishable version will be created.
2011-10-19 11:03:53: CDR454927: unable to find name node for u'advanced malignant mesothelioma' in CDR454927
2011-10-19 11:03:53: Doc CDR0000454927 made invalid by change - will NOT store it
New pub version would become invalid
No publishable document found for link target: MediaID=CDR0000715176
No publishable document found for link target: MediaID=CDR0000715175
2011-10-19 11:03:53: Warning for CDR0000454927: No publishable document found for link target: MediaID=CDR0000715176
2011-10-19 11:03:53: Warning for CDR0000454927: No publishable document found for link target: MediaID=CDR0000715175
2011-10-19 11:03:53: Warning for CDR0000454927: Non-publishable version will be created.

Comment entered 2011-10-27 13:04:56 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-10-27 13:04:56
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::149

(In reply to comment #148)
> (In reply to comment #146)
> > (In reply to comment #145)
> > > Review of Week 40 audio files is done.
> >
> > There are about 150 terms in this file. Let's load these today, if possible.
>
> Done. Here are the errors from the run:
>
The errors for week 40 have been fixed.

Comment entered 2011-11-07 11:05:44 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-11-07 11:05:44
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::150

The audio pronunciations for weeks 41 and 42 have been reviewed and the following files are ready to be loaded onto Bach.
Week_041.zip
Week_041_Rev1.zip
Week_041_Rev2.zip
Week_042.zip
Week_042_Rev1.zip
Week_042_Rev2.zip

Comment entered 2011-11-08 10:35:19 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-11-08 10:35:19
BZCOMMENTOR::Volker Englisch
BZCOMMENT::151

Bob, could you please load the weeks 41 and 42 this morning?

Comment entered 2011-11-09 11:38:56 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-11-09 11:38:56
BZCOMMENTOR::Bob Kline
BZCOMMENT::152

(In reply to comment #151)
> Bob, could you please load the weeks 41 and 42 this morning?

Done. Errors:

!1276 Wed Nov 09 10:52:19 2011: CDR457974 u'c\xe1ncer adrenocortical' (Spanish) row 2 in Week_041.zip: There is no item named u'Week_041/457974_es.mp3' in the archive
!1276 Wed Nov 09 10:52:21 2011: CDR476287 u'lobeline' (English) row 19 in Week_042.zip: There is no item named u'Week_042/476287_en.mp3' in the archive
!1276 Wed Nov 09 10:52:22 2011: CDR476625 u'mustine' (English) row 6 in Week_042_Rev1.zip: There is no item named u'Week_042_Rev1/476625_en.mp3' in the archive
2011-11-09 10:58:49: CDR481624: unable to find name node for u'purple coneflower' in CDR481624

Comment entered 2011-11-09 17:49:29 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-11-09 17:49:29
BZCOMMENTOR::Volker Englisch
BZCOMMENT::153

The audio files for weeks 41 and 42 have been loaded to Cancer.gov and will be available after Friday's publishing job.

Comment entered 2011-11-14 10:22:54 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-11-14 10:22:54
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::154

(In reply to comment #152)
> (In reply to comment #151)
> > Bob, could you please load the weeks 41 and 42 this morning?
>
> Done. Errors:
>
Errors for weeks 41 and 42 have been fixed.

Comment entered 2011-11-14 10:25:46 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-11-14 10:25:46
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::155

The audio pronunciations for week 43 have been completed. The following files are ready to be loaded onto Bach.

Week_043.zip
Week_043_Rev1.zip
Week_043_Rev2.zip

Comment entered 2011-11-23 15:40:25 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-11-23 15:40:25
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::156

The audio pronunciations for weeks 44 and 45 have been completed. The following files are ready to be loaded onto Bach.

Week_044.zip
Week_044_Rev1.zip
Week_044_Rev2.zip

Week_045.zip
Week_045_Rev1.zip
Week_045_Rev2.zip

Comment entered 2011-11-28 09:45:03 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-11-28 09:45:03
BZCOMMENTOR::Volker Englisch
BZCOMMENT::157

We have three weeks worth of audio files to load (about 700 documents).
Bob, if you're able to load the weeks 43-45 this morning please go ahead with the load. Otherwise we should rather wait until tomorrow.

Comment entered 2011-11-28 10:54:03 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-11-28 10:54:03
BZCOMMENTOR::Bob Kline
BZCOMMENT::158

(In reply to comment #157)
> We have three weeks worth of audio files to load (about 700 documents).
> Bob, if you're able to load the weeks 43-45 this morning please go ahead with
> the load. Otherwise we should rather wait until tomorrow.

Done. Error log attached.

Comment entered 2011-11-28 10:54:03 by Kline, Bob (NIH/NCI) [C]

Attachment 4926.errors.txt has been added with description: Errors from last load

Comment entered 2011-11-28 14:55:18 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-11-28 14:55:18
BZCOMMENTOR::Volker Englisch
BZCOMMENT::159

The audio files for weeks 43, 44 and 45 have been loaded to Cancer.gov and will be
available after Friday's publishing job.

Comment entered 2011-12-01 11:22:59 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-12-01 11:22:59
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::160

(In reply to comment #158)
> Created attachment 2192 [details]
> Errors from last load

>
> Done. Error log attached.

The errors have been fixed.

Comment entered 2011-12-06 09:38:26 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-12-06 09:38:26
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::161

The audio pronunciations for week 47 have been reviewed and approved. The following files are ready to be loaded onto Bach.

Week_047.zip
Week_047_Rev1.zip

Comment entered 2011-12-19 11:00:07 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-12-19 11:00:07
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::162

The following files for week 48 are ready to be uploaded onto Bach.
Week_048.zip
Week_048_Rev1.zip
Week_048_Rev2.zip

Comment entered 2011-12-19 11:12:46 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-12-19 11:12:46
BZCOMMENTOR::Volker Englisch
BZCOMMENT::163

FYI:
I see weeks 47 and 48 to be ready but I don't see a note about week 46 in the comments.
What's the status on 46?

Comment entered 2011-12-19 12:31:01 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-12-19 12:31:01
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::164

(In reply to comment #163)
> FYI:
> I see weeks 47 and 48 to be ready but I don't see a note about week 46 in the
> comments.
> What's the status on 46?

Thanks for catching this, Volker. We somehow missed week 46. The files for Week 46 are ready to go:

Week_046.zip
Week_046_Rev1.zip
Week_046_Rev2.zip

Comment entered 2011-12-19 13:19:32 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-12-19 13:19:32
BZCOMMENTOR::Volker Englisch
BZCOMMENT::165

Bob, maybe we can run a load of the weeks 46-48 tomorrow morning?

Comment entered 2011-12-20 11:06:52 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2011-12-20 11:06:52
BZCOMMENTOR::Bob Kline
BZCOMMENT::166

(In reply to comment #165)
> Bob, maybe we can run a load of the weeks 46-48 tomorrow morning?

Done. Errors from run:

!7400 Mon Nov 28 10:39:18 2011: CDR535468 u'Lipitor' (English) row 237 in Week_044.zip: There is no item named u'Week_044/535468_en.mp3' in the archive
!7400 Mon Nov 28 10:39:18 2011: CDR535468 u'Lipitor' (Spanish) row 238 in Week_044.zip: There is no item named u'Week_044/535468_es.mp3' in the archive
!7400 Mon Nov 28 10:39:18 2011: Week_044_Rev2.zip row 7: int() argument must be a string or a number, not 'NoneType'
!7400 Mon Nov 28 10:39:18 2011: Week_044_Rev2.zip row 8: int() argument must be a string or a number, not 'NoneType'
!7400 Mon Nov 28 10:39:18 2011: Week_044_Rev2.zip row 9: int() argument must be a string or a number, not 'NoneType'
!7400 Mon Nov 28 10:39:18 2011: Week_044_Rev2.zip row 10: int() argument must be a string or a number, not 'NoneType'
!7400 Mon Nov 28 10:39:19 2011: CDR543497 u'gland of Lieberkuhn' (English) row 123 in Week_045.zip: There is no item named u'Week_045/543497_en.mp3' in the archive
!7400 Mon Nov 28 10:39:19 2011: CDR543497 u'gl\xe1ndula de Lieberkuhn' (Spanish) row 124 in Week_045.zip: There is no item named u'Week_045/543497_es.mp3' in the archive
!7400 Mon Nov 28 10:39:20 2011: CDR543930 u'c\xe1ncer periampollar' (Spanish) row 18 in Week_045_Rev1.zip: There is no item named u'Week_045_Rev1/543930_es.mp3' in the archive
2011-11-28 10:43:35: CDR507600: unable to find name node for u'formulaci\xf3n estabilizada por alb\xfamina de paclitaxel adherido a una nanopart\xedcula' in CDR507600
2011-11-28 10:43:36: CDR507601: unable to find name node for u'nanopart\xedcula del paclitaxel' in CDR507601
2011-11-28 10:44:08: CDR512908: unable to find name node for u'c\xe1ncer de labio y cavidad oral en estadio IVA' in CDR512908
2011-11-28 10:44:52: CDR522475: unable to find name node for u'IL-1-alpha' in CDR522475
2011-11-28 10:44:54: CDR522476: unable to find name node for u'interleucina-1-alfa' in CDR522476
2011-11-28 10:45:12: CDR524207: unable to find name node for u'efusi\xf3n pleural maligna' in CDR524207
2011-11-28 10:46:22: CDR539481: unable to find name node for u'esomeprazole de magnesio' in CDR539481
2011-11-28 10:47:05: CDR549435: unable to find name node for u'Zactima' in CDR549435
2011-11-28 10:47:05: Doc CDR0000549435 made invalid by change - will NOT store it
New pub version would become invalid
No publishable document found for link target: MediaID=CDR0000718204
No publishable document found for link target: MediaID=CDR0000718205
2011-11-28 10:47:06: Warning for CDR0000549435: No publishable document found for link target: MediaID=CDR0000718205
2011-11-28 10:47:06: Warning for CDR0000549435: No publishable document found for link target: MediaID=CDR0000718204
2011-11-28 10:47:06: Warning for CDR0000549435: Non-publishable version will be created.

!5844 Tue Dec 20 10:50:52 2011: CDR613205 u'leucovorina c\xe1lcica' (Spanish) row 289 in Week_048.zip: There is no item named u'Week_048/613205_es.mp3' in the archive
2011-12-20 10:58:50: CDR583711: unable to find name node for u'Ibiopsia por AAF' in CDR583711
2011-12-20 10:59:45: CDR597169: unable to find name node for u'radioinmunoter\xe1pico' in CDR597169
2011-12-20 11:00:30: CDR609744: unable to find name node for u'carcinona in situ no melanoma' in CDR609744

Comment entered 2011-12-20 16:28:55 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2011-12-20 16:28:55
BZCOMMENTOR::Volker Englisch
BZCOMMENT::167

The audio files for the weeks 46 - 48 have been loaded to Gatekeeper and will be available on Cancer.gov by Christmas.

Comment entered 2011-12-22 12:12:34 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2011-12-22 12:12:34
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::168

(In reply to comment #166)
> (In reply to comment #165)
> > Bob, maybe we can run a load of the weeks 46-48 tomorrow morning?
>
> Done. Errors from run:

The errors have been fixed.

Comment entered 2012-01-11 19:50:43 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-01-11 19:50:43
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::169

The following files for weeks 49-51 are ready to uploaded onto Bach.
Week_049.zip
Week_049_Rev1.zip
Week_049_Rev2.zip
Week_050.zip
Week_050_Rev1.zip
Week_050_Rev2.zip
Week_051.zip
Week_051_Rev1.zip

Comment entered 2012-01-12 09:44:29 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-01-12 09:44:29
BZCOMMENTOR::Volker Englisch
BZCOMMENT::170

Bob, please go ahead and load those three weeks available.

Comment entered 2012-01-12 10:49:12 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-01-12 10:49:12
BZCOMMENTOR::Bob Kline
BZCOMMENT::171

Done.

!7924 Thu Jan 12 10:35:01 2012: CDR630776 u'papillary-reticular dermal interface' (English) row 3 in Week_049_Rev1.zip: There is no item named u'Week_049_Rev1/630776_en.mp3' in the archive
2012-01-12 10:40:00: CDR632169: unable to find name node for u'ligando del oncog\xe9n kit' in CDR632169
2012-01-12 10:40:29: CDR634971: unable to find name node for u'sustancia en fase investigaci\xf3n' in CDR634971
2012-01-12 10:40:36: CDR635299: unable to find name node for u'P30 Cancer Center Support Grant' in CDR635299
2012-01-12 10:41:08: CDR635823: unable to find name node for u'gemcitabine/cisplatin' in CDR635823
2012-01-12 10:41:09: CDR635824: unable to find name node for u'gemcitabine/cisplatin regimen' in CDR635824

Comment entered 2012-01-12 15:20:29 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-01-12 15:20:29
BZCOMMENTOR::Volker Englisch
BZCOMMENT::172

The audio files for the weeks 49 - 51 have been loaded to Gatekeeper and will
be available on Cancer.gov after Friday's publishing job finished.

Comment entered 2012-01-19 09:23:03 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-01-19 09:23:03
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::173

weeks 49-51 errors have been fixed.

Comment entered 2012-01-20 16:46:56 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2012-01-20 16:46:56
BZCOMMENTOR::Robin Juthe
BZCOMMENT::174

Vanessa is nearing the end of the first major batch of 10,493 terms so we're ready to generate the next spreadsheet.

Bob, could you please generate a spreadsheet of all glossary term names docs with English pronunciation keys that were not on either the above "Docs with English pronunciations" (3/15/11) document or the above "Sample using published terms with at least one Spanish name" (12/14/10).

This should give us all of the terms that have had pronunciations added since the date of the master spreadsheet or that are entirely new with a pronunciation since that time.

The spreadsheet should have the following columns:

CDR ID
Term Name
Language
Pronunciation
Filename
Notes (Vanessa)

Please sort it by CDR ID. Thanks.

Comment entered 2012-01-23 09:11:55 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-01-23 09:11:55
BZCOMMENTOR::Bob Kline
BZCOMMENT::175

Please check this before passing it on to Vanessa. In particular, make sure that the format (content and order of columns) matches what Alan's software needs, and spot-check to make sure that the logic to exclude terms whose IDs appeared on the two earlier reports identified by Robin worked correctly.

Comment entered 2012-01-23 09:11:55 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926-20120123.xls has been added with description: Requested report

Comment entered 2012-01-25 14:40:37 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-01-25 14:40:37
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::176

The following files for weeks 52-54 are ready to be uploaded onto Bach.
Week_052.zip
Week_052_Rev1.zip
Week_052_Rev2.zip
Week_052_Rev3.zip
Week_053.zip
Week_053_Rev1.zip
Week_054.zip
Week_054_Rev1.zip

Comment entered 2012-01-27 12:12:38 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-01-27 12:12:38
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::177

(In reply to comment #175)
> Created attachment 2216 [details]
> Requested report
>
> Please check this before passing it on to Vanessa. In particular, make sure
> that the format (content and order of columns) matches what Alan's software
> needs, and spot-check to make sure that the logic to exclude terms whose IDs
> appeared on the two earlier reports identified by Robin worked correctly.

Amy reviewed the list and confirmed that they are the right terms. However, they are currently working on adding pronunciation keys to about 35 terms and they want these terms included in the list. Will you be able to generate a new report by mid next week so that all the new ones they are adding would be captured?

Comment entered 2012-01-30 11:49:28 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-01-30 11:49:28
BZCOMMENTOR::Bob Kline
BZCOMMENT::178

(In reply to comment #177)

> Amy reviewed the list and confirmed that they are the right terms. However,
> they are currently working on adding pronunciation keys to about 35 terms and
> they want these terms included in the list. Will you be able to generate a new
> report by mid next week so that all the new ones they are adding would be
> captured?

Yes, I can do that.

Comment entered 2012-01-31 18:38:31 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-01-31 18:38:31
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::179

(In reply to comment #178)
> (In reply to comment #177)
>
> > Amy reviewed the list and confirmed that they are the right terms. However,
> > they are currently working on adding pronunciation keys to about 35 terms and
> > they want these terms included in the list. Will you be able to generate a new
> > report by mid next week so that all the new ones they are adding would be
> > captured?
>
> Yes, I can do that.

This is done. The terms have been updated with pronunciation keys so please proceed to generate the report. Thanks!

Comment entered 2012-02-01 12:00:38 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-02-01 12:00:38
BZCOMMENTOR::Volker Englisch
BZCOMMENT::180

FYI:

We will hold off loading the remaining audio files listed in Comment 176 until the current problems with Gatekeeper/Percussion have been resolved.

Comment entered 2012-02-02 13:38:01 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-02-02 13:38:01
BZCOMMENTOR::Bob Kline
BZCOMMENT::181

(In reply to comment #179)

> This is done. The terms have been updated with pronunciation keys so please
> proceed to generate the report. Thanks!

Needed by Tuesday.

Comment entered 2012-02-06 20:19:42 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-02-06 20:19:42
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::182

The following files for week 55 are ready to be uploaded to:
Week_055.zip
Week_055_Rev3.zip
Week_55_Rev1.zip
Week_55_Rev2.zip

So far files for the following weeks have not been uploaded:

[ ]Week 52
[ ]Week 53
[ ]Week 54
[ ]Week 55

Comment entered 2012-02-07 09:48:24 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-02-07 09:48:24
BZCOMMENTOR::Bob Kline
BZCOMMENT::183

(In reply to comment #179)

> This is done. The terms have been updated with pronunciation keys so please
> proceed to generate the report. Thanks!

Please check carefully, both to confirm that the new terms you've just added pronunciations for are included, and to verify that I haven't included terms that were in the original lists.

Comment entered 2012-02-07 09:48:24 by Kline, Bob (NIH/NCI) [C]

Attachment A.xls has been added with description: Fresh copy of the new list of glossary names

Comment entered 2012-02-07 14:14:04 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2012-02-07 14:14:04
BZCOMMENTOR::Robin Juthe
BZCOMMENT::184

(In reply to comment #183)
> Created attachment 2220 [details]
> Fresh copy of the new list of glossary names

We reviewed this in today's meeting and we were wondering if you could remove from this list any documents that were made newly publishable after last Friday, 2/3.

The reason for doing this is to avoid sending Vanessa new English terms that have not been translated yet. (There are, however, other terms on the list that do not have a Spanish translation because an equivalent term does not exist in Spanish.)

Thanks.

Comment entered 2012-02-07 16:59:07 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-02-07 16:59:07
BZCOMMENTOR::Bob Kline
BZCOMMENT::185

(In reply to comment #184)

> We reviewed this in today's meeting and we were wondering if you could remove
> from this list any documents that were made newly publishable after last
> Friday, 2/3.

How's this? Didn't eliminate many of the rows, but some got knocked off.

Comment entered 2012-02-07 16:59:07 by Kline, Bob (NIH/NCI) [C]

Attachment A.xls has been added with description: Another attempt

Comment entered 2012-02-07 17:02:24 by Juthe, Robin (NIH/NCI) [E]

BZDATETIME::2012-02-07 17:02:24
BZCOMMENTOR::Robin Juthe
BZCOMMENT::186

(In reply to comment #185)
> Created attachment 2223 [details]
> Another attempt
> (In reply to comment #184)
> > We reviewed this in today's meeting and we were wondering if you could remove
> > from this list any documents that were made newly publishable after last
> > Friday, 2/3.
> How's this? Didn't eliminate many of the rows, but some got knocked off.

I think this is good, but I'll let Amy review it to be sure she sees what she expects on the report. If she's satisfied with this version, she plans to send it to Vanessa.

Comment entered 2012-02-14 19:46:41 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-02-14 19:46:41
BZCOMMENTOR::Volker Englisch
BZCOMMENT::187

Bob, Blair gave us the green light to publish the media documents.

Could you please load the files that William had indicated to be ready?
[ ]Week 52
[ ]Week 53
[ ]Week 54
[ ]Week 55

Tonight's publishing job finished, so anytime would work for me.

Comment entered 2012-02-15 09:48:57 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-02-15 09:48:57
BZCOMMENTOR::Bob Kline
BZCOMMENT::188

(In reply to comment #187)

Weeks 52-55 have been loaded. We agreed that the zip files would be named with 3-digit week numbers. Please don't come up with creative variations on that pattern. I had to do some extra manual work to make sure Week_55_Rev1.zip and week_55_Rev2.zip got processed in the right order (lucky that I noticed them at all).

Errors for the run:

!5812 Wed Feb 15 08:54:40 2012: CDR650254 u'chemabrasion' (English) row 13 in Week_052_Rev1.zip: There is no item named u'Week_052_Rev1/650254_en.mp3' in the archive
!5812 Wed Feb 15 08:54:40 2012: CDR657511 u'FOLOTYN' (English) row 11 in Week_053.zip: There is no item named u'Week_053/657511_en.mp3' in the archive
!5812 Wed Feb 15 08:54:40 2012: CDR657511 u'FOLOTYN' (Spanish) row 12 in Week_053.zip: There is no item named u'Week_053/657511_es.mp3' in the archive
!5812 Wed Feb 15 08:54:40 2012: CDR657854 u'neoplasia de c\xe9lulas beta' (Spanish) row 46 in Week_053.zip: There is no item named u'Week_053/657854_es.mp3' in the archive
!5812 Wed Feb 15 08:54:40 2012: CDR658010 u'tiroiditis de Hashimoto' (Spanish) row 66 in Week_053.zip: There is no item named u'Week_053/658010_es.mp3' in the archive
!5812 Wed Feb 15 08:54:42 2012: CDR662027 u'Stavzor' (English) row 171 in Week_054.zip: There is no item named u'Week_054/662027_en.mp3' in the archive
!5812 Wed Feb 15 08:54:42 2012: CDR662027 u'Stavzor' (Spanish) row 172 in Week_054.zip: There is no item named u'Week_054/662027_es.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666205 u'Latisse' (English) row 1 in Week_55_Rev1.zip: There is no item named u'Week_055_Rev1/666205_en.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666670 u'Blackfan\u2013Diamond anemia' (English) row 2 in Week_55_Rev1.zip: There is no item named u'Week_055_Rev1/666670_en.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666675 u'MET/VEGFR-2 inhibitor GSK1363089' (English) row 3 in Week_55_Rev1.zip: There is no item named u'Week_055_Rev1/666675_en.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666707 u'anti-HGF monoclonal antibody AMG 102' (English) row 4 in Week_55_Rev1.zip: There is no item named u'Week_055_Rev1/666707_en.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666709 u'aceite etiodizado' (Spanish) row 5 in Week_55_Rev1.zip: There is no item named u'Week_055_Rev1/666709_es.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666670 u'Blackfan\u2013Diamond anemia' (English) row 1 in Week_55_Rev2.zip: There is no item named u'Week_055_Rev2/666670_en.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666675 u'MET/VEGFR-2 inhibitor GSK1363089' (English) row 2 in Week_55_Rev2.zip: There is no item named u'Week_055_Rev2/666675_en.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666707 u'anti-HGF monoclonal antibody AMG 102' (English) row 3 in Week_55_Rev2.zip: There is no item named u'Week_055_Rev2/666707_en.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666709 u'aceite etiodizado' (Spanish) row 4 in Week_55_Rev2.zip: There is no item named u'Week_055_Rev2/666709_es.mp3' in the archive
!5812 Wed Feb 15 08:54:43 2012: CDR666205 u'Latisse' (English) row 5 in Week_55_Rev2.zip: There is no item named u'Week_055_Rev2/666205_en.mp3' in the archive
2012-02-15 08:59:05: CDR643103: unable to find name node for u'radioterapia de fracci\xf3n acelerada' in CDR643103
2012-02-15 08:59:43: CDR644750: unable to find name node for u'neutropenia de Kostmann' in CDR644750
2012-02-15 08:59:47: CDR644761: unable to find name node for u'biorretrolimentaci\xf3n EEG' in CDR644761
2012-02-15 09:00:20: CDR649003: unable to find name node for u'micele polim\xe9rico cargado con paclitaxel' in CDR649003
2012-02-15 09:01:05: CDR651195: unable to find name node for u'terapia farmacol\xf3gica' in CDR651195
2012-02-15 09:02:35: CDR658366: unable to find name node for u'vacuna para el papilomavirus humano16/18 L1 part\xedcula semejante al virus/AS04' in CDR658366
2012-02-15 09:03:30: CDR661955: unable to find name node for u'vacuna genotecnol\xf3gica tetravalente del papilomavirus humano (tipos 6, 11, 16 y 18)' in CDR661955
2012-02-15 09:03:31: CDR661957: unable to find name node for u'vacuna contra el tumor de mama que segrega GM-CSF' in CDR661957
2012-02-15 09:03:39: CDR662024: unable to find name node for u'Sindrome Rothmund-Thomposon' in CDR662024
2012-02-15 09:04:45: CDR666689: unable to find name node for u'prueba del ADN del papilomavirus humano' in CDR666689
2012-02-15 09:04:59: CDR669655: unable to find name node for u'aspiraci\xf3n y biopsia de la m\xe9dula \xf3sea' in CDR669655

Comment entered 2012-02-15 09:54:53 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-02-15 09:54:53
BZCOMMENTOR::Volker Englisch
BZCOMMENT::189

(In reply to comment #188)
> Weeks 52-55 have been loaded. We agreed that the zip files would be named with
> 3-digit week numbers. Please don't come up with creative variations on that
> pattern.

Is it OK to rename the files to match our pattern or would that throw off any additional processing at this point?
I agree those file names should have followed the specified pattern.

Comment entered 2012-02-15 10:09:13 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-02-15 10:09:13
BZCOMMENTOR::Bob Kline
BZCOMMENT::190

(In reply to comment #189)

> Is it OK to rename the files to match our pattern or would that throw off any
> additional processing at this point?

It wouldn't affect any of my software, but we'll need to check with Alan when he gets back to find out what it would do to his web interface (and the tables behind it).

> I agree those file names should have followed the specified pattern.

In fact, looking over the error logs posted above, it seems that the mistake went deeper, and was the cause of the software failing to find lots of the sound files, because the directories inside the zip files didn't match the paths specified in the spreadsheets.

Comment entered 2012-02-28 11:22:07 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-02-28 11:22:07
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::191

The following files for weeks 56 and 58 are ready for uploading onto Bach. There was no activity for week 57.

Week_056.zip
Week_056_Rev1.zip
Week_056_Rev2.zip
Week_056_Rev3.zip
Week_058.zip
Week_058_Rev1.zip

Comment entered 2012-03-01 19:14:48 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-03-01 19:14:48
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::192

Files for weeks 59 and 60 have been reviewed and are ready to go:
Week_059.zip
Week_059_Rev1.zip
Week_060.zip
Week_060_Rev1.zip

[X] Week 56
[ ] Week 57 (No activity)
[X] Week 58
[X] Week 59
[X] Week 60

Comment entered 2012-03-02 09:42:25 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-03-02 09:42:25
BZCOMMENTOR::Volker Englisch
BZCOMMENT::193

Bob, could we load these audio files (Weeks 56-60) this morning?

Comment entered 2012-03-02 11:34:29 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-03-02 11:34:29
BZCOMMENTOR::Bob Kline
BZCOMMENT::194

(In reply to comment #193)
> Bob, could we load these audio files (Weeks 56-60) this morning?

Done.

!7012 Fri Mar 02 10:01:02 2012: CDR686898 u'zibotent\xe1n' (Spanish) row 172 in Week_056.zip: There is no item named u'Week_056/686898_es.mp3' in the archive
!7012 Fri Mar 02 10:01:04 2012: CDR44576 u'kilogram' (English) row 123 in Week_059.zip: There is no item named u'Week_059/44576_en.mp3' in the archive
2012-03-02 10:08:06: CDR45084: unable to find name node for u'hidroxicloroquina' in CDR45084
2012-03-02 10:08:33: CDR674707: unable to find name node for u'Nervio fr\xe9nico' in CDR674707
2012-03-02 10:08:35: CDR674712: unable to find name node for u'anastomosis colorrectar directa' in CDR674712
2012-03-02 10:08:41: CDR680687: unable to find name node for u'fl\xfaor F-18 fluorometilcolina' in CDR680687
2012-03-02 10:08:43: CDR680693: unable to find name node for u'C-11 colina' in CDR680693
2012-03-02 10:08:44: CDR680699: unable to find name node for u'marcadores fiduciarios de semillas de oro' in CDR680699
2012-03-02 10:08:45: CDR680700: unable to find name node for u'marcadores fiduciarios de oro' in CDR680700
2012-03-02 10:08:45: CDR680704: unable to find name node for u'Tumor fibrohistioc\xedtico plexiforme' in CDR680704
2012-03-02 10:08:58: CDR683250: unable to find name node for u'brainstem auditory evoked response test' in CDR683250
2012-03-02 10:08:59: CDR683252: unable to find name node for u'auditory brainstem response test' in CDR683252
2012-03-02 10:08:59: CDR683252: unable to find name node for u'auditory brainstem response test' in CDR683252
2012-03-02 10:10:08: CDR692083: unable to find name node for u'chlorambucil/prednisone' in CDR692083
2012-03-02 10:10:08: CDR692084: unable to find name node for u'chlorambucil/prednisone regimen' in CDR692084

Comment entered 2012-03-07 18:35:03 by alan

BZDATETIME::2012-03-07 18:35:03
BZCOMMENTOR::Alan Meyer
BZCOMMENT::195

(In reply to comment #190)
> (In reply to comment #189)
>
> > Is it OK to rename the files to match our pattern or would
> > that throw off any additional processing at this point?
>
> It wouldn't affect any of my software, but we'll need to check
> with Alan when he gets back to find out what it would do to his
> web interface (and the tables behind it).

I think that renaming the files would break parts of my code. If
the user has already started reviewing a zipfile, I record the
name of the zipfile in the database. I then use that name in
subsequent processing. In particular, when the user resumes
review of the zipfile I search for the zipfile using the name
that I previously recorded.

If the name changes after some audio files have been reviewed,
the renamed file will be treated as new and the individual audio
files would have to be approved again. I don't know what the
effects of doing that would be, but I should think that, at the
least, it would involve some extra work by reviewers, and some
inaccuracies in reports based on the stored review data.

Also, an attempt to resume reviewing of the old filename would
produce a file not found error.

Everything should be fine if renaming is done before audio review
has started.

If we need to rename during the process, I think it would be
necessary either to modify the program to accommodate that, or
else hand modify the database (in just one row of one table) to
incorporate the corrected name immediately after making the
correction and before a user logs on to review the corrected
files. Hand modification should be easiest if we only have to do
this once or a few times.

Comment entered 2012-03-08 10:01:35 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-03-08 10:01:35
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::196

(In reply to comment #188)
> (In reply to comment #187)
>
> Errors for the run:
>

The errors have been fixed.

(In reply to comment #194)
> (In reply to comment #193)
> > Bob, could we load these audio files (Weeks 56-60) this morning?
>
> Done.
>

The errors have been fixed.

Comment entered 2012-04-03 09:08:42 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-03 09:08:42
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::197

The following files for weeks 61-64 have been reviewed and are ready to be uploaded to Bach.

Week_061.zip
Week_061_Rev1.zip
Week_061_Rev2.zip
Week_062.zip
Week_062_Rev1.zip
Week_062_Rev2.zip
Week_063.zip
Week_063_Rev1.zip
Week_064.zip
Week_064_Rev1.zip

Comment entered 2012-04-03 10:15:40 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-04-03 10:15:40
BZCOMMENTOR::Volker Englisch
BZCOMMENT::198

Due to the upgrade to WCMS 2.4.1 on Friday we'll have to wait until next week before we can load these 1083 new audio files.

Comment entered 2012-04-10 10:44:15 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-04-10 10:44:15
BZCOMMENTOR::Volker Englisch
BZCOMMENT::199

Bob, whenever you feel better this week (Tue, Wed, or Thu) could you please load these audio files (Weeks 61-64) sometime in the morning?

Comment entered 2012-04-11 10:30:54 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-04-11 10:30:54
BZCOMMENTOR::Bob Kline
BZCOMMENT::200

(In reply to comment #199)
> Bob, whenever you feel better this week (Tue, Wed, or Thu) could you please
> load these audio files (Weeks 61-64) sometime in the morning?

Done.

!7048 Wed Apr 11 10:08:27 2012: CDR46040 u'O(6)-benzylguanine' (Spanish) row 11 in Week_062_Rev1.zip: There is no item named u'Week_062_Rev1/46040_es.mp3' in the archive
!7048 Wed Apr 11 10:08:29 2012: CDR367456 u'cicuta verde' (Spanish) row 122 in Week_064.zip: There is no item named u'Week_064/367456_es.mp3' in the archive
2012-04-11 10:13:37: CDR45217: unable to find name node for u'hydromorphone' in CDR45217
2012-04-11 10:15:23: CDR45870: unable to find name node for u'retinoideo' in CDR45870
2012-04-11 10:15:47: CDR46040: unable to find name node for u'O(6)-benzylguanine' in CDR46040
2012-04-11 10:17:58: CDR315757: unable to find name node for u'cilengitide' in CDR315757
2012-04-11 10:18:52: CDR340935: unable to find name node for u'ravuconazole' in CDR340935

Comment entered 2012-04-11 10:34:28 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-11 10:34:28
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::201

Files for week 65 have been completed as of this morning. Would you want to load them now?

Week_065.zip Completed 2012-03-30 15:04:43
Week_065_Rev1.zip Completed 2012-04-05 10:58:24

Comment entered 2012-04-11 10:37:53 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-04-11 10:37:53
BZCOMMENTOR::Volker Englisch
BZCOMMENT::202

It's too late now. I just started the publishing job 2 minutes ago. Besides, since Week 66 is the last of the batch updates we should probably just run Week 65/66 together as the last batch load once they are ready.

Comment entered 2012-04-16 11:48:17 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-16 11:48:17
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::203

(In reply to comment #200)
> (In reply to comment #199)
> > Bob, whenever you feel better this week (Tue, Wed, or Thu) could you please
> > load these audio files (Weeks 61-64) sometime in the morning?
>
> Done.
>
The errors have been fixed.

  • A few new English terms have been added in the past week and I am waiting for the Spanish team to complete their portion before putting in a request for the new spreadsheet to be sent to Vanessa.

Comment entered 2012-04-17 15:00:36 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-17 15:00:36
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::204

Files for week 66 have been reviewed and are available for uploading to Bach.
Week_066.zip
Week_066_Rev1.zip
Week_066_Rev2.zip

I also provided completed files for week 65 in Comment # 201.

Comment entered 2012-04-23 11:45:26 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-23 11:45:26
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::205

We need to generate a new spreadsheet for Vanessa. The new spreadsheet should capture all new terms that have been added since the last report was generated for her on 2/7/2012 (I believe). It looks like the last one was generated on 2/7/2012 however only terms that were ready as of 2/3/2012 were included so we will need the new report to start from 2/4/2012 in order not to capture terms that have already been recorded. It should also capture terms that have had new pronunciation keys added since the last spreadsheet was generated.

In addition to the above, we need to also include (in the same spreadsheet), terms from the attached spreadsheet, which are acronyms that have been selected to also have audio pronunciations. The English terms we want to include are highlighted in yellow; the Spanish terms are in green. As you will see in the spreadsheet, in many cases, we will have an audio recording for an English term, but not for its translated Spanish term. There are also some Spanish terms where we will have an audio recording, but not for its translated English term. So, the spreadsheet is not going to show a one-to-one match. If possible only display the acronym that needs to be recorded.

Comment entered 2012-04-23 11:45:26 by Osei-Poku, William (NIH/NCI) [C]

Attachment Gloss Terms without Pronuns - Acronyms and others_FINAL_4-23-12.xls has been added with description: Acronyms list

Comment entered 2012-04-23 16:02:41 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-04-23 16:02:41
BZCOMMENTOR::Bob Kline
BZCOMMENT::206

(In reply to comment #205)
> The English terms we want to include are highlighted in yellow; the
> Spanish terms are in green. As you will see in the spreadsheet, in
> many cases, we will have an audio recording for an English term, but
> not for its translated Spanish term. There are also some Spanish
> terms where we will have an audio recording, but not for its
> translated English term.

You need to use some other mechanism besides formatting of the spreadsheet cells to convey which terms you want pronunciations for. One way to do it would be to just provide the term name strings for the names Vanessa should record. And the names have to match what's in the glossary document.

Comment entered 2012-04-24 16:40:12 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-24 16:40:12
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::207

(In reply to comment #206)

> You need to use some other mechanism besides formatting of the spreadsheet
> cells to convey which terms you want pronunciations for. One way to do it
> would be to just provide the term name strings for the names Vanessa should
> record. And the names have to match what's in the glossary document.

How about generating the new report and including the list of attached acronyms at the end of the new report and then we will format it for Vanessa so she will know which acronyms she needs to provide recordings for?

Comment entered 2012-04-24 16:43:00 by Beckwith, Margaret (NIH/NCI) [E]

BZDATETIME::2012-04-24 16:43:00
BZCOMMENTOR::Margaret Beckwith
BZCOMMENT::208

William, can't you just go through and delete all of the acronym terms that don't require a pronunciation so Bob is working with a clean list? Am I missing something?

Comment entered 2012-04-24 17:39:56 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-24 17:39:56
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::209

(In reply to comment #208)
> William, can't you just go through and delete all of the acronym terms that
> don't require a pronunciation so Bob is working with a clean list? Am I
> missing something?

Margaret,
The list contains only terms that at least one language needs to be recorded. So it has been cleaned to the point where only terms that need to be recorded are included and further cleaning will not reduce the number of terms on the spreadsheet. That is, if we go by CDR ID. However, unlike the usual reports we have been sending Vanessa, there are some terms/acronyms that only the English or the Spanish term needs to be recorded. We are communicating this information by color-coding the spreadsheet. But it seems Bob may not be able to work with the colors so what I have proposed is that he would generate the report by ignoring the distinction/colors and then we can color-code it again so that Vanessa would know which Language to record in cases where she does not need to record for both languages.

On the other hand in such cases where she does not need to record for both languages, we can delete the acronyms that need not be recorded. The spreadsheet will have some blank cells and perhaps look a little confusing at first but Amy can explain this to Vanessa. Also, we can delete the acronyms either before or after Bob generates the report. Please let me know if we should update the spreadsheet now or wait until Bob has completed it. We prefer to update it after Bob has completed it because Amy and Linda will review it before sending it out to Vanessa and they can update it then.

Comment entered 2012-04-24 18:03:33 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-24 18:03:33
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::210

Since the term name fields have notes (for Vanessa) in parenthesis, it could pose a problem so we will proceed to remove the notes from the term name field so that the term name field will have only terms as they appear in the CDR records as Bob suggested.

Comment entered 2012-04-24 18:30:34 by alan

BZDATETIME::2012-04-24 18:30:34
BZCOMMENTOR::Alan Meyer
BZCOMMENT::211

(In reply to comment #209)
> (In reply to comment #208)
> > William, can't you just go through and delete all of the
> > acronym terms that don't require a pronunciation so Bob is
> > working with a clean list? Am I missing something?
...
> ... where she does not need to record for both languages, we
> can delete the acronyms that need not be recorded. The
> spreadsheet will have some blank cells and perhaps look a
> little confusing at first but Amy can explain this to Vanessa.
> Also, we can delete the acronyms either before or after Bob
> generates the report. Please let me know if we should update
> the spreadsheet now or wait until Bob has completed it. We
> prefer to update it after Bob has completed it because Amy and
> Linda will review it before sending it out to Vanessa and they
> can update it then.

When the spreadsheet is returned by Vanessa, it has to be
readable by the Glossary Term Audio Review program, which is not
only unable to read formatting in the cells, but is also likely
to be confused by missing information in columns.

Furthermore, for every row in the sheet we'll either have to have
the reviewer arbitrarily assign an "Approved" review status to
that row, or we have to have the entire row gone from the
spreadsheet before the audio review program sees it. If we don't
do one of those two, and if we have a row in the spreadsheet for
something that won't be reviewed. the review program will
consider that there are unreviewed rows in the spreadsheet and
the spreadsheet has not been completed.

I therefore recommend that we adopt Margaret's solution:

If a row is generated by Bob's program and a human at CIAT
decides that no reading of the term in that row is required,
then someone at CIAT should delete the entire row.

In other words, if there is a term in the spreadsheet that should
not be read by Vanessa:

Don't just delete the contents of a cell.

Don't mark the cell in any way.

Don't add a special note in the reviewer notes column.

Just DELETE THE ENTIRE ROW before it is sent to Vanessa.

Is that reasonable and feasible?

Comment entered 2012-04-24 18:53:44 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-24 18:53:44
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::212

I am attaching a new acronyms list with the the comments in parenthesis removed.

Comment entered 2012-04-24 18:53:44 by Osei-Poku, William (NIH/NCI) [C]

Attachment Gloss Terms without Pronuns - Acronyms and others_FINAL1_4-24-12.xls has been added with description: Acronyms list

Comment entered 2012-04-24 19:08:24 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-24 19:08:24
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::213

(In reply to comment #211)
> Is that reasonable and feasible?

Yes it does. Thank you! I had even forgotten that the spreadsheet Bob would generate will have each term on its own row which makes deleting rows after the spreadsheet is generated seems a better approach.

>
> I therefore recommend that we adopt Margaret's solution:
>

Unless I am misinterpreting Margaret's comments, she was referring to cleaning up the report prior to Bob generating the spreadsheet.

Comment entered 2012-04-27 11:26:04 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-04-27 11:26:04
BZCOMMENTOR::Bob Kline
BZCOMMENT::214

(In reply to comment #205)

> We need to generate a new spreadsheet for Vanessa.

Attached.

> .... It should also capture terms that have had new pronunciation keys
> added since the last spreadsheet was generated.

Just to be clear, I'm interpreting that sentence literally (as meaning something different from "... have had new or changed pronunciation keys ...") even though it would be reasonable for someone to say of a glossary term name, "this one had an different pronunciation key at one time, and we've replaced that with a new pronunciation key." If a glossary term name document has already appeared in one of the spreadsheets we gave Vanessa earlier, you won't find it on the attached sheet, even if the pronunciation key changed (and even if it showed up on the acronym sheet – not likely to be an issue, since most or all of the acronyms have no pronunciation key at all).

Acronyms are at the bottom of the sheet.

Comment entered 2012-04-27 11:26:04 by Kline, Bob (NIH/NCI) [C]

Attachment request-4926-round-3.xls has been added with description: Sheet for round 3

Comment entered 2012-04-30 13:45:42 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-30 13:45:42
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::215

(In reply to comment #214)
> Created attachment 2246 [details]
> Sheet for round 3
>
> (In reply to comment #205)
>
>
> > We need to generate a new spreadsheet for Vanessa.
>
> Attached.
>

In addition to the acronyms and new terms correctly selected, the attached spreadsheet contains approximately 100 terms that already have audios. The terms actually appear to match ones contained in the file for Week 9. Here are sample terms that already have audio but included in the current spreadsheet:

44041 immunosuppressive - date audio document created 4-13-11; published 7-16-11
44300 subependymal -date audio document created 1-5-11; published 7-16-11
45676 edema - date audio document created 1-5-11; published 7-16-11
562725 Cardura - date audio document created 1-5-11; published 7-16-11
589421 Tazorac - date audio document created 3-24-11; published 7-16-11

Comment entered 2012-04-30 14:20:03 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-04-30 14:20:03
BZCOMMENTOR::Bob Kline
BZCOMMENT::216

(In reply to comment #215)

> In addition to the acronyms and new terms correctly selected, the attached
> spreadsheet contains approximately 100 terms that already have audios.

Good catch! I had checked the sheets we gave her for the two previous rounds, but I had forgotten about the test round, which got loaded into production after approval. Please check the attached set and see if it's right. Sorry about that!

Comment entered 2012-04-30 14:20:03 by Kline, Bob (NIH/NCI) [C]

Attachment request-4926-round-3.xls has been added with description: Sheet for round 3 (fixed)

Comment entered 2012-04-30 14:29:34 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-04-30 14:29:34
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::217

(In reply to comment #214)

>
> > .... It should also capture terms that have had new pronunciation keys
> > added since the last spreadsheet was generated.
>
> Just to be clear, I'm interpreting that sentence literally (as meaning
> something different from "... have had new or changed pronunciation keys
> ...") even though it would be reasonable for someone to say of a glossary term
> name, "this one had an different pronunciation key at one time, and we've
> replaced that with a new pronunciation key." If a glossary term name document
> has already appeared in one of the spreadsheets we gave Vanessa earlier, you
> won't find it on the attached sheet, even if the pronunciation key changed (and
> even if it showed up on the acronym sheet – not likely to be an issue, since
> most or all of the acronyms have no pronunciation key at all).
>

That is correct. When updating a term that has already had an audio recorded (and published) in the past, we are handling the updated audio outside of this process. We email the term to Vanessa to re-record. She then e-mails us the audio recording (mp3 file) of the term, and we manually update the audio record in the CDR. So what we need for the spreadsheet are:
1. Terms that are new and have pronunciation keys.
2. Older terms that has had new pronunciation keys added.

Comment entered 2012-05-01 09:45:48 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-05-01 09:45:48
BZCOMMENTOR::Volker Englisch
BZCOMMENT::218

Bob, would you mind loading those two weeks of audio files this morning that we still have in the queue? It's weeks 65 and 66.

Comment entered 2012-05-01 10:54:00 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-05-01 10:54:00
BZCOMMENTOR::Bob Kline
BZCOMMENT::219

(In reply to comment #218)
> Bob, would you mind loading those two weeks of audio files this morning that we
> still have in the queue? It's weeks 65 and 66.

Done.

!5472 Tue May 01 09:54:03 2012: CDR689977 u'pepsin' (English) row 49 in Week_066.zip: There is no item named u'Week_066/689977_en.mp3' in the archive
!5472 Tue May 01 09:54:03 2012: CDR689977 u'pepsina' (Spanish) row 50 in Week_066.zip: There is no item named u'Week_066/689977_es.mp3' in the archive
2012-05-01 09:57:59: CDR549434: unable to find name node for u'Uftoral' in CDR549434
2012-05-01 09:58:00: Doc CDR0000549434 made invalid by change - will NOT store it
New pub version would become invalid
No publishable document found for link target: MediaID=CDR0000732901
No publishable document found for link target: MediaID=CDR0000732900
2012-05-01 09:58:00: Warning for CDR0000549434: No publishable document found for link target: MediaID=CDR0000732901
2012-05-01 09:58:00: Warning for CDR0000549434: No publishable document found for link target: MediaID=CDR0000732900
2012-05-01 09:58:00: Warning for CDR0000549434: Non-publishable version will be created.
2012-05-01 09:59:02: CDR689749: unable to find name node for u'ensayos de vigilacia posteriores a la comercializaci\xf3n' in CDR689749
2012-05-01 09:59:32: CDR700148: unable to find name node for u'cervicectom\xedauterina radical' in CDR700148

Comment entered 2012-05-01 15:04:48 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-05-01 15:04:48
BZCOMMENTOR::Volker Englisch
BZCOMMENT::220

The latest batch of 511 audio files has been loaded to Cancer.gov. These audio files will be available after the next weekly publishing job.

Comment entered 2012-05-02 13:42:27 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-05-02 13:42:27
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::221

(In reply to comment #216)
>
> Good catch! I had checked the sheets we gave her for the two previous rounds,
> but I had forgotten about the test round, which got loaded into production
> after approval. Please check the attached set and see if it's right. Sorry
> about that!

No problem. This one looks good. Thanks! It's been sent to Vanessa for recording the audios.

Comment entered 2012-05-02 13:43:29 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-05-02 13:43:29
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::222

(In reply to comment #219)
> (In reply to comment #218)
> > Bob, would you mind loading those two weeks of audio files this morning that we
> > still have in the queue? It's weeks 65 and 66.
>
> Done.
>
The errors have been fixed.

Comment entered 2012-05-30 09:27:21 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-05-30 09:27:21
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::223

The following files are ready to upload to Bach.

Week_070.zip
Week_072.zip
Week_072_Rev1.zip
Week_072_Rev2.zip

The above files are for the most recent spreadsheet that Bob generated which included the acronyms. There are currently no pending terms/spreadsheets either in the CDR or sent to Vanessa for recording.

Comment entered 2012-05-30 10:45:12 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-05-30 10:45:12
BZCOMMENTOR::Volker Englisch
BZCOMMENT::224

(In reply to comment #223)
> Week_070.zip
> Week_072.zip
> Week_072_Rev1.zip
> Week_072_Rev2.zip

Is there no Week 71?

Comment entered 2012-05-30 11:33:30 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-05-30 11:33:30
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::225

(In reply to comment #224)
> (In reply to comment #223)
> > Week_070.zip
> > Week_072.zip
> > Week_072_Rev1.zip
> > Week_072_Rev2.zip
>
> Is there no Week 71?

No. There was no Week 71.

Comment entered 2012-05-31 10:43:46 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-05-31 10:43:46
BZCOMMENTOR::Volker Englisch
BZCOMMENT::226

Bob, could you load those audio files this morning?

Comment entered 2012-05-31 12:14:11 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-05-31 12:14:11
BZCOMMENTOR::Bob Kline
BZCOMMENT::227

(In reply to comment #226)
> Bob, could you load those audio files this morning?

Done.

!5044 Thu May 31 11:52:54 2012: CDR725034 u'vismodegib' (English) row 5 in Week_072_Rev1.zip: There is no item named u'Week_072_Rev1/725034_en.mp3' in the archive
2012-05-31 11:55:21: Document 662964: Unable to check out CWD for CDR0000662964: Document CDR0000662964 already checked out to user efelker
2012-05-31 11:55:40: CDR721252: unable to find name node for u'gene ALK' in CDR721252

Comment entered 2012-06-05 17:40:53 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-06-05 17:40:53
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::228

(In reply to comment #227)
> (In reply to comment #226)
> > Bob, could you load those audio files this morning?
>
> Done.
>
> !5044 Thu May 31 11:52:54 2012: CDR725034 u'vismodegib' (English) row 5 in
> Week_072_Rev1.zip: There is no item named u'Week_072_Rev1/725034_en.mp3' in the
> archive
> 2012-05-31 11:55:21: Document 662964: Unable to check out CWD for
> CDR0000662964: Document CDR0000662964 already checked out to user efelker
> 2012-05-31 11:55:40: CDR721252: unable to find name node for u'gene ALK' in
> CDR721252

The errors have been fixed.

Comment entered 2012-06-08 13:12:11 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-06-08 13:12:11
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::229

As discussed during yesterday's meeting, we will need a new spreadsheet of glossary terms to be generated for Vanessa. I did say yesterday that I will be providing the criteria (which I have below) for generating the new spreadsheet but it seems to me that it won't be necessary for generating the spreadsheet since what we have been doing all along will be able to capture the terms we expect to be included in this new spreadsheet and it will probably be safer to continue the existing process in order to be able capture all other terms (beside the Genetic terms) that have had pronunciation keys added. The terms we expect to be included in the new spreadsheet are:

i. Glossary Terms with a Dictionary value of Genetics
ii. And have an Audience of Health Professional
iii. And have Pronunciation keys
iv. And do not have audios already recorded for them

We also did say that we want to include all new terms that have recently been added and have pronunciation keys and do not have audios already recorded for them. However, there haven't been many terms added since the last spreadsheet was generated so the bulk of the terms will be the Genetics terms. It looks like the last spreadsheet was generated on 04-30-2012 so you can limit the new spreadsheet to terms that have been added since then.
Lastly, we do have a spreadsheet of the Genetics terms but I do not think it will help in generating the new report. However, if it is necessary for me to upload it to this issue, please let me know and I will.

Comment entered 2012-06-11 08:59:16 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-06-11 08:59:16
BZCOMMENTOR::Bob Kline
BZCOMMENT::230

(In reply to comment #229)

> ... it will probably be safer to continue the existing process ....

The previous comment was pretty long, so I'll summarize what that process does here: the report includes all GlossaryTermName documents which;

(1) are on Cancer.gov
(2) were not included in any of the previous reports
(3) have a pronunciation for the English name

If you confirm that this is what you want, I'll run the report.

Comment entered 2012-06-11 10:18:40 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-06-11 10:18:40
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::231

(In reply to comment #230)
> (In reply to comment #229)
> > ... it will probably be safer to continue the existing process ....
> The previous comment was pretty long, so I'll summarize what that process does
> here: the report includes all GlossaryTermName documents which;
> (1) are on Cancer.gov
> (2) were not included in any of the previous reports
> (3) have a pronunciation for the English name
> If you confirm that this is what you want, I'll run the report.

Yes. This is what I want. Thank you!

Comment entered 2012-06-11 13:36:26 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-06-11 13:36:26
BZCOMMENTOR::Bob Kline
BZCOMMENT::232

Good. I have set it up so that, as long as we stick with these requirements, it will be trivial for me to generate future spreadsheets.

Comment entered 2012-06-11 13:36:26 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926-20120611.xls has been added with description: Next round of terms for pronunciation

Comment entered 2012-06-14 10:00:57 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-06-14 10:00:57
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::233

(In reply to comment #232)
> Created attachment 2260 [details]
> Next round of terms for pronunciation
>
> Good. I have set it up so that, as long as we stick with these requirements,
> it will be trivial for me to generate future spreadsheets.

Thanks! The file has been sent to Vanessa.

Comment entered 2012-06-28 15:12:55 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-06-28 15:12:55
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::234

The following files have been reviewed and are ready to be uploaded onto Bach.

Week_077_Rev1.zip
Week_077_Rev2.zip
Week_77.zip

Comment entered 2012-07-12 11:29:24 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-07-12 11:29:24
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::235

We are ready for a new spreadsheet. Please generate a new spreadsheet of terms to be sent to Vanessa for audio recording.

Comment entered 2012-07-12 14:47:45 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-07-12 14:47:45
BZCOMMENTOR::Bob Kline
BZCOMMENT::236

Week 77 has been processed. Errors:

!340 Thu Jul 12 13:57:58 2012: CDR339330 u'nonpenetrance' (English) row 1 in Week_77.zip: There is no item named u'Week_077/339330_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339331 u'variable expression' (English) row 2 in Week_77.zip: There is no item named u'Week_077/339331_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339332 u'late or variable onset' (English) row 3 in Week_77.zip: There is no item named u'Week_077/339332_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339333 u'genetic heterogeneity' (English) row 4 in Week_77.zip: There is no item named u'Week_077/339333_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339334 u'genomic imprinting' (English) row 5 in Week_77.zip: There is no item named u'Week_077/339334_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339335 u'assisted reproductive technology' (English) row 6 in Week_77.zip: There is no item named u'Week_077/339335_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339337 u'allele' (English) row 7 in Week_77.zip: There is no item named u'Week_077/339337_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339338 u'autosomal dominant' (English) row 8 in Week_77.zip: There is no item named u'Week_077/339338_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339339 u'autosomal recessive' (English) row 9 in Week_77.zip: There is no item named u'Week_077/339339_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339341 u'heterozygous genotype' (English) row 10 in Week_77.zip: There is no item named u'Week_077/339341_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339342 u'homozygous genotype' (English) row 11 in Week_77.zip: There is no item named u'Week_077/339342_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339343 u'inherited cancer syndrome' (English) row 12 in Week_77.zip: There is no item named u'Week_077/339343_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339344 u'penetrance' (English) row 13 in Week_77.zip: There is no item named u'Week_077/339344_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR339348 u'X-linked recessive' (English) row 14 in Week_77.zip: There is no item named u'Week_077/339348_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR415709 u'logarithm of the odd score' (English) row 15 in Week_77.zip: There is no item named u'Week_077/415709_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR425373 u'LOD score' (English) row 16 in Week_77.zip: There is no item named u'Week_077/425373_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR425374 u'linkage analysis' (English) row 17 in Week_77.zip: There is no item named u'Week_077/425374_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460124 u'affected' (English) row 18 in Week_77.zip: There is no item named u'Week_077/460124_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460125 u'allelic heterogeneity' (English) row 19 in Week_77.zip: There is no item named u'Week_077/460125_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460126 u'aneuploidy' (English) row 20 in Week_77.zip: There is no item named u'Week_077/460126_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460127 u'Ashkenazi Jews' (English) row 21 in Week_77.zip: There is no item named u'Week_077/460127_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460128 u'autosomal' (English) row 22 in Week_77.zip: There is no item named u'Week_077/460128_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460129 u'band' (English) row 23 in Week_77.zip: There is no item named u'Week_077/460129_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460132 u'carrier' (English) row 24 in Week_77.zip: There is no item named u'Week_077/460132_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460133 u'carrier rate' (English) row 25 in Week_77.zip: There is no item named u'Week_077/460133_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460134 u'clone' (English) row 26 in Week_77.zip: There is no item named u'Week_077/460134_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460135 u'codon' (English) row 27 in Week_77.zip: There is no item named u'Week_077/460135_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460136 u'conformation-sensitive gel electrophoresis' (English) row 28 in Week_77.zip: There is no item named u'Week_077/460136_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460138 u'consanguinity' (English) row 29 in Week_77.zip: There is no item named u'Week_077/460138_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460139 u'cosegregation' (English) row 30 in Week_77.zip: There is no item named u'Week_077/460139_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460140 u'consultand' (English) row 31 in Week_77.zip: There is no item named u'Week_077/460140_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460141 u'deletion' (English) row 32 in Week_77.zip: There is no item named u'Week_077/460141_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460142 u'de novo mutation' (English) row 33 in Week_77.zip: There is no item named u'Week_077/460142_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460143 u'new mutation' (English) row 34 in Week_77.zip: There is no item named u'Week_077/460143_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460144 u'disease-causing mutation' (English) row 35 in Week_77.zip: There is no item named u'Week_077/460144_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460145 u'domain' (English) row 36 in Week_77.zip: There is no item named u'Week_077/460145_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460146 u'exon' (English) row 37 in Week_77.zip: There is no item named u'Week_077/460146_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460147 u'false-positive result' (English) row 38 in Week_77.zip: There is no item named u'Week_077/460147_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460148 u'familial' (English) row 39 in Week_77.zip: There is no item named u'Week_077/460148_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460152 u'frameshift mutation' (English) row 40 in Week_77.zip: There is no item named u'Week_077/460152_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460155 u'index case' (English) row 41 in Week_77.zip: There is no item named u'Week_077/460155_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460156 u'informative' (English) row 42 in Week_77.zip: There is no item named u'Week_077/460156_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460158 u'kindred' (English) row 43 in Week_77.zip: There is no item named u'Week_077/460158_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460161 u'linkage' (English) row 44 in Week_77.zip: There is no item named u'Week_077/460161_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460163 u'locus heterogeneity' (English) row 45 in Week_77.zip: There is no item named u'Week_077/460163_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460164 u'missense mutation' (English) row 46 in Week_77.zip: There is no item named u'Week_077/460164_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460195 u'mutation analysis' (English) row 47 in Week_77.zip: There is no item named u'Week_077/460195_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460196 u'mode of inheritance' (English) row 48 in Week_77.zip: There is no item named u'Week_077/460196_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460197 u'mosaicism' (English) row 49 in Week_77.zip: There is no item named u'Week_077/460197_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460198 u'negative predictive value' (English) row 50 in Week_77.zip: There is no item named u'Week_077/460198_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460200 u'novel mutation' (English) row 51 in Week_77.zip: There is no item named u'Week_077/460200_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460202 u'null allele' (English) row 52 in Week_77.zip: There is no item named u'Week_077/460202_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460203 u'phenotype' (English) row 53 in Week_77.zip: There is no item named u'Week_077/460203_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460204 u'point mutation' (English) row 54 in Week_77.zip: There is no item named u'Week_077/460204_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460205 u'population risk' (English) row 55 in Week_77.zip: There is no item named u'Week_077/460205_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460206 u'positive predictive value' (English) row 56 in Week_77.zip: There is no item named u'Week_077/460206_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460208 u'predisposing mutation' (English) row 57 in Week_77.zip: There is no item named u'Week_077/460208_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460209 u'susceptibility gene' (English) row 58 in Week_77.zip: There is no item named u'Week_077/460209_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460210 u'presymptomatic testing' (English) row 59 in Week_77.zip: There is no item named u'Week_077/460210_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460211 u'proband' (English) row 60 in Week_77.zip: There is no item named u'Week_077/460211_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460212 u'propositus' (English) row 61 in Week_77.zip: There is no item named u'Week_077/460212_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460214 u'risk assessment' (English) row 62 in Week_77.zip: There is no item named u'Week_077/460214_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460215 u'genetic screening' (English) row 63 in Week_77.zip: There is no item named u'Week_077/460215_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460216 u'cancer screening' (English) row 64 in Week_77.zip: There is no item named u'Week_077/460216_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460217 u'single-stranded conformational polymorphism' (English) row 65 in Week_77.zip: There is no item named u'Week_077/460217_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460219 u'Southern blot' (English) row 66 in Week_77.zip: There is no item named u'Week_077/460219_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460220 u'splicing' (English) row 67 in Week_77.zip: There is no item named u'Week_077/460220_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460222 u'trinucleotide repeat' (English) row 68 in Week_77.zip: There is no item named u'Week_077/460222_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460223 u'trisomy' (English) row 69 in Week_77.zip: There is no item named u'Week_077/460223_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460224 u'unaffected' (English) row 70 in Week_77.zip: There is no item named u'Week_077/460224_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460226 u'carrier frequency' (English) row 71 in Week_77.zip: There is no item named u'Week_077/460226_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460229 u'proposita' (English) row 72 in Week_77.zip: There is no item named u'Week_077/460229_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR460230 u'counselee' (English) row 73 in Week_77.zip: There is no item named u'Week_077/460230_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR464169 u'loss of heterozygosity' (English) row 74 in Week_77.zip: There is no item named u'Week_077/464169_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR485392 u'haplotype' (English) row 75 in Week_77.zip: There is no item named u'Week_077/485392_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR485393 u'attributable risk' (English) row 76 in Week_77.zip: There is no item named u'Week_077/485393_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR485395 u'second-degree relative' (English) row 77 in Week_77.zip: There is no item named u'Week_077/485395_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR538227 u'phenocopy' (English) row 78 in Week_77.zip: There is no item named u'Week_077/538227_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR556483 u'noncarrier' (English) row 79 in Week_77.zip: There is no item named u'Week_077/556483_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR556485 u'uninformative' (English) row 80 in Week_77.zip: There is no item named u'Week_077/556485_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR556486 u'deleterious mutation' (English) row 81 in Week_77.zip: There is no item named u'Week_077/556486_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR556493 u'variant of unknown significance' (English) row 82 in Week_77.zip: There is no item named u'Week_077/556493_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR556494 u'unclassified variant' (English) row 83 in Week_77.zip: There is no item named u'Week_077/556494_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR556495 u'variant of uncertain significance' (English) row 84 in Week_77.zip: There is no item named u'Week_077/556495_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR556497 u'indeterminate' (English) row 85 in Week_77.zip: There is no item named u'Week_077/556497_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR570712 u'founder mutation' (English) row 86 in Week_77.zip: There is no item named u'Week_077/570712_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR580966 u'dirty necrosis' (English) row 87 in Week_77.zip: There is no item named u'Week_077/580966_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR600383 u'copy number variant' (English) row 88 in Week_77.zip: There is no item named u'Week_077/600383_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR649780 u'yttrium Y 90 epratuzumab tetraxetan' (English) row 89 in Week_77.zip: There is no item named u'Week_077/649780_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR649780 u'itrio Y 90 epratuzumab tetraxet\xe1n' (Spanish) row 90 in Week_77.zip: There is no item named u'Week_077/649780_es.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR656227 u'nutrigenomics' (English) row 91 in Week_77.zip: There is no item named u'Week_077/656227_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR660737 u'intron' (English) row 92 in Week_77.zip: There is no item named u'Week_077/660737_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR660739 u'genotype' (English) row 93 in Week_77.zip: There is no item named u'Week_077/660739_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR660750 u'genetic anticipation' (English) row 94 in Week_77.zip: There is no item named u'Week_077/660750_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR666091 u'tagSNP' (English) row 95 in Week_77.zip: There is no item named u'Week_077/666091_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR666092 u'tagging SNP' (English) row 96 in Week_77.zip: There is no item named u'Week_077/666092_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR666094 u'linkage disequilibrium' (English) row 97 in Week_77.zip: There is no item named u'Week_077/666094_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671174 u'nonsense mutation' (English) row 98 in Week_77.zip: There is no item named u'Week_077/671174_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671177 u'heteroduplex analysis' (English) row 99 in Week_77.zip: There is no item named u'Week_077/671177_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671816 u'modified FOLFOX-6' (English) row 100 in Week_77.zip: There is no item named u'Week_077/671816_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671816 u'FOLFOX-6 modificado' (Spanish) row 101 in Week_77.zip: There is no item named u'Week_077/671816_es.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671817 u'modified FOLFOX-6 regimen' (English) row 102 in Week_77.zip: There is no item named u'Week_077/671817_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671817 u'r\xe9gimen FOLFOX-6 modificado' (Spanish) row 103 in Week_77.zip: There is no item named u'Week_077/671817_es.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671821 u'mFOLFOX-6 regimen' (English) row 104 in Week_77.zip: There is no item named u'Week_077/671821_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671821 u'r\xe9gimen FOLFOX-6m' (Spanish) row 105 in Week_77.zip: There is no item named u'Week_077/671821_es.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671823 u'FOLFOX-4 regimen' (English) row 106 in Week_77.zip: There is no item named u'Week_077/671823_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671823 u'r\xe9gimen FOLFOX-4' (Spanish) row 107 in Week_77.zip: There is no item named u'Week_077/671823_es.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671825 u'FOLFOX-7 regimen' (English) row 108 in Week_77.zip: There is no item named u'Week_077/671825_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671825 u'r\xe9gimen FOLFOX-7' (Spanish) row 109 in Week_77.zip: There is no item named u'Week_077/671825_es.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671827 u'FOLFOX-6 regimen' (English) row 110 in Week_77.zip: There is no item named u'Week_077/671827_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR671827 u'r\xe9gimen FOLFOX-6' (Spanish) row 111 in Week_77.zip: There is no item named u'Week_077/671827_es.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR681111 u'single-stranded conformation polymorphism analysis' (English) row 112 in Week_77.zip: There is no item named u'Week_077/681111_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR681112 u'SSCP analysis' (English) row 113 in Week_77.zip: There is no item named u'Week_077/681112_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR681114 u'Crohn-like reaction' (English) row 114 in Week_77.zip: There is no item named u'Week_077/681114_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR681115 u'Crohn disease-like reaction' (English) row 115 in Week_77.zip: There is no item named u'Week_077/681115_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR687002 u'multiplex ligation-dependent probe amplification' (English) row 116 in Week_77.zip: There is no item named u'Week_077/687002_en.mp3' in the archive
!340 Thu Jul 12 13:57:58 2012: CDR689607 u'Z-score' (English) row 117 in Week_77.zip: There is no item named u'Week_077/689607_en.mp3' in the archive
!340 Thu Jul 12 13:57:59 2012: CDR695591 u'genodermatosis' (English) row 118 in Week_77.zip: There is no item named u'Week_077/695591_en.mp3' in the archive
!340 Thu Jul 12 13:57:59 2012: CDR699860 u'multiplex genomic test' (English) row 119 in Week_77.zip: There is no item named u'Week_077/699860_en.mp3' in the archive
!340 Thu Jul 12 13:57:59 2012: CDR712689 u'segregation analysis' (English) row 120 in Week_77.zip: There is no item named u'Week_077/712689_en.mp3' in the archive
!340 Thu Jul 12 13:57:59 2012: CDR712691 u'inversion' (English) row 121 in Week_77.zip: There is no item named u'Week_077/712691_en.mp3' in the archive
!340 Thu Jul 12 13:57:59 2012: CDR717445 u'epimutation' (English) row 122 in Week_77.zip: There is no item named u'Week_077/717445_en.mp3' in the archive
!340 Thu Jul 12 13:57:59 2012: CDR717446 u'epigenetic alteration' (English) row 123 in Week_77.zip: There is no item named u'Week_077/717446_en.mp3' in the archive
!340 Thu Jul 12 13:57:59 2012: CDR728326 u'childhood cancer' (English) row 124 in Week_77.zip: There is no item named u'Week_077/728326_en.mp3' in the archive
!340 Thu Jul 12 13:57:59 2012: CDR728326 u'c\xe1ncer infantil' (Spanish) row 125 in Week_77.zip: There is no item named u'Week_077/728326_es.mp3' in the archive

Comment entered 2012-07-12 16:17:19 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-07-12 16:17:19
BZCOMMENTOR::Volker Englisch
BZCOMMENT::237

The latest batch of 125 audio files has been loaded to Cancer.gov. These audio
files will be available after the next weekly publishing job.

Comment entered 2012-07-12 17:14:31 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-07-12 17:14:31
BZCOMMENTOR::Bob Kline
BZCOMMENT::238

Comment entered 2012-07-12 17:14:31 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926-20120712.xls has been added with description: Next set

Comment entered 2012-07-19 11:49:40 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-07-19 11:49:40
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::239

(In reply to comment #238)
> Created attachment 2266 [details]
> Next set

Thanks! This has been sent to Vanessa.

Comment entered 2012-08-02 10:03:46 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-08-02 10:03:46
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::240

The following files are ready to be uploaded unto Bach:
Week_081.zip
Week_081_Rev1.zip

Comment entered 2012-08-03 10:10:20 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-08-03 10:10:20
BZCOMMENTOR::Volker Englisch
BZCOMMENT::241

Bob, you may load the latest audio batch (Week 81).

Comment entered 2012-08-03 15:19:29 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-08-03 15:19:29
BZCOMMENTOR::Bob Kline
BZCOMMENT::242

(In reply to comment #241)

> Bob, you may load the latest audio batch (Week 81).

Done.

2012-08-03 15:15:39: CDR736216: unable to find name node for u'carcinoma de piel no melanoma del p\xe1rpado in situ en estadio 0' in CDR736216
2012-08-03 15:15:40: CDR736217: unable to find name node for u'carcinoma de piel in situ no melanoma en el p\xe1rpado ' in CDR736217
2012-08-03 15:15:41: CDR736219: unable to find name node for u'c\xe1ncer de piel no melanoma del p\xe1rpado en estadio I' in CDR736219

Comment entered 2012-08-03 16:10:05 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-08-03 16:10:05
BZCOMMENTOR::Volker Englisch
BZCOMMENT::243

(In reply to comment #240)
> The following files are ready to be uploaded unto Bach:
> Week_081.zip

I have a minor suggestion:
If we're continuing using this process I think it would be better to use consecutive numbers for the weekly files, i.e. Week_081, Week_082, Week_083, ... instead of skipping numbers. The "Week numbers" aren't representing actual calendar weeks anymore and I believe it's less confusing when looking at the loaded batch comments.
If someone were to look at comments like 'Loaded weekly data for Week 77' followed by 'Loaded weekly data for Week 81' his/her immediate thought could be: "What happened to Weeks 78-80?"

We may want to suggest this to Vanessa.

Comment entered 2012-08-03 17:12:57 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-08-03 17:12:57
BZCOMMENTOR::Volker Englisch
BZCOMMENT::244

The latest batch of 81 audio files has been loaded to Cancer.gov. These audio
files will be available after the next weekly publishing job.

Comment entered 2012-09-04 13:25:12 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-09-04 13:25:12
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::245

Please generate a new spreadsheet to be sent to Vanessa.

Comment entered 2012-09-04 18:41:53 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-09-04 18:41:53
BZCOMMENTOR::Bob Kline
BZCOMMENT::246

Comment entered 2012-09-04 18:41:53 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926-20120904.xls has been added with description: Next set

Comment entered 2012-09-06 10:11:41 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-09-06 10:11:41
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::247

(In reply to comment #246)
> Created attachment 2274 [details]
> Next set

Thanks! It has been sent to Vanessa.

Comment entered 2012-09-13 14:09:54 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-09-13 14:09:54
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::248

(In reply to comment #243)
> (In reply to comment #240)
> > The following files are ready to be uploaded unto Bach:
> > Week_081.zip
>
> I have a minor suggestion:
> If we're continuing using this process I think it would be better to use
> consecutive numbers for the weekly files, i.e. Week_081, Week_082, Week_083,
> ... instead of skipping numbers. The "Week numbers" aren't representing actual
> calendar weeks anymore and I believe it's less confusing when looking at the
> loaded batch comments.
> If someone were to look at comments like 'Loaded weekly data for Week 77'
> followed by 'Loaded weekly data for Week 81' his/her immediate thought could
> be: "What happened to Weeks 78-80?"
>
> We may want to suggest this to Vanessa.

Vanessa is going to be informed about this.

Comment entered 2012-09-13 14:10:58 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-09-13 14:10:58
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::249

The following file is ready to be uploaded to Bach:

Week_089.zip

Comment entered 2012-10-04 14:01:13 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-10-04 14:01:13
BZCOMMENTOR::Volker Englisch
BZCOMMENT::250

Bob, please go ahead and load these audio files.

Comment entered 2012-10-04 14:33:24 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-10-04 14:33:24
BZCOMMENTOR::Bob Kline
BZCOMMENT::251

(In reply to comment #250)
> Bob, please go ahead and load these audio files.

Done.

2012-10-04 14:30:02: CDR732554: unable to find name node for u'angiograf\xeda con resonancia magn\xe9tica' in CDR732554

Comment entered 2012-10-12 13:02:52 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-10-12 13:02:52
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::252

Please generate a new spreadsheet to be sent to Vanessa.

Comment entered 2012-10-12 14:34:14 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-10-12 14:34:14
BZCOMMENTOR::Bob Kline
BZCOMMENT::253

(In reply to comment #252)
> Please generate a new spreadsheet to be sent to Vanessa.

Done.

Comment entered 2012-10-12 14:34:14 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926-20121012.xls has been added with description: Requested report

Comment entered 2012-10-17 10:23:33 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-10-17 10:23:33
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::254

(In reply to comment #251)
> (In reply to comment #250)
> > Bob, please go ahead and load these audio files.
>
> Done.
>
> 2012-10-04 14:30:02: CDR732554: unable to find name node for u'angiograf\xeda
> con resonancia magn\xe9tica' in CDR732554

The error has been fixed.

Comment entered 2012-10-17 10:24:45 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-10-17 10:24:45
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::255

(In reply to comment #253)
> Created attachment 2282 [details]
> Requested report
>
> (In reply to comment #252)
> > Please generate a new spreadsheet to be sent to Vanessa.
>
> Done.

Thanks! The spreadsheet has been sent to Vanessa.

Comment entered 2012-10-24 16:52:12 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-10-24 16:52:12
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::256

The following files are ready to be uploaded to Bach.

Week_090.zip
Week_090_Rev1.zip

Comment entered 2012-10-25 18:10:33 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-10-25 18:10:33
BZCOMMENTOR::Volker Englisch
BZCOMMENT::257

Bob, if you have a few minutes could you please load these latest audio files anytime after 7pm tonight (tomorrow morning will work, too).

Comment entered 2012-10-26 09:34:27 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-10-26 09:34:27
BZCOMMENTOR::Bob Kline
BZCOMMENT::258

(In reply to comment #257)
> Bob, if you have a few minutes could you please load these latest audio files
> anytime after 7pm tonight (tomorrow morning will work, too).

Done.

!6228 Fri Oct 26 09:27:13 2012: CDR740897 u'Stivarga' (English) row 39 in Week_090.zip: There is no item named u'Week_090/740897_en.mp3' in the archive
!6228 Fri Oct 26 09:27:13 2012: CDR740897 u'Stivarga' (Spanish) row 40 in Week_090.zip: There is no item named u'Week_090/740897_es.mp3' in the archive
2012-10-26 09:27:41: CDR740563: unable to find name node for u'peso molecular' in CDR740563

Comment entered 2012-11-06 14:48:13 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-11-06 14:48:13
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::259

Please generate a new spreadsheet to be sent to Vanessa.

Comment entered 2012-11-06 16:28:25 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-11-06 16:28:25
BZCOMMENTOR::Bob Kline
BZCOMMENT::260

Comment entered 2012-11-06 16:28:25 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926-20121106.xls has been added with description: Next set

Comment entered 2012-11-13 14:53:56 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-11-13 14:53:56
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::261

(In reply to comment #258)
> (In reply to comment #257)
> > Bob, if you have a few minutes could you please load these latest audio files
> > anytime after 7pm tonight (tomorrow morning will work, too).
>
> Done.
>
> !6228 Fri Oct 26 09:27:13 2012: CDR740897 u'Stivarga' (English) row 39 in
> Week_090.zip: There is no item named u'Week_090/740897_en.mp3' in the archive
> !6228 Fri Oct 26 09:27:13 2012: CDR740897 u'Stivarga' (Spanish) row 40 in
> Week_090.zip: There is no item named u'Week_090/740897_es.mp3' in the archive
> 2012-10-26 09:27:41: CDR740563: unable to find name node for u'peso molecular'
> in CDR740563

The errors have been corrected

(In reply to comment #260)
> Created attachment 2283 [details]
> Next set

The spreadsheet has been sent to Vanessa. Thanks!

Comment entered 2012-11-15 11:22:01 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-11-15 11:22:01
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::262

These files are ready to be uploaded to Bach.
Week_091.zip
Week_091_Rev1.zip

Comment entered 2012-11-15 18:21:29 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-11-15 18:21:29
BZCOMMENTOR::Volker Englisch
BZCOMMENT::263

Please go ahead and load the new audio files.

Comment entered 2012-11-16 09:44:28 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-11-16 09:44:28
BZCOMMENTOR::Bob Kline
BZCOMMENT::264

(In reply to comment #263)
> Please go ahead and load the new audio files.

Done.

2012-11-16 09:42:29: CDR736792: unable to find name node for u'yodo I 123 metayodobenzilguanidina' in CDR736792
2012-11-16 09:42:29: CDR736792: unable to find name node for u'yodo I 123 metayodobenzilguanidina' in CDR736792
2012-11-16 09:42:42: CDR737998: unable to find name node for u'HER2 positivo' in CDR737998
2012-11-16 09:42:44: CDR737999: unable to find name node for u'receptor del factor de crecimiento epid\xe9rmico humano positivo 2' in CDR737999
2012-11-16 09:42:45: CDR738000: unable to find name node for u'c-erbB-2 positiva' in CDR738000
2012-11-16 09:42:45: CDR738000: unable to find name node for u'c-erbB-2 positiva' in CDR738000

Comment entered 2012-11-16 12:42:10 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-11-16 12:42:10
BZCOMMENTOR::Volker Englisch
BZCOMMENT::265

All new audio files have been published to Cancer.gov.

Comment entered 2012-12-06 11:37:02 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-12-06 11:37:02
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::266

(In reply to comment #264)
> (In reply to comment #263)
> > Please go ahead and load the new audio files.
>
> Done.
>
> 2012-11-16 09:42:29: CDR736792: unable to find name node for u'yodo I 123
> metayodobenzilguanidina' in CDR736792
> 2012-11-16 09:42:29: CDR736792: unable to find name node for u'yodo I 123
> metayodobenzilguanidina' in CDR736792
> 2012-11-16 09:42:42: CDR737998: unable to find name node for u'HER2 positivo'
> in CDR737998
> 2012-11-16 09:42:44: CDR737999: unable to find name node for u'receptor del
> factor de crecimiento epid\xe9rmico humano positivo 2' in CDR737999
> 2012-11-16 09:42:45: CDR738000: unable to find name node for u'c-erbB-2
> positiva' in CDR738000
> 2012-11-16 09:42:45: CDR738000: unable to find name node for u'c-erbB-2
> positiva' in CDR738000

The errors have all been fixed.

Comment entered 2012-12-12 15:24:48 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-12-12 15:24:48
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::267

Please generate a new spreadsheet to be sent to Vanessa.

Comment entered 2012-12-12 15:57:57 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2012-12-12 15:57:57
BZCOMMENTOR::Bob Kline
BZCOMMENT::268

Comment entered 2012-12-12 15:57:57 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926-20121212.xls has been added with description: Next set

Comment entered 2012-12-18 11:36:32 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2012-12-18 11:36:32
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::269

The following file is ready to be uploaded to Bach.

Week_092.zip

Comment entered 2012-12-19 14:58:03 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2012-12-19 14:58:03
BZCOMMENTOR::Volker Englisch
BZCOMMENT::270

It's OK to load the new audio files.

Comment entered 2013-01-17 14:06:56 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2013-01-17 14:06:56
BZCOMMENTOR::Bob Kline
BZCOMMENT::271

(In reply to comment #270)
> It's OK to load the new audio files.

Done; no errors.

Comment entered 2013-02-04 15:02:58 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2013-02-04 15:02:58
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::272

Please generate a new spreadsheet to be sent to Vanessa.

Comment entered 2013-02-04 15:13:35 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2013-02-04 15:13:35
BZCOMMENTOR::Bob Kline
BZCOMMENT::273

Comment entered 2013-02-04 15:13:35 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926-20130204.xls has been added with description: Next set

Comment entered 2013-02-11 09:25:47 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2013-02-11 09:25:47
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::274

(In reply to comment #273)
> Created attachment 2303 [details]
> Next set
Thanks!

The following files are ready to be uploaded to Bach:
Week_093.zip
Week_093_Rev1.zip

Comment entered 2013-02-12 11:00:58 by Englisch, Volker (NIH/NCI) [C]

BZDATETIME::2013-02-12 11:00:58
BZCOMMENTOR::Volker Englisch
BZCOMMENT::275

Bob, from now on please go ahead and load the audio files at any time that's convenient for you. Since the amount of new media files is very small (compared to our original batches) I am no longer running special media doc publishing jobs. These new audio files will be picked up with the next regular publishing job after they have been loaded.

Comment entered 2013-02-12 11:13:11 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2013-02-12 11:13:11
BZCOMMENTOR::Bob Kline
BZCOMMENT::276

(In reply to comment #274)

> The following files are ready to be uploaded to Bach:
> Week_093.zip
> Week_093_Rev1.zip

Done (no errors).

Comment entered 2013-04-02 11:58:52 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2013-04-02 11:58:52
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::277

Please generate a new spreadsheet to be sent to Vanessa.

Comment entered 2013-04-29 09:00:53 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2013-04-29 09:00:53
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::278

(In reply to comment #277)
> Please generate a new spreadsheet to be sent to Vanessa.

I know you're busy with a host of other things but if you have the chance this morning, would you please generate the spreadsheet for us?

Comment entered 2013-04-29 09:37:00 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2013-04-29 09:37:00
BZCOMMENTOR::Bob Kline
BZCOMMENT::279

Comment entered 2013-04-29 09:37:00 by Kline, Bob (NIH/NCI) [C]

Attachment Report4926-20130429.xls has been added with description: Next set for Vanessa

Comment entered 2013-05-08 17:22:22 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2013-05-08 17:22:22
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::280

(In reply to comment #279)
> Created attachment 2315 [details]
> Next set for Vanessa

Thanks. This has been sent to Vanessa.

Comment entered 2013-05-22 10:16:40 by Osei-Poku, William (NIH/NCI) [C]

BZDATETIME::2013-05-22 10:16:40
BZCOMMENTOR::William Osei-Poku
BZCOMMENT::281

The following files are ready to be uploaded to Bach:

Week_094.zip
Week_094_Rev1.zip
Week_094_Rev2.zip

Comment entered 2013-05-23 08:06:17 by Kline, Bob (NIH/NCI) [C]

BZDATETIME::2013-05-23 08:06:17
BZCOMMENTOR::Bob Kline
BZCOMMENT::282

(In reply to comment #281)
> The following files are ready to be uploaded to Bach:
>
> Week_094.zip
> Week_094_Rev1.zip
> Week_094_Rev2.zip

Done. No errors.

Comment entered 2013-07-10 11:17:37 by Osei-Poku, William (NIH/NCI) [C]

When you get the chance, please generate a new spreadsheet for us. We have some new terms to send to Vanessa.

*my first ever Jira comment. Exciting!!!! :-)*

Comment entered 2013-07-17 13:49:02 by Kline, Bob (NIH/NCI) [C]

Here is the latest set for Vanessa. First report generated on the production bastion host!

Comment entered 2013-08-08 15:54:07 by Osei-Poku, William (NIH/NCI) [C]

The following files are ready to be uploaded to the Production server:

Week_096.zip
Week_096_Rev1.zip

Comment entered 2013-08-08 15:55:53 by Osei-Poku, William (NIH/NCI) [C]

Please generate a new spreadsheet to be sent to Vanessa.

Comment entered 2013-08-09 16:38:10 by Kline, Bob (NIH/NCI) [C]

Next set for Vanessa.

Comment entered 2013-08-27 10:34:44 by Osei-Poku, William (NIH/NCI) [C]

The following files are ready to be uploaded to the Production server:

Week_097.zip
Week_097_Rev1.zip

Comment entered 2013-09-03 11:15:38 by Osei-Poku, William (NIH/NCI) [C]

Just a friendly reminder - The following files (posted in comments above) are ready for upload:

Week_096.zip
Week_096_Rev1.zip
Week_097.zip
Week_097_Rev1.zip

Comment entered 2013-09-13 12:00:55 by Kline, Bob (NIH/NCI) [C]

Media documents created and linked. This was our first run in the new environment, and required some modifications to the software, so it would be worth examining the results extra carefully this time. Here are the errors from the logs:

2013-09-13 11:52:28: CDR750211: unable to find name node for u'carcinoma lobular invasivo' in CDR750211
2013-09-13 11:52:29: CDR750212: unable to find name node for u'carcinoma lobular infiltrante' in CDR750212
2013-09-13 11:52:30: CDR750857: unable to find name node for u'linfoma de Hodgkin con predominio linfoc\xedtico' in CDR750857

Comment entered 2013-09-20 11:12:05 by Kline, Bob (NIH/NCI) [C]

Let's close this issue and create new ones for the individual requests for imports and for new work sheets. OK with you, William?

Comment entered 2013-09-20 12:09:53 by Osei-Poku, William (NIH/NCI) [C]

Yes, sounds good to me.

Comment entered 2013-09-20 13:43:44 by Kline, Bob (NIH/NCI) [C]

I'm closing this issue, which is pretty long in the tooth, and for which the original work was done quite a while ago. Using separate tickets for generating work sheets and for importing new audio files will make it less likely that such requests will get buried in the pile of old issues.

Attachments
File Name Posted User
4926.errors.txt 2011-11-28 10:54:03
4926-20110909.log 2011-09-09 08:41:00
A.xls 2012-02-07 16:59:07
A.xls 2012-02-07 09:48:24
A.xls 2011-03-15 09:52:37
B.xls 2011-03-15 09:53:12
Glossary Audio Pronunciations_V2.doc 2011-03-23 16:27:22
Glossary Audio Pronunciations.doc 2011-03-14 08:55:23
Gloss Terms without Pronuns - Acronyms and others_FINAL_4-23-12.xls 2012-04-23 11:45:26 Osei-Poku, William (NIH/NCI) [C]
Gloss Terms without Pronuns - Acronyms and others_FINAL1_4-24-12.xls 2012-04-24 18:53:44 Osei-Poku, William (NIH/NCI) [C]
Report4926.xls 2010-12-14 09:41:01
Report4926.xls 2010-12-07 14:44:52
Report4926-20120123.xls 2012-01-23 09:11:55
Report4926-20120611.xls 2012-06-11 13:36:26
Report4926-20120712.xls 2012-07-12 17:14:31
Report4926-20120904.xls 2012-09-04 18:41:53
Report4926-20121012.xls 2012-10-12 14:34:14
Report4926-20121106.xls 2012-11-06 16:28:25
Report4926-20121212.xls 2012-12-12 15:57:57
Report4926-20130204.xls 2013-02-04 15:13:35
Report4926-20130429.xls 2013-04-29 09:37:00
Report4926-20130717.xls 2013-07-17 13:49:02
Report4926-20130809.xls 2013-08-09 16:38:10
Request4926.err 2011-08-04 08:25:37
Request4926-20110519092239.log 2011-05-19 09:29:26
Request4926-201105201543-franck.log 2011-05-20 15:48:35
request-4926-round-3.xls 2012-04-30 14:20:03
request-4926-round-3.xls 2012-04-27 11:26:04
Request4926-testrun.log 2011-05-17 09:57:10
Test Spreadsheet 50 Terms for VR.XLS 2011-02-10 10:19:51 Osei-Poku, William (NIH/NCI) [C]

Elapsed: 0:00:00.001442