CDR Tickets

Issue Number 3806
Summary Problem with CTGov job
Created 2014-09-17 10:17:18
Issue Type Bug
Submitted By Osei-Poku, William (NIH/NCI) [C]
Assigned To Englisch, Volker (NIH/NCI) [C]
Status Closed
Resolved 2014-09-25 10:53:36
Resolution Fixed
Path /home/bkline/backups/jira/ocecdr/issue.138111
Description

There appears to be a problem with the CTGov import or download job that is different from the previous failures we encounted. There is an unusual high number of out of scope trials in this set: from the first pass, 35 out of the 51 seems to be out of scope. Also, the trials that were marked to be imported yesterday did not get downloaded into the CDR. Here are some of them:
NCT02240524, NCT02240199, NCT02239679

Comment entered 2014-09-17 12:57:35 by Englisch, Volker (NIH/NCI) [C]

According to the log file it seems that the import process is still running. That may be the reason why some documents aren't available yet?

Bob, feel free to chime in. I don't know the process well enough to have an opinion on the in scope/out of scope issue.

Comment entered 2014-09-17 14:28:06 by Kline, Bob (NIH/NCI) [C]

According to the log file it seems that the import process is still running. That may be the reason why some documents aren't available yet?

That's right.

Comment entered 2014-09-22 12:12:04 by Englisch, Volker (NIH/NCI) [C]

William, is it OK to close this issue?

Comment entered 2014-09-22 12:35:35 by Osei-Poku, William (NIH/NCI) [C]

We still have not received the trials I mentioned in the original post. Somehow those trials were skipped and never downloaded into the CDR. The 3 in the original post were just examples.

Comment entered 2014-09-22 13:22:19 by Englisch, Volker (NIH/NCI) [C]

I see the following entries in the logs for NCT02240524:

Wed Sep 17 07:30:06 2014: Updated NCT02240524 with disposition import requested
Fri Sep 19 07:18:18 2014: Updated NCT02240524 with disposition import requested
Sat Sep 20 06:44:41 2014: Skipping NCT02240524 (already imported, unchanged at NLM)

Similar entries are logged for NCT02240199 and NCT02239679.

I can also find the document NCT02239679 but not NCT02240524.

Comment entered 2014-09-22 13:37:03 by Osei-Poku, William (NIH/NCI) [C]

So, it looks like many of them were imported but they just didn't show up on our processing report. Is it possible for you to provide me with the list of NCT IDs and their corresponding CDR IDs in that batch (for the same date) ?

Comment entered 2014-09-22 13:55:05 by Englisch, Volker (NIH/NCI) [C]

I'd have to check with Bob on how to best create such a report.

Comment entered 2014-09-22 14:45:51 by Kline, Bob (NIH/NCI) [C]

It's not clear what "in that batch (for the same date)" might mean (Volker's previous comment gives entries in the logs from three different dates) so I created a report for the import job corresponding to the most recent of those three dates. Go to https://cdr.cancer.gov/cgi-bin/cdr/CdrQueries.py and bring up the query named "CT.gov Import Job 3766." If you want IDs for a different batch, modify the job ID in the query. To see the recent import job IDs:

SELECT *
  FROM ctgov_import_job
 WHERE dt > '2014-09-01'
Comment entered 2014-09-25 10:53:14 by Osei-Poku, William (NIH/NCI) [C]

Thanks. This was helpful.

Comment entered 2014-09-25 11:00:42 by Osei-Poku, William (NIH/NCI) [C]

The CTGOV jobs have been running for more than a week without any problems.

Elapsed: 0:00:00.001620