Issue Number | 3806 |
---|---|
Summary | Problem with CTGov job |
Created | 2014-09-17 10:17:18 |
Issue Type | Bug |
Submitted By | Osei-Poku, William (NIH/NCI) [C] |
Assigned To | Englisch, Volker (NIH/NCI) [C] |
Status | Closed |
Resolved | 2014-09-25 10:53:36 |
Resolution | Fixed |
Path | /home/bkline/backups/jira/ocecdr/issue.138111 |
There appears to be a problem with the CTGov import or download job
that is different from the previous failures we encounted. There is an
unusual high number of out of scope trials in this set: from the first
pass, 35 out of the 51 seems to be out of scope. Also, the trials that
were marked to be imported yesterday did not get downloaded into the
CDR. Here are some of them:
NCT02240524, NCT02240199, NCT02239679
According to the log file it seems that the import process is still running. That may be the reason why some documents aren't available yet?
Bob, feel free to chime in. I don't know the process well enough to have an opinion on the in scope/out of scope issue.
According to the log file it seems that the import process is still running. That may be the reason why some documents aren't available yet?
That's right.
William, is it OK to close this issue?
We still have not received the trials I mentioned in the original post. Somehow those trials were skipped and never downloaded into the CDR. The 3 in the original post were just examples.
I see the following entries in the logs for NCT02240524:
17 07:30:06 2014: Updated NCT02240524 with disposition import requested
Wed Sep 19 07:18:18 2014: Updated NCT02240524 with disposition import requested
Fri Sep 20 06:44:41 2014: Skipping NCT02240524 (already imported, unchanged at NLM) Sat Sep
Similar entries are logged for NCT02240199 and NCT02239679.
I can also find the document NCT02239679 but not NCT02240524.
So, it looks like many of them were imported but they just didn't show up on our processing report. Is it possible for you to provide me with the list of NCT IDs and their corresponding CDR IDs in that batch (for the same date) ?
I'd have to check with Bob on how to best create such a report.
It's not clear what "in that batch (for the same date)" might mean (Volker's previous comment gives entries in the logs from three different dates) so I created a report for the import job corresponding to the most recent of those three dates. Go to https://cdr.cancer.gov/cgi-bin/cdr/CdrQueries.py and bring up the query named "CT.gov Import Job 3766." If you want IDs for a different batch, modify the job ID in the query. To see the recent import job IDs:
SELECT *
FROM ctgov_import_job
WHERE dt > '2014-09-01'
Thanks. This was helpful.
The CTGOV jobs have been running for more than a week without any problems.
Elapsed: 0:00:00.001620